Claude Code Dynamic Workflows: Agent Orchestration

Anthropic launched dynamic workflows in Claude Code on May 28, 2026, with Claude Opus 4.8. The idea: instead of a single agent processing a task linearly, Claude generates an orchestration plan and deploys dozens to hundreds of sub-agents in parallel within the same session, with adversarial agents assigned to verify and challenge results before they are accepted. This is a capability with no real precedent in mainstream AI coding tools. What it concretely changes for an engineering team, how the mechanism works, when it is useful, and when it is overkill: a pragmatic breakdown.

What dynamic workflows actually are

Before dynamic workflows, Claude Code operated as a single agent: it received an instruction, used tools (file read/write, command execution, API calls), progressed step by step, and returned to the user for validation or clarification. Useful, but structurally linear.

Dynamic workflows break that linearity. Claude no longer works alone, one step at a time: it first plans an orchestration architecture, then delegates sub-tasks to dozens or even hundreds of agents that run in parallel within the same session. Each agent has a precise scope, works independently of the others, and has its results submitted to adversarial verification agents before being integrated.

As Ken Takao, Lead Systems Engineer at Anthropic, put it: dynamic workflows fill the gap between launching a single sub-agent and building a full agent team. That gap was, until now, the primary bottleneck for handling truly large-scale engineering tasks.

Research preview

Dynamic workflows have been available in research preview since May 28, 2026, with Claude Opus 4.8. The feature is still stabilizing. For production-critical use, expect interface and behavior changes in the weeks ahead.

To understand how this differs from conventional workflow tooling such as what teams build in n8n or Make, see our article on workflow vs AI agent: when to use which, which draws the foundational distinction.

How the orchestration works in practice

A dynamic workflow always follows the same four-stage loop.

Stage 1: plan generation

Claude analyzes the requested task and produces an explicit orchestration plan: which sub-tasks, what execution order or parallelization, which dependencies exist between outputs. This plan is visible to the user before execution when manual activation is chosen.

Stage 2: parallel sub-agent deployment

Sub-agents are launched simultaneously within the same session. Each has a targeted context, tools matched to its sub-task, and a precise objective. Coordination happens outside the main conversation thread: the user is not flooded with intermediate exchanges. Progress is saved continuously, allowing the session to resume after an interruption without starting over.

Stage 3: adversarial verification

Independent agents receive the results and have an explicit mandate to refute or validate them: find errors, test edge cases, detect regressions. This is the red-teaming principle applied to agent orchestration. If a result is rejected, the loop restarts on the relevant sub-task.

Stage 4: convergence and delivery

The system iterates until all sub-tasks pass adversarial validation. Claude then delivers a consolidated result, with a progress report if the task was long. Tasks spanning several hours or days benefit from automatic resumption.

This mechanism mirrors what engineering teams structure manually through code reviews, CI/CD pipelines, and automated tests. The difference: the entire cycle runs inside a Claude Code session, with no additional tooling to configure.

For a deeper look at how multi-agent systems are structured more broadly, see our multi-agent orchestration comparison, which covers the main frameworks and their trade-offs.

How to enable and configure dynamic workflows

Two activation modes are available.

Explicit mode: you ask Claude directly to create a workflow for the task. For example: "Create a workflow to migrate all v1 API calls to v2 across the repository." Claude presents its plan and waits for your confirmation before launching agents. This is the recommended mode for getting familiar with the feature and keeping control over what gets executed.

Ultracode mode (effort xhigh): by activating the ultracode setting in Claude Code, you let Claude judge on its own when a task warrants a workflow. It can then autonomously decide to deploy parallel agents if a task exceeds a certain complexity threshold. Convenient for advanced users, but it increases token consumption in a non-deterministic way.

Governance rules

The first trigger of a dynamic workflow always requests user confirmation. Organization administrators can disable the feature at account level. Availability by plan: on by default for Max and Team; admin activation required for Enterprise.

Dynamic workflows work across the entire Claude Code ecosystem: CLI, Desktop application, VS Code extension, direct Claude API, Amazon Bedrock, Google Vertex AI, and Microsoft Foundry. No environment switch is needed regardless of which platform you already use.

Evaluating Claude Code for your engineering team?

We run a scoped test on your actual codebase and measure what dynamic workflows genuinely deliver before any commitment.

Request an audit

Enterprise use cases: what they are built for

Dynamic workflows are not an enhanced general-purpose AI assistant. They are designed for a specific class of tasks: high volume, parallelizable, with an error risk that justifies independent verification.

Large-scale code migrations

This is the most direct use case. Changing frameworks on a codebase of tens of thousands of lines, porting a service from one language version to another, updating hundreds of calls to a deprecated API: all of these tasks are repetitive, parallelizable by file or module, and critical (a migration error can go undetected for months).

With a workflow, Claude first maps dependencies, generates transformations in parallel, and submits each modified file to cross-validation before integrating it. Work that would take a team weeks of manual review can be scoped and executed in days.

Large-scale repository audits

Alessio Vallero, Senior Engineering Manager at Klarna, put it directly: dynamic workflows are very useful for discovery and review on large codebases, and help engineers move faster on maintenance and refactoring.

In practice: security audits across an entire repository (exposed secrets, potential injections, vulnerable dependencies), systematic bug hunting with cross-verification, dead-code detection on voluminous historical codebases. These are tasks where a single agent hits its context limits, and where a human team cannot work exhaustively without dedicated tooling.

For structured approaches to securing AI-adjacent workloads, see our article on EU AI Act compliance, which covers governance frameworks relevant to any engineering team deploying agentic systems.

Critical work requiring adversarial verification

Some tasks cannot settle for "probably correct." A language port going to production, a database architecture overhaul, a cryptographic library swap: errors carry serious consequences, and results must be verified independently, not just re-read by the same agent that produced them.

That is precisely the role of adversarial agents in a dynamic workflow: they have no access to the first agent's reasoning, they start from the output and attempt to break it. This is a form of separation of concerns applied to AI.

For the broader reasoning behind multi-agent design patterns, our article on AI agents vs chatbots covers the fundamentals useful for any engineering or product leader.

The Bun example: 750,000 lines of Rust in eleven days

The reference example published by Anthropic at launch is the migration of Bun (a high-performance JavaScript runtime) from Zig to Rust, carried out by Jarred Sumner using dynamic workflows.

The published figures:

Approximately 750,000 lines of Rust generated
99.8% compatibility with the existing test suite
First commit to merge: eleven days

The project used several successive parallel workflows, each with a precise objective:

A workflow mapping Rust lifetimes (the language's memory management model)
A workflow generating .rs files with independent reviewers on each module
A continuous correction loop on build errors and failing tests
An overnight optimization workflow launched during off-peak hours

Context

Bun is a high-profile open-source project, carried out by an engineer with deep expertise in both target languages. A comparable result on an internal enterprise codebase requires active human supervision, a solid pre-existing test suite, and rigorous scope definition. The tooling amplifies the productivity of a strong team; it does not replace domain expertise.

What this example mostly demonstrates: the value of dynamic workflows is not in the raw speed of code generation, but in the ability to maintain coherence and catch errors at the scale of the entire project, not file by file.

Real token cost: what to plan for

Anthropic is explicit on this point and it would be misleading to downplay it: dynamic workflows consume far more tokens than a standard session. Launching dozens to hundreds of agents in parallel means that many contexts initialized, that many reasoning threads started, and that many results exchanged between agents.

A few reference points for calibration:

A standard Claude Code session on a targeted refactoring task: a few thousand to a few tens of thousands of tokens.
A dynamic workflow on a mid-size repository audit: several million tokens depending on repository size and agent count.
A large-scale migration workflow running over several days: potentially tens of millions of tokens.

At Opus 4.8 pricing ($5 per million input tokens, $25 per million output tokens), a multi-million-token workflow amounts to roughly $100 to several hundred dollars depending on volume. Not negligible, but to be compared against the cost of one or more engineer-weeks on the same task.

Anthropic's recommendation is clear: start with well-scoped tasks and measure consumption before scaling up. The ultracode mode in particular can trigger unexpected workflows on tasks you considered simple.

For a fuller picture of how token costs fit into a complete AI project budget, see our guide on the cost of an internal AI assistant and our breakdown of RAG project total cost of ownership.

Managing AI costs for your organization?

We scope the architecture, size the token consumption, and give you a realistic estimate before you commit.

Get in touch

Justified or overkill: the decision grid

Dynamic workflows are not the right answer to every engineering problem. Here is the grid for deciding whether the feature fits your situation.

Situation	Dynamic workflow	Recommended alternative
Framework migration on 50,000+ lines	Justified	N/A
Security audit across an entire repository	Justified	N/A
Critical refactor requiring independent verification	Justified	N/A
Fixing a bug in a single file	Overkill	Standard Claude Code session
Internal AI assistant on your documents	Overkill	RAG with a suitable LLM
Automating a repetitive business process	Overkill	n8n / Make workflow with a simple agent
Large-scale document extraction and processing	Overkill	Batch processing pipeline with a lightweight model

The clearest signal: if your task can be broken down into identical, parallelizable sub-tasks at scale, and if an error in one sub-task carries serious consequences, dynamic workflows are in their zone of value. If the task is one-off, modest in size, or does not require adversarial verification, a standard session or a classic agent will be faster and cheaper.

For typical enterprise AI use cases (document assistant, data processing, process automation), RAG remains the most direct and controllable path. Our article on agentic RAG explains why retrieval-augmented architectures are often sufficient before reaching for full multi-agent orchestration.

If you are evaluating multiple model and architecture options before deciding, our guide to Mistral vs OpenAI vs Anthropic provides a practical comparison across the main providers. And for structured outputs that tie agent results together cleanly, see our article on structured outputs for LLMs in production.

Dynamic workflows sit within the broader trajectory of Opus 4.8, whose agentic capabilities (69.2% on SWE-Bench Pro) and multi-level effort architecture form the foundation. For the full model portrait, see Claude Opus 4.8 for enterprise, which covers benchmarks, fast mode, and effort levels in full.

Finally, if you want to understand how Model Context Protocol fits into these agentic setups, our Model Context Protocol guide covers how tool exposure and context boundaries work across agent sessions.

Frequently asked questions

Dynamic workflows are a research-preview feature launched on May 28, 2026, with Claude Opus 4.8 inside Claude Code. They allow Claude to generate an orchestration plan, spawn dozens to hundreds of sub-agents in parallel within a single session, verify results with independent adversarial agents tasked with refuting conclusions, and iterate until convergence. Coordination happens outside the main conversation thread and progress is saved continuously so a session can resume after interruption.

There are two ways to enable dynamic workflows: ask Claude explicitly to create a workflow for the target task, or activate the ultracode setting (effort xhigh) which lets Claude decide on its own when a workflow is warranted. The first trigger always requires user confirmation. Organization administrators can disable the feature. It is on by default for Max and Team plans; Enterprise plans require admin approval.

Dynamic workflows are available in the Claude Code CLI, Claude Code Desktop, the VS Code Claude Code extension, and directly via the Claude API. They also work on cloud platforms: Amazon Bedrock, Google Vertex AI, and Microsoft Foundry. On by default for Max and Team plans; Enterprise activation requires administrator approval.

Yes, substantially more. Launching dozens to hundreds of sub-agents in parallel multiplies token consumption compared to a linear session. Anthropic states this explicitly and recommends starting with well-scoped tasks before scaling up. Dynamic workflows are designed for work spanning hours or days across very large codebases or datasets. The extra cost is justified when time savings or error-risk reduction clearly outweigh the token bill.

The best-documented use cases are: large-scale code migrations (framework or language changes across hundreds of thousands of lines), repository audits (bug hunting, security audits, dead-code detection with independent verification), and critical refactors requiring multiple parallel attempts verified by adversarial agents. The reference example is the Bun rewrite from Zig to Rust: 750,000 lines, 99.8% test compatibility, first commit to merge in eleven days.

It depends on the organization's technical profile. A company with a significant codebase, a large technical-debt backlog, or an active migration project may find genuine value. For typical AI use cases (internal assistant, document processing, process automation), a single agent or a standard n8n/Make workflow is perfectly sufficient and far cheaper. Dynamic workflows are primarily aimed at engineering teams managing large repositories.

Claude Code Dynamic Workflows: Orchestrating Hundreds of AI Agents in Parallel

What dynamic workflows actually are

How the orchestration works in practice

Stage 1: plan generation

Stage 2: parallel sub-agent deployment

Stage 3: adversarial verification

Stage 4: convergence and delivery

How to enable and configure dynamic workflows

Enterprise use cases: what they are built for

Large-scale code migrations

Large-scale repository audits

Critical work requiring adversarial verification

The Bun example: 750,000 lines of Rust in eleven days

Real token cost: what to plan for

Justified or overkill: the decision grid

Frequently asked questions

Further reading

Related reading

Why 15% of Your JSON Prompts Fail (And How to Fix It in 2026)

Cash Flow Forecasting AI: A Practical Guide for SMBs

Computer Vision for Quality Inspection in Industry

Credit Risk Scoring with Machine Learning: A B2B Guide

Custom AI Model Cost: A Realistic Breakdown

Custom Model Training: Build vs Fine-tune vs API