Skip to main content
Find your task, copy the route. Every route links to a complete example. (Agents: this page + Snippets + the Workflow Schema is the full assembly kit.)
I want to…Runtime + methodAdd these blocksExample
Fix a bug / implement a feature in a repoanthropic-agent-sdk + sdktools, sandbox(edit), budgetMinimal single agent
Same, but survive outages/rate limits+ ordered fallback to openai-codex-sdk, gemini-antigravity-cliselection, retryResilient worker
Research a topic, then build, then reviewgemini-genai-sdkanthropic-agent-sdkopenai-codex-sdkpair_profilesPair recipe
Get an unbiased code reviewopenai-codex-sdk + cli, isolate_context: truetool_policy(read-only)Pair recipe
Gate completion on multi-vendor agreementany worker + review panelreview_consensusReview panel
Extract structured data (JSON)openai-responses-api + api or any SDK runtimestate_machine + output_schemaStructured Output
Cheap bulk classification/evalsbatch surfacesstage that submits, stage that collectsCost, Batch & Cache
Multi-stage pipeline with typed handoffsanystate_machineState machine
Hard problem — buy more reasoningAnthropic thinking / OpenAI reasoning_effort / Antigravity levelagent.thinkingThinking
Do a web task with no API (forms, dashboards)openai-operator + apibudget, approval-friendly policyComputer Use
Voice panel of expertsroster with voice/voice_namerosterVoice & Realtime
Keep data on-device / offlinelocal-llm + apibudgetLocal provider
Block specific tool calls deterministicallyClaude-native hooks (settings)tool_policy as backstopHooks
Plug in external SaaS toolsparticipant with mcp_server_configtoolsTools & MCP
Cut spend on repeated contextcache_policy, HeadroomCost, Batch & Cache

The 60-second mental model

  1. Workflow markdown is the only control plane. Everything an agent may do — runtime, model, tools, network, budget, review — is declared in one file. If a power isn’t declared, it doesn’t exist at runtime.
  2. Cadence is the governor. It validates your file, picks the concrete route, enforces budgets, and only advances stages on typed events.
  3. States are honest. Every capability is implemented, model-dependent, partial-compatible, policy-blocked, vendor-blocked, or unsupported — each provider page has a decision-rule legend.
  4. Test cheap first. Validate (think_workflow_md_validate), run with a small budget, read the session ledger, then scale.