Codex + Claude Workflows / Foundation

Claude Code Just Dropped Workflows (An Actual Game Changer)

This video explains Claude Code's new Workflows feature by showing how it moves sub-agent orchestration out of the bloated main chat window and into a deterministic workflow.js script, then demonstrates it live with the deep research skill (which burned 105 agents and 3 million tokens) and a self-generated 'Startup Forge' fan-out pipeline.

Mansel Scheffel16 minTranscript found

Quick learning frame

Read this before watching.

Coding-agent workflow is the loop of inspect, plan, edit, verify, summarize, and route the next task to the right tool.

New playlist item from Mansel Scheffel; queued for transcript-backed review, topic mapping, and a practical learning artifact.

Skill you build: Deciding when and how to use Claude Code Workflows versus skills, and reasoning about the agent count, token cost, and orchestration tradeoffs before running a workflow.

Watch for the shift from claim to mechanism. The learning value is the point where the transcript reveals a repeatable action, tool boundary, context move, review habit, or artifact.

Concept diagram

Where this video fits.

01Inspect

02Plan

03Edit

04Verify

05Review

06Route

Deep lesson

Turn this video into working knowledge.

4,071 cleaned transcript words reviewed across 1,142 timed caption segments.

Thesis

Claude Code Just Dropped Workflows (An Actual Game Changer) teaches a practical codex + claude workflows move: This video explains Claude Code's new Workflows feature by showing how it moves sub-agent orchestration out of the bloated main chat window and into a deterministic workflow.js script, then demonstrates it live with the deep research skill (which burned 105 agents and 3 million tokens) and a self-generated 'Startup Forge' fan-out pipeline.

The goal is not to remember the video. The goal is to extract the operating principle, tie it to timestamped evidence, test how far the claim transfers, and make something reusable.

0:59

Sub-agents offload context

“we're having in this main window. And that's one of the reasons that we would actually have sub agents. So again, if we look at our main claude code session over here, what we can do is we...”

A sub-agent is a fresh Claude Code session with its own isolated context window; it does the heavy 60k-token work and returns only the ~500-token answer, so the main session stays free of tool, MCP, and reasoning bloat instead of relying on the 1M ceiling. In your own Claude Code session, take a research task that normally fills your context and delegate it to a sub-agent, then compare how many tokens land back in the main window.

6:13

Script replaces orchestrator

“tool and took 17 seconds. We can see in the fetch phase all of the agents that are currently running, how many tokens they're using, what tools they're using, and how long they've been running for. While that...”

Workflows move the manager role out of Claude's context (which struggles holding intermediate state at scale) and into a workflow.js script that holds state in variables, runs deterministic loops, spawns agents as a separate runtime process, and uses a journal to enable pause/resume; capped at 16 concurrent agents but up to 1000 total per run. Trigger a workflow by saying the word 'workflow' to Claude, then open the generated workflow.js and locate where it stores state and defines its loop boundaries.

13:04

Cost vs determinism tradeoff

“claims times three independent verify agents each. Like I said, this is probably going to be your biggest one most of the time that you're running workflows. So, flipping back to our slides, we now know with a...”

The deterministic loop runs relentlessly until its goal is met, so deep research consumed ~75 of its 105 agents on adversarial three-vote fact-checking (25 claims x 3 verifiers) and 3M tokens; reserve workflows for tasks that fan out across many similar items, need deterministic loops, or need resumability, and use plain skills for everyday reliable work. Before running a workflow, set a budget guard in the script and predict which phase (likely verification) will consume the most agents, then check your prediction against the per-phase agent breakdown afterward.

01

Inspect

Start with this video's job: This video explains Claude Code's new Workflows feature by showing how it moves sub-agent orchestration out of the bloated main chat window and into a deterministic workflow.js script, then demonstrates it live with the deep research skill (which burned 105 agents and 3 million tokens) and a self-generated 'Startup Forge' fan-out pipeline. Treat "Inspect" as the outcome you are trying to make visible, not a topic label. Anchor it to 0:59, where the video says: “we're having in this main window. And that's one of the reasons that we would actually have sub agents. So again, if we look at our main claude code session over here, what we can do is we...”

02

Plan

Use "Plan" to locate the part of the codex + claude workflows workflow the video is demonstrating. Ask what changes in your real setup if this claim is true. Anchor it to 6:13, where the video says: “tool and took 17 seconds. We can see in the fetch phase all of the agents that are currently running, how many tokens they're using, what tools they're using, and how long they've been running for. While that...”

03

Edit

Turn "Edit" into the reusable artifact for this lesson: A routing matrix for when to use Codex, Claude, browser checks, or manual review. This is where watching becomes something you can inspect and reuse.

04

Verify

Use "Verify" as the application surface. Decide whether the idea touches a browser flow, a local file, a model choice, a source document, a UI, or a review step.

05

Review

Use "Review" to prove the lesson. The evidence should connect back to the video title, transcript anchors, and a concrete output, not a generic best-practice claim.

06

Route

Use "Route" to carry the idea forward: save the prompt, checklist, diagram, or operating rule that would make the next agent run better.

Example

Source-backed work packet

Convert the video into a scoped task that includes the transcript claim, target workflow, acceptance criteria, and proof. The output should be a routing matrix for when to use codex, claude, browser checks, or manual review..

Example

Claim vs. demo brief

Separate what the speaker claims, what the demo actually proves, and what still needs outside verification before you adopt the workflow.

Example

Teach-back module

Transform the lesson into a definition, a mechanism diagram, one misconception, one practice exercise, and a check-for-understanding question.

Do not learn it wrong

Treating the title as the lesson without checking what the transcript actually says.
Letting the prompt drift into generic advice that could apply to any video in the playlist.
Copying the tool setup without identifying the operating principle that transfers to your own stack.
Skipping the artifact, which means the learning never becomes operational or inspectable.

Transcript-derived moments

Use timestamps to study the actual video.

Problem frame

“we're having in this main window. And that's one of the reasons that we would actually have sub agents. So again, if we look at our main claude code session over here, what we can do is we...”

Working mechanism

“tool and took 17 seconds. We can see in the fetch phase all of the agents that are currently running, how many tokens they're using, what tools they're using, and how long they've been running for. While that...”

Transfer moment

“claims times three independent verify agents each. Like I said, this is probably going to be your biggest one most of the time that you're running workflows. So, flipping back to our slides, we now know with a...”

Quality check

Do not count this as learned until these are true.

01

State the transcript-backed claim in your own words: This video explains Claude Code's new Workflows feature by showing how it moves sub-agent orchestration out of the bloated main chat window and into a deterministic workflow.js script, then demonstrates it live with the deep research skill (which burned 105 agents and 3 million tokens) and a self-generated 'Startup Forge' fan-out pipeline.

02

Explain the practical stakes without hype: New playlist item from Mansel Scheffel; queued for transcript-backed review, topic mapping, and a practical learning artifact.

03

Map the idea onto the Inspect -> Plan -> Edit -> Verify -> Review -> Route sequence and name the weakest link.

04

Produce the artifact and include the evidence that proves it: A routing matrix for when to use Codex, Claude, browser checks, or manual review.

Put it into practice

Give this grounded prompt to Codex or Claude after watching.

You are helping me turn one specific YouTube video into real, durable learning.

Source video:
- Title: Claude Code Just Dropped Workflows (An Actual Game Changer)
- URL: https://www.youtube.com/watch?v=ua2YA7TiLEk
- Topic: Codex + Claude Workflows
- My current learning frame: Invoke a deep-research workflow on a specific question you actually care about, inspect the generated workflow.js to set a token budget guard and assign cheaper models (Haiku) to discovery phases, then run it and verify which phase used the most agents.
- Why this matters: New playlist item from Mansel Scheffel; queued for transcript-backed review, topic mapping, and a practical learning artifact.

Transcript anchors from this exact video:
- 0:59 / Evidence 1: "we're having in this main window. And that's one of the reasons that we would actually have sub agents. So again, if we look at our main claude code session over here, what we can do is we..."
- 3:53 / Evidence 2: "Currently, you can't do this in VS Code with the extension. Three quick things to note on this as well. There is no direct file system or shell access from the script. The agents can do that obviously."
- 6:13 / Evidence 3: "tool and took 17 seconds. We can see in the fetch phase all of the agents that are currently running, how many tokens they're using, what tools they're using, and how long they've been running for. While that..."
- 9:26 / Evidence 4: "line. So, I've just asked Claude to do that exactly right now because you can use different phases inside this script. So, some of them could run in Haiku, maybe for discovery if that's what you wanted to..."
- 11:25 / Evidence 5: "phase. Six haiku agents brainstorm name and tagline candidates. Model haiku. Then we have the critique and we have model sonnet. And for the synthesis, we have model opus. That top part over there was like it said..."
- 13:04 / Evidence 6: "claims times three independent verify agents each. Like I said, this is probably going to be your biggest one most of the time that you're running workflows. So, flipping back to our slides, we now know with a..."
- 15:07 / Evidence 7: "majority of my workflows. Again, in a business, you just want that determinism and that reliability. You want to make sure that Claude is doing the same thing every day, getting leads, replying to people, what whatever it..."

Your task:
1. Use the transcript anchors above as the primary source packet. If you add outside context, label it clearly as outside context and keep it secondary.
2. Create a source-check table with columns: timestamp, claim, what the demo proves, confidence, and what still needs verification.
3. Extract the actual teachable claims from the video. Do not invent claims that are not supported by the title, lesson frame, or transcript anchors.
4. Build a reusable learning artifact: A routing matrix for when to use Codex, Claude, browser checks, or manual review.
5. Include:
- a plain-English definition of the core idea
- a diagram or structured model using this sequence: Inspect -> Plan -> Edit -> Verify -> Review -> Route
- 3 concrete examples that apply the video idea to real agentic work
- 2 failure modes the video helps prevent
- a checklist I can use the next time I run Codex or Claude
- one practical exercise with a clear done signal
6. Add a "learning transfer" section: what changes in my workflow tomorrow if I actually learned this?
7. Add a "source check" section that cites which transcript anchor supports each major takeaway.

Quality bar:
- Make this specific to "Claude Code Just Dropped Workflows (An Actual Game Changer)", not a generic Codex + Claude Workflows essay.
- Prefer operational examples, failure modes, and reusable artifacts over broad definitions.
- Call out uncertainty instead of smoothing over weak evidence.
- If evidence is weak, say what transcript segment or timestamp needs review instead of guessing.
- Finish with a concise artifact I could paste into my learning app.

Misconceptions

What to stop believing.

One agent should do every task.

Different tools have different strengths. Routing is part of the workflow.

More context is always better.

Relevant context helps; stale context causes drift and cost.

Practice studio

Learning only counts when you make something.

01

Transcript evidence map

Separate what the video actually says from what you already believe about the topic.

3 source-backed takeaways with timestamps, confidence, and a transfer note.

02

One useful artifact

Apply the video to a real workflow and produce a routing matrix for when to use codex, claude, browser checks, or manual review..

A reusable artifact with a done signal and one verification step.

03

Teach-back card

Explain the lesson to someone who has not watched the video yet.

A 90-second explanation, one diagram, one example, and one misconception to avoid.

Recall check

Answer first, then reveal — without rewatching.

Why does delegating work to a sub-agent keep your main Claude Code session lean, in concrete token terms from the video?

What does a workflow change about who orchestrates the sub-agents, and what are the two concurrency limits given?

In the vitamin C deep-research run, which phase consumed the most agents and why, and what does that imply about when to use workflows?

Source shelf

Use the video as a doorway, then verify with primary sources.

ReadingOpenAI Codexopenai.com/codex/ReadingClaude Code Overviewdocs.anthropic.com/en/docs/claude-code/overview