When you build a production API feature with 10 tasks, you face a critical choice: execute all tasks in one long session, or break them into independent units with fresh context each time.
In a traditional approach, you might give Claude Code a complete specification and say "implement all 10 tasks." This seems efficient—one instruction, one session, done. However, there's a hidden risk: as the session progresses through tasks, the probability of specification drift increases. By task 10, important validation rules or patterns from the beginning might be forgotten.
This is both a probability problem and a technical limitation. Ten tasks mean ten opportunities to miss something: even if each task has a 95% chance of being perfect, the cumulative probability of at least one error across ten tasks is roughly 40%. Additionally, as the session progresses you may hit context window limits, forcing earlier information to be dropped. Statistical likelihood and accumulated conversation length compound each other, so by task 10 important validation rules or patterns from the beginning may well be lost.
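The arithmetic behind that claim is easy to check directly (the 95% per-task success rate is the illustrative figure from above, not a measured one):

```python
# Probability that at least one of n independent tasks contains an error,
# given a per-task probability p of being perfect.
def cumulative_error_probability(p: float, n: int) -> float:
    return 1 - p ** n

# With a 95% per-task success rate, ten tasks already carry
# a roughly 40% chance of at least one error in the feature.
print(round(cumulative_error_probability(0.95, 10), 3))  # → 0.401
```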
With agent orchestration, you reduce this accumulated context risk. Instead of one long session, a Main Agent coordinates the work. For each task, it delegates to a fresh Subagent that starts with a clean slate—the specification is reloaded, the constitution is reloaded, and there's no accumulated context from previous tasks. This doesn't eliminate all errors (agents can still misread specs or miss constraints), but it removes the specific problem of context accumulation degrading quality over multiple tasks.
The orchestration pattern has two roles:
Main Agent (Coordinator):
- Reads the task list from tasks.md
- For each task, invokes a specialized subagent
- Receives completion reports
- Waits for human approval
- Tracks progress
- Stops at phase boundaries for review
Subagent (Executor):
- Receives one specific task instruction
- Gets fresh context: CLAUDE.md auto-loaded, specification files you specify with @, task instructions
- Implements the task following test-first workflow
- Runs self-validation
- Reports structured results
Invoking Subagents
To delegate work to a subagent, we use a specific prompt syntax that tells Claude to use the Task tool:
For example:
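A hypothetical invocation might look like this (the task number and instruction wording are illustrative; `task-executor` and the spec path match the examples used later in this lesson):

```
Task(task-executor): Implement Task 3 from @specs/comments/specification.md,
following the test-first workflow defined in CLAUDE.md.
```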
This syntax matches how permissions are defined in Claude Code (e.g., Task(Explore)). It explicitly tells the Main Agent to delegate to the specific agent defined in .claude/agents/agent-name.md.
Here's what a typical orchestration flow looks like:
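Sketched as a sequence (task counts and names are illustrative):

```
Main Agent reads tasks.md
  ├─ Task 1 → Task(task-executor) → fresh context → completion report → human approval
  ├─ Task 2 → Task(task-executor) → fresh context → completion report → human approval
  ├─ ...
  └─ Task 5 → phase boundary → deeper review → git tag feature-phase-1-complete
```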
The key insight: each subagent starts fresh. There's no accumulated context decay — though agents can still make individual errors like misreading specifications or missing edge cases.
When you invoke a subagent with Task(agent-name): instruction, Claude Code automatically provides:
- CLAUDE.md: Your project constitution is auto-loaded. The subagent knows your coding standards, patterns, and conventions without you having to repeat them.
- Files you specify: Any file you reference with @ in your instruction is loaded into the subagent's context. For example: @specs/comments/specification.md or @src/models/task.py.
- Task instructions: The specific instruction you provide in the Task() command.
What subagents do NOT get:
- Previous agent outputs (unless you explicitly include them)
- Accumulated conversation history from the main session
- Files not explicitly referenced with @
This design is intentional—it prevents context pollution and ensures each task gets exactly the context it needs, no more, no less.
Before you can invoke a subagent, you need to define it. Agent definitions live in .claude/agents/ as markdown files with a special structure.
Every agent file starts with a YAML header that defines the agent's identity:
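A hypothetical header for a task-executing agent (the field values are illustrative; the fields themselves are explained below):

```yaml
---
name: task-executor
description: Executes one implementation task from a specification, test-first.
tools: Read, Write, Bash
model: sonnet
---
```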
Field meanings:
- name: Unique identifier used to invoke the agent (e.g., Task(task-executor)).
- description: When the agent should be used.
- tools: Capabilities the agent can access (e.g., Read, Write, Bash).
- model: AI model to use (sonnet, haiku, opus, or inherit).
After the YAML header, you write the system prompt — instructions telling the agent its role and process. For example:
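A condensed sketch of such a system prompt (the wording is illustrative, not a canonical template):

```markdown
You are a task executor. You receive exactly one task and implement it.

Process:
1. Read the referenced specification files before writing any code.
2. Write a failing test that captures the task's acceptance criteria.
3. Implement until the test passes.
4. Run the full test suite, coverage, and type checks.
5. Report results using the structured completion report format.
```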
When a subagent finishes its task, it doesn't just say "done." It provides a structured completion report that makes human review trivial:
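A hypothetical report, covering the items a reviewer needs (the numbers, paths, and commit message are illustrative):

```markdown
## Task 3 Complete

- Tests: 14 passed, 0 failed
- Coverage: 96% (target: 90%)
- Type check: clean (0 errors)
- Files changed: src/models/comment.py, tests/test_comment.py
- Suggested commit: feat(comments): add Comment model with validation
```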
This report tells you immediately:
- Did the tests pass?
- Is coverage adequate?
- Are types clean?
- What files changed?
- What commit message to use?
With this information in hand, your review takes about three minutes instead of an open-ended investigation into what happened.
Professional orchestration uses strategic checkpoints instead of reviewing every detail of every task:
Level 1 - Task Completion (3 minutes per task): After each agent completes, do a quick review:
- Read the completion report
- Spot-check one thing (e.g., test names make sense)
- Approve or request fix
Level 2 - Phase Checkpoint (10-15 minutes per phase): After completing a logical phase (e.g., "Foundation" with 5 tasks), stop for deeper validation:
- Run integration tests across all phase tasks
- Verify patterns are consistent
- Confirm phase goal is achieved
- Tag with git tag feature-phase-1-complete
Level 3 - Feature Complete (30-45 minutes): After all tasks are done, do comprehensive validation:
- Verify all acceptance criteria
- Security review
- Performance testing
- Documentation check
For a 10-task feature:
- Level 1: 10 × 3min = 30 minutes
- Level 2: 2 × 15min = 30 minutes
- Level 3: 45 minutes
- Total: ~1.75 hours of validation
This is predictable and efficient compared to ad-hoc review.
While preventing accumulated context decay is the primary benefit, orchestration provides additional advantages:
Modularity: Each task is an independent unit with clear inputs and outputs.
Testability: You can validate each task separately before moving to the next.
Maintainability: Clear boundaries between tasks make the codebase easier to understand.
Scalability: You can add more agents for parallel work (different features can run simultaneously).
Process Consistency: The same workflow applies to every task—no variation in execution steps.
Reproducibility: Given the same specification and tasks, the execution process is predictable (though individual agents may interpret requirements differently).
In this lesson, you learned:
- Context decay is a probability problem in long sessions executing many tasks
- Agent orchestration solves this with fresh subagents per task
- The Task(agent-name): syntax explicitly delegates work to defined agents
- Subagents automatically get CLAUDE.md, specified files, and task instructions
- Agent files use YAML frontmatter to define identity and capabilities
- Structured completion reports make validation efficient
- Three-level validation (task, phase, feature) provides systematic quality control
In the upcoming tasks, you'll experience this firsthand by executing the same feature both ways and comparing the results.
