Context Engineering
Definition
Context engineering is the management of what the agent sees now, what it can recover later, and what must be written down so the future self does not improvise fiction. It includes prompt assembly, compaction, resets, progressive disclosure, and explicit handoff artifacts.
Failure modes
The Anthropic papers are useful because they name the pathologies plainly: one-shot overreach, premature completion, and context anxiety. OpenAI’s account adds another constraint: if important knowledge lives outside the repo, the agent effectively cannot see it.
Working techniques
- Progressive disclosure through
AGENTS.mdand linked documents. - Scoped instruction files such as
CLAUDE.md, plus auto memory that carries recoverable learnings forward without pretending the full transcript should stay loaded forever. - Durable instruction hierarchies so repo guidance, user preferences, and managed policy do not collapse into one mutable blob; see instruction-layering.
- Fresh-session resets when long transcripts become liabilities.
- Structured state files such as feature lists and progress logs.
- Tool-assisted re-orientation at session start: run the environment, inspect logs, confirm the app still works.
- Surface handoff and resume flows so work can move between terminal, app, IDE, or web without losing the durable artifacts.
- Bounded turns and explicit sprint contracts.
Why it matters
Many harness failures are context failures wearing other hats. A bad plan may be a memory problem; a false success may be an evaluation problem caused by insufficient state; a messy codebase may be a documentation visibility problem. An unattended job with weak bootstrap context is the same problem in slower motion, which is why context engineering also sits underneath automation-and-background-work.
Related pages
Read with instruction-layering, memory-persistence, harness-engineering, and claude-code. For operator-facing visibility into assembled context, provenance, and influence, also read context-assembly-visualization-for-harnesses and attention-and-attribution-views-for-llm-harnesses. It also supplies much of the explanatory vocabulary for harness-quality-comparison.