TLDR: Anthropic is championing ‘context engineering’ as a more effective method than traditional prompt engineering for managing AI agents. This new approach focuses on optimizing the entire information flow to an LLM, including system instructions, tools, and message history, to enhance efficiency and coherence in complex tasks.
AI research firm Anthropic is spearheading a shift in how large language models (LLMs) are managed, advocating for a new paradigm called ‘context engineering’ over the widely adopted ‘prompt engineering.’ This approach, detailed in recent publications, aims to improve the efficiency and coherence of AI agents, particularly in extended and complex operations.
Context engineering, as defined by Anthropic, involves the strategic curation and management of the complete set of tokens an LLM utilizes during inference. This encompasses not only the prompts but also system instructions, integrated tools, external data, and the entire message history. The core idea is to optimize the utility of these tokens against the inherent limitations of LLMs, such as their finite ‘attention budget’ and the risk of ‘context rot’—a degradation in performance as irrelevant information accumulates.
This methodology represents a natural evolution from prompt engineering, which primarily focuses on crafting effective instructions for LLMs. As AI agents become more sophisticated and operate over multiple turns and longer time horizons, managing the entire context state becomes paramount.
Anthropic reports significant gains from implementing context engineering strategies. Internal tests combining a ‘Memory Tool’ with ‘Context Editing’ reportedly boosted agent-based search performance by 39 percent. Context editing alone yielded a 29 percent improvement. Furthermore, in a 100-round web search, token consumption was reduced by an impressive 84 percent.
Key strategies for effective context engineering include:
Tuning System Prompts: Prompts should be specific enough to guide behavior but flexible enough to allow for broad heuristics, striking a ‘Goldilocks zone’ that provides clear guidance without micromanaging.
Minimal Toolsets: Designing tools with minimal functional overlap and maximizing token efficiency is crucial. If an AI cannot discern which tool to use, its effectiveness diminishes.
‘Just-in-Time’ Data Strategies: Instead of preloading all potentially relevant information, agents store lightweight identifiers (like file paths or database query IDs) and fetch data only when needed. Anthropic’s Claude Code, for instance, analyzes complex data by loading only necessary components, keeping the context window lean.
Compaction: For long-horizon tasks, models are instructed to periodically summarize key information from conversation history or task states. This distilled essence is then used to seed a fresh context window, discarding verbose outputs while retaining critical details.
Structured Note-Taking: Creating an external memory for the agent allows it to maintain coherence over extended interactions.
Sub-Agent Architectures: Complex problems can be divided among specialized sub-agents, each managing its own context more efficiently.
The concept of ‘context engineering’ is not entirely new, with early mentions by prompt engineer Riley Goodside in 2023 and resurfacing in mid-2025 through discussions by figures like Shopify CEO Tobi Lütke and ex-OpenAI researcher Andrej Karpathy, who saw it as a more accurate descriptor for steering generative AI systems.
Also Read:
- Mastering AI Interaction: A Guide to Becoming a Prompt Engineer
- Slack Transforms into Open AI Platform with New Developer Tools and Conversational Data Integration
These new tools and methodologies are currently available in public beta on the Claude Developer Platform, with integrations extending to Amazon Bedrock and Google Cloud Vertex AI. Anthropic has also provided comprehensive documentation and a ‘cookbook’ for developers to implement these advanced techniques.


