Designing AI Agents you own this product

Principles, patterns, and best practices

Jia Huang

MEAP began May 2026
Last updated May 2026
Publication in Early 2027 (estimated)

ISBN 9781633433632
375 pages (estimated)

Included with a Manning Online subscription

printed in black & white

resources: Source code Book forum Source code on Github

table of content

Part 1 Foundations

1 The agent paradigm shift

1.1 The agent paradigm shift

1.1.1 What breaks and why

1.1.2 The three uncertainties of agent systems

1.2 From engineering ancestors to agent patterns

1.2.1 Every agent pattern has an ancestor

1.2.2 What makes agent patterns fundamentally different

1.3 Seven cognitive functions of agents

1.3.1 Argus reviews a pull request

1.3.2 The two-dimensional framework

1.3.3 The gardener’s mindset

1.4 Summary

2 Agent architecture and the two-axis map

2.1 The anatomy of an agent

2.1.1 From four modules to seven

2.1.2 The agent capability stack: Why seven, not four

2.1.3 The cognitive engine: The Perception-Reasoning-Action loop

2.1.4 Compound error: Why loops amplify mistakes

2.2 General agent architecture

2.2.1 Cognitive modules and their information flows

2.2.2 Runtime virtual machine

2.2.3 The external world boundary

2.2.4 How production systems instantiate the architecture

2.3 The two-dimensional map: Cognitive function × execution topology

2.3.1 The execution topology axis

2.3.2 Why you need both dimensions

2.3.3 The complete pattern map

2.3.4 Scope and limitations of the map

2.3.5 The pattern selection card

2.3.6 How to read the rest of this book

2.4 Single-agent vs. multi-agent: When and why to scale

2.4.1 The single-agent sweet spot

2.4.2 When multi-agent becomes necessary

2.4.3 What multi-agent costs you

2.4.4 The scaling spectrum

2.5 The new contract: Progressive trust and guardrails

2.5.1 The trust spectrum: Four levels

2.5.2 Earning and losing trust

2.5.3 Guardrails as architecture

2.6 Your first agent: The Argus skeleton

2.6.1 The PRA loop in 50 lines

2.6.2 Try it yourself

2.6.3 The same pattern, different frameworks

2.7 Summary

2.8 References

Part 2 The seven cognitive functions

3 Perception: What your agent sees determines what it does

3.1 The context window is not what you think

3.1.1 Attention is not uniform

3.1.2 The three-way squeeze

3.1.3 How production agents solve this

3.1.4 Measuring perception quality

3.1.5 The four patterns ahead

3.2 Pattern: Context Triage

3.2.1 The emergency room

3.2.2 The four tiers

3.2.3 Context triage in Claude Code

3.2.4 Building it

3.2.5 When it breaks

3.3 Pattern: Semantic Compaction

3.3.1 The moment the window fills

3.3.2 Three levels of compression

3.3.3 The rule you must not break

3.3.4 Building it

3.3.5 When it breaks

3.4 Pattern: Progressive Discovery

3.4.1 The detective problem

3.4.2 Forage, then focus, then deepen

3.4.3 The production spectrum

3.4.4 Building it

3.4.5 The progressive disclosure connection

3.4.6 When it breaks

3.5 Pattern: Multi-Modal Fusion

3.5.1 When text is not enough

3.5.2 The representation decision

3.5.3 Building it

3.5.4 When it breaks

3.5.5 Why the simplest pattern asks the hardest question

3.6 Putting it all together: The perception pipeline

3.6.1 Testing and observing perception

3.6.2 Claude Code’s perception cycle in action

3.7 Upgrading Argus: From blind reviewer to context-aware analyst

3.7.1 Before and after

3.8 Summary

3.9 References

4 Memory: What your agent remembers shapes what it becomes

4.1 What is memory? State, persistence, and knowledge across sessions

4.1.1 The memory hierarchy analogy

4.1.2 The memory lifecycle

4.1.3 How production agents handle memory today

4.1.4 Measuring memory quality

4.2 Pattern: Hierarchical Retention

4.2.1 Three tiers and the eviction question

4.2.2 In production: Claude Code’s six-tier hierarchy

4.2.3 Building it

4.2.4 Before and after: What eviction looks like

4.2.5 Argus integration

4.2.6 When it breaks

4.3 Pattern: RAG (Retrieval-Augmented Generation)

4.3.1 Three pipelines: Index, retrieve, generate

4.3.2 In production: Claude Code’s tool-based retrieval

4.3.3 Building it

4.3.4 Production vector database landscape

4.3.5 When it breaks

4.4 Pattern: Progress tracking

4.4.1 The checkpoint chain

4.4.2 In Production: Three approaches compared

4.4.3 Building it

4.4.4 When it breaks

4.5 Pattern: Failure journals

4.5.1 Fix vs. heuristic: Two levels of learning

4.5.2 In Production: Claude Code’s auto memory as implicit failure journal

4.5.3 Building it

4.5.4 The ExpeL evolution

4.5.5 When it breaks

4.6 Composing memory patterns

4.7 Multi-agent memory: Individual vs. shared

4.7.1 Shared memory: The blackboard pattern

4.7.2 The contamination risk

4.8 Argus checkpoint: What Argus can do now

4.9 Summary

4.10 References

5 Reasoning: How your agent decides what to do next

5.1 What is reasoning? How agents think, and how they decide how deeply to think

5.1.1 Two systems of thought

5.1.2 The reasoning landscape has shifted

5.1.3 Why architectural patterns still matter

5.1.4 The reasoning patterns at a glance

5.1.5 Testing and observing reasoning

5.2 Pattern: Chain-of-Thought

5.2.1 Thinking out loud: The step-by-step chain

5.2.2 In Production: Claude Code’s implicit chain-of-thought

5.2.3 Building it

5.2.4 Argus integration

5.2.5 When it breaks

5.3 Pattern: Complexity-Based Routing

5.3.1 Three tiers of reasoning depth

5.3.2 In production: The Planner-Worker economic split

5.3.3 Building it

5.3.4 Argus integration

5.3.5 When it breaks

5.4 Pattern: Parallel Exploration

5.4.1 Branching, scoring, and pruning

5.4.2 In Production: Claude Code’s implicit parallel exploration

5.4.3 Building it

5.4.4 When it breaks

5.5 Pattern: Iterative Hypothesis Testing

5.5.1 The hypothesis-experiment loop

5.5.2 In Production: How coding agents actually debug

5.5.3 Building it

5.5.4 Argus integration

5.5.5 When it breaks

5.6 Composing reasoning patterns: How agents think in layers

5.7 Multi-agent reasoning

5.8 Argus checkpoint: What the agent can do now

5.9 Summary

5.10 References

6 Action: How your agent acts is where strategy meets consequences

7 Reflection: How your agent critiques itself shapes what it improves

8 Collaboration: How your agents coordinate multiplies what they can build

9 Governance: What your agent is allowed to do defines what it can be trusted with

Part 3: Composition

10 Composing patterns: From framework to field

11 Patterns in the wild: Three end-to-end case studies

Appendixes

Appendix A: Pattern reference card

Overview

1 The agent paradigm shift

The text introduces agent design patterns as a necessary evolution in software architecture for systems where an AI model makes runtime decisions. Traditional software assumes deterministic control flow, structured inputs, explicit state, and predictable failures; agent systems instead operate through probabilistic perception, reasoning, action, and reflection loops. The central shift is that engineers no longer write every decision directly—they design constraints, budgets, tools, memory, and guardrails around a model whose behavior is uncertain.

The chapter explains why classical design patterns such as Singleton, Factory, Observer, and Strategy do not transfer cleanly to agents. These patterns assume that humans define structure and choices at design time, while agents make many of those choices dynamically at runtime. Agent systems must handle output uncertainty, behavioral uncertainty, and environmental uncertainty as normal operating conditions. The text also connects agent patterns to earlier engineering traditions, showing how distributed-systems ideas like circuit breakers, sagas, reconciliation loops, cache hierarchies, and bulkheads reappear in agent form as loop detection, plan execution, perception-action cycles, memory tiers, and sandboxing.

The book’s organizing framework is built around seven cognitive functions: perception, memory, reasoning, action, reflection, collaboration, and governance. These functions are illustrated through Argus, a code review agent that triages context, recalls prior failures, reasons about risk, uses tools, critiques its own output, delegates to specialist agents, and passes through approval and observability controls. The broader message is that successful agent engineering is “harness engineering”: the model spends context, compute, and action opportunities, while the surrounding harness budgets and constrains that spending. As a result, design, specification, and pattern composition become higher-leverage than implementation alone.

Deterministic pipeline vs. agent loop—fixed sequence on the top, model-driven cycle on the bottom.

How four GoF patterns break and what replaces them in agent systems.

The lineage of agent patterns, from GoF (1994) through distributed systems (2000s) to agents (2024+).

The seven cognitive functions for processing a pull request review.

The two creations in agent engineering: specification as mental creation, running agent as physical creation.

Summary

Agent architecture is bounded resource allocation under uncertainty — the model spends; the harness budgets. The 27 patterns and seven cognitive functions in this book are different facets of that one premise. Each chapter is one class of allocation strategy.
Agent systems introduce three fundamental uncertainties—output (same prompt, different responses), behavioral (agent selects its own strategy at runtime), and environmental (the world changes between observations)—that break the assumptions underlying classical design patterns. Treating uncertainty as the default operating condition, not an edge case, is the paradigm shift.
Every agent design pattern has an engineering ancestor. Cache hierarchies became tiered memory. Circuit breakers became loop detectors. Saga transactions became plan-and-execute workflows. The medium changed from deterministic code to probabilistic language models; the engineering problems are isomorphic. Experienced engineers already have the right intuitions. This book gives them the vocabulary to design, review, and communicate about agent architectures within their teams.
The seven cognitive functions (perception, memory, reasoning, action, reflection, collaboration, and governance) organize 27 named patterns across six execution topologies. This two-dimensional framework (what the agent does × how it does it) is the map for the rest of this book.
Design inversion: in agent systems, the specification is the source code and the agent is the compiler. Past the implementation plateau, improving code yields diminishing returns. The constraint on output quality is design, not implementation. The developer's role shifts from writing code to designing specifications.
Token leverage: the ratio of output quality to token cost is a first-class design criterion, because in agent engineering, good architecture is not just a quality decision; it is a financial one.

FAQ

What is the “agent paradigm shift” described in chapter 1?

The agent paradigm shift is the move from deterministic software, where developers control every decision at design time, to agent systems, where an AI model makes decisions at runtime. The developer’s role changes from writing all logic directly to constraining, budgeting, routing, and verifying the model’s decisions.

How do traditional software systems differ from agent systems?

Traditional software follows predetermined control flow, uses structured inputs, has explicit state, and usually produces the same output for the same input. Agent systems use runtime LLM decisions, accept unstructured inputs like natural language, maintain state across context windows, memory, and tools, and may produce different outputs for the same prompt.

What are agent design patterns?

Agent design patterns are reusable architectural solutions for building systems where an AI model makes decisions at runtime. They help engineers design reliable and maintainable systems around probabilistic models whose behavior can include hallucination, drift, misinterpretation, and looping.

Why don’t classical design patterns fully explain agent systems?

Classical patterns assume that a human architect makes structural decisions at design time. Agent systems require many decisions—such as which tool to use, what context to attend to, or when to stop—to be made dynamically by the system itself at runtime.

What does the chapter mean by “the model spends; the harness budgets”?

The model spends scarce resources such as tokens, context window space, compute, and action opportunities. The harness is the engineered environment around the model that budgets those resources by constraining, routing, validating, and verifying the agent’s behavior.

What are the three uncertainties of agent systems?

The three uncertainties are output uncertainty, behavioral uncertainty, and environmental uncertainty. Output uncertainty means the same prompt can produce different responses. Behavioral uncertainty means the agent’s runtime decisions cannot be fully predicted. Environmental uncertainty means the world the agent observes may change between actions.

How do GoF patterns such as Singleton, Factory, Observer, and Strategy break in agent systems?

Singleton breaks because agent state is not globally consistent but distributed across context and memory. Factory breaks because possible actions may not be known at compile time. Observer breaks because attention must be actively managed instead of synchronously receiving every event. Strategy breaks because the agent, not the human developer, selects strategies at runtime.

How are agent patterns related to distributed systems patterns?

Agent patterns inherit ideas from distributed systems patterns and adapt them to probabilistic language-model behavior. For example, circuit breakers become loop detection and halting, saga transactions become plan-and-execute workflows, cache hierarchies become tiered memory, and bulkhead isolation becomes sandboxing and minimal permissions.

What are the seven cognitive functions of agents introduced in the chapter?

The seven cognitive functions are perception, memory, reasoning, action, reflection, collaboration, and governance. Together they describe what an agent needs to do: attend to relevant input, retain useful context, think through problems, use tools, critique itself, coordinate with other agents or humans, and operate within safety and permission boundaries.

What is Argus, and how is it used in the book?

Argus is the code review agent built throughout the book. It starts as a simple perception-reasoning-action loop and gains one cognitive module per chapter, eventually becoming a complete governed agent capable of reviewing pull requests with memory, reasoning, tool use, reflection, collaboration, and governance.

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more

Introductory offer
Save 50% for a limited time!

eBook

pdf, ePub, online

$55.99 $27.99

you save $28.00 (50%)

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more

Introductory offer
Save 50% for a limited time!

eBook

$55.99 $27.99

you save $28.00 (50%)

Introductory offer
Save 50% for a limited time!

eBook

pdf, ePub, online

$55.99 $27.99

you save $28.00 (50%)

pro $24.99 per month

access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!
choose one free eBook per month to keep
exclusive 50% discount on all purchases
renews monthly, pause or cancel renewal anytime

lite $19.99 per month

access to all Manning books, including MEAPs!

team

5, 10 or 20 seats+ for your team - learn more