AI Agents in Action, Second Edition

Intelligent workflows with LLMs, MCP, A2A, and more

Micheal Lanham

MEAP began November 2025
Last updated March 2026
Publication in Summer 2026 (estimated)

ISBN 9781633434530
325 pages (estimated)

Included with a Manning Online subscription

printed in black & white

available in Complex Chinese


table of contents

1 The rise of AI agents

1.1 Defining agents and agentic thinking

1.1.1 Understanding agent/assistant and LLM patterns

1.1.2 Thinking like agents

1.1.3 Agents act with tools

1.2 Introducing the Model Context Protocol (MCP)

1.3 Understanding the functional layers of an agent

1.3.1 The Agent Persona

1.3.2 Agent Actions & Tools

1.3.3 Agent Reasoning & Planning

1.3.4 Agent Knowledge & Memory

1.3.5 Agent Evaluation & Feedback

1.4 Advancing onto multi-agent systems

1.4.1 The agent-flow assembly line

1.4.2 Agent orchestrations (hub-and-spoke)

1.4.3 Agent collaboration (teams of agents)

1.5 Summary

2 Core components: Large Language Models, prompting, and agents

2.1 Understanding Large Language Models

2.1.1 LLMs: Probabilistic Token Machines

2.1.2 What is a token?

2.1.3 Tuning Temperature, Top P, and more

2.2 Controlling LLMs with prompt engineering (Agent Persona)

2.2.1 Applying core prompt techniques

2.2.2 Thinking like an LLM

2.2.3 Avoiding common prompt pitfalls

2.3 Building an agent with OpenAI Agents

2.3.1 Building a minimal agent

2.3.2 Setting the Agent Model and other parameters

2.3.3 Controlling inputs and typed outputs

2.3.4 Tracing agents

2.4 Enhancing agents through tool integration

2.4.1 Providing agents with tools

2.4.2 Tracing agentic tool use

2.5 Exercises

2.6 Summary

3 Actions with Model Context Protocol for AI agents

3.1 Understanding MCP fundamentals for agent development

3.1.1 The standardization problem MCP solves

3.1.2 MCP architecture: Clients, servers, and services

3.1.3 Core components: Tools, resources, and prompts

3.1.4 MCP deployment patterns for agents

3.1.5 MCP powers the functional agent layers

3.2 Getting started with MCP Servers

3.2.1 Coding up an MCP Server for Claude

3.2.2 Using the MCP inspector

3.2.3 Understanding MCP transport types

3.2.4 From desktop to agents: the key differences

3.3 Actioning MCP servers for Agents

3.3.1 Actioning local MCP servers over STDIO with agents

3.3.2 Actioning local MCP servers over SSE with agents

3.3.3 Connecting to the standard MCP servers

3.4 Building MCP servers for agents

3.4.1 Converting tools to an MCP server

3.4.2 Consuming MCP servers locally or remotely

3.5 Exercises

3.6 Summary

4 Architecting and building multi-agent systems

4.1 Architecting multi-agent systems

4.1.1 Decision-making for agent systems

4.1.2 Communicating with shared-memory, message-passing, and MCP

4.1.3 Channeling multi-agent coordination strategies

4.2 Balancing agents with agentic flows

4.2.1 Transforming agents to agent flows

4.2.2 Building an Agent-to-Agent flow

4.2.3 Agency and decision making in agent flows

4.3 Understanding handoffs in agent flows

4.3.1 Agent-to-agent flow with handoffs

4.3.2 Visualizing agent flows

4.3.3 Monitoring the handoff

4.4 Validating agent flows with guardrails

4.4.1 Implementing input and output guardrails

4.4.2 Using agents as guardrails

4.4.3 Adding guardrails to pass off agent flows

4.5 Exercises

4.6 Summary

5 Agent Reasoning and Planning

5.1 Understanding LLM Reasoning and Planning

5.1.1 Chain of Thought Reasoning

5.1.2 ReAct Paradigm (Reasoning + Acting + Observing)

5.1.3 Planning with LLMs

5.2 Instructing agents to reason and plan

5.2.1 Applying CoT to an Agent

5.2.2 Implementing ReAct with Agents

5.3 Advanced reasoning with agents

5.3.1 Tree of Thought

5.3.2 Reflexion

5.3.3 Selecting the right pattern for your agents

5.4 Utilizing the Sequential Thinking MCP Server

5.4.1 Unchaining the Sequential Thinking Server

5.4.2 Revisiting time travel problems with Sequential Thinking

5.4.3 Advanced reasoning with sequential thinking

5.5 Exercises

5.6 Summary

6 Working with memory and knowledge RAG for agents

6.1 Understanding retrieval in AI applications

6.1.1 The basics of retrieval augmented generation (RAG)

6.1.2 Delving into semantic search and document indexing

6.1.3 Applying vector similarity search

6.2 Vector databases and similarity search

6.2.1 Demystifying document embeddings

6.2.2 Querying document embeddings from Chroma

6.3 Building practical RAG knowledge agents

6.3.1 Everything begins with search and relevance

6.3.2 Building a vector search RAG agent

6.3.3 Building a hybrid search RAG agent

6.4 Adding memory to agents with MCP

6.4.1 Understanding memory form and agent function

6.4.2 Implementing a graph database for memory using MCP

6.4.3 Creating hybrid memory systems with MCP

6.4.4 Semantic augmented memory and applications to semantic, episodic, and procedural memory

6.4.5 Uncluttering memory with compression and forgetting

6.5 Exercises

6.6 Summary

7 Building robust agents with evaluation and feedback

7.1 Introducing agent evaluation and feedback

7.2 Implementing test-driven agent development

7.2.1 Exploring TDAD in practice

7.2.2 Coding and testing the RAG agent

7.2.3 Refactoring the agent

7.2.4 Extending evaluation with an agent evaluator

7.3 Employing grounding, critic, and evaluation agents

7.3.1 Reviewing the grounding agent

7.3.2 Grounding the RAG agent

7.3.3 Implementing grounding agents as guardrails

7.3.4 Understanding the role of rubrics in evaluation

7.3.5 Building a rubric critic agent

7.4 Phoenix for evaluation and feedback

7.4.1 Connecting to Phoenix

7.4.2 Adding metadata and session tracking

7.4.3 Experimenting with evaluators

7.4.4 Providing feedback with Annotations

7.5 Exercises

7.6 Summary

8 Deploying agents and agentic systems

8.1 Strategies for consuming agents

8.1.1 Embedding real-time voice agents into web applications

8.1.2 Hosting agents through an API

8.1.3 Consuming an agent web service in a web application

8.2 Dockerizing agent systems

8.2.1 Containerizing an agent microservice

8.2.2 Orchestrating agentic systems with Docker Compose

8.2.3 Externalizing local agent microservices

8.3 Considering advanced deployment strategies

8.3.1 Choosing a runtime: edge, API, or event-driven

8.3.2 The three “wires” of communication

8.3.3 Practical multi-agent topologies that adapt well

8.3.4 State, memory, and idempotency

8.3.5 Release engineering for agents (prompts, tools, models)

8.3.6 Observability matters

8.3.7 Reliability patterns: timeouts, fallbacks, and budgets

8.3.8 Cost control and model routing

8.4 Security, safety, and governance in production

8.4.1 A quick threat model for agentic systems

8.4.2 Identity and access—for people, services, and agents

8.4.3 Secrets and configuration management

8.4.4 Tool safety: sandboxing and egress control

8.4.5 Prompt-injection and data-exfiltration defenses

8.4.6 Safety and policy enforcement

8.5 Exercises

8.6 Summary

9 Engaging GPT Assistants

10 Exploring collaborative agent systems

11 Troubleshooting

Overview

4 Architecting and building multi-agent systems

This chapter explains how and why to evolve from single agents to multi-agent systems, emphasizing both the gains in capability and the tradeoffs in cost, latency, and predictability. It centers on three foundational architectures—flow (assembly-line), orchestration (hub-and-spoke), and collaboration (peer-to-peer)—and frames design choices around decision-making (command), control (who executes tools), and communication (how context is shared). A fourth “C,” coordination, shapes how agents progress through work—sequentially, in parallel, hierarchically, or via iterative refinement. The guidance is pragmatic: begin with the simplest effective pattern (typically a flow), then iterate, mixing and adapting as needs grow.

The chapter surveys communication strategies that regulate context exposure and cost, including point-to-point message passing, shared conversational memory, and protocol-based interactions where agents can be invoked like tools. It then maps coordination variants—sequential pipelines, parallel delegation, hierarchical manager–worker plans, and iterative debate/refinement—alongside additional tactics such as voting/ensembles, role-playing, conditional routing, and decentralized peer-to-peer networks. Throughout, it highlights the balance between giving agents enough agency to be effective and constraining them to reduce ambiguity, overhead, and error, encouraging designers to align patterns across a codebase while retaining the flexibility to combine them judiciously.

On the implementation side, the text shows how to transform an overloaded single agent into a specialized agent-to-agent flow to improve focus, determinism, and cost, then introduces explicit, code-level decision points to stabilize outcomes. It demonstrates built-in handoffs that let agents transfer control without manual wiring, ways to visualize and trace flows, and techniques to monitor what is passed between agents. Robustness is increased with guardrails that validate inputs, outputs, and inter-agent transfers—sometimes powered by agents themselves—plus retry loops to recover from failures. While orchestration can centralize planning and delegation, it adds complexity and brittleness; the recommended path is to keep flows simple, strongly type inputs/outputs, limit unnecessary context, add guardrails where appropriate, and graduate to more complex coordination only after establishing reliable flows.

The three well-established patterns for building multi-agent systems: the agent flow (assembly line), the orchestrator (hub-and-spoke), and collaboration (peer-to-peer).

Decision-making (command) and control in three more specialized multi-agent architectures: flow, orchestration, and manager.

Comparison of different agent communication patterns represented in a flow.

Various ways agents may coordinate execution sequentially or in parallel.

A single agent is transformed into a multi-agent flow. The original agent role is broken down into three distinct roles, each encapsulating a well-defined set of tools.

An agent-to-agent flow with a deterministic (coded) decision point added. Taking the decision away from the agents and making it deterministic keeps the workflow more consistent.

Three agent flow communication patterns, demonstrating how agents can pass messages to one another and, in turn, pass command and control from agent to agent.

There are two ways to visualize the agent flow: draw_graph and the Traces page, which can be found on the OpenAI Dashboard under Logs.

Guardrails can be used to validate and control input and output of an agent.

Traces page after executing the Orchestration agent shows X and Y.

Summary

Single-agent designs often hit scalability walls; converting a monolithic agent into an agentic flow (a chain of specialised agents) restores clarity, extensibility, and performance.
Agent-to-agent flows work like prompt-chaining with superpowers: each node can invoke tools and reason independently yet pass concise, typed outputs downstream to keep the context lean.
Insert deterministic decision points (code or schema checks) wherever the flow must repeat reliably; don’t rely on stochastic LLM judgment for pass/fail branches.
The OpenAI Agents SDK supports two hand-off styles: conversational (shared thread) and pass-off (explicit code routing). Choose conversational for speed and pass-off for fine-grained control.
Use the handoffs field of the agent plus clear instructional prompts to enable internal hand-offs that require zero extra orchestration code.
Visualise complex flows early using draw_graph(); the Dashboard Traces view then reveals hidden loops, tool chains, and latency hotspots.
Wrap risky transfers in guardrails—input/output validators that reject, retry, or correct data before it corrupts the flow; tripwires surface as explicit exceptions you can loop on.
Guardrails themselves can be LLM-powered agents, giving you natural-language policies without brittle regex or length checks—remember they, too, need schemas and tests.
Tool limits still matter in multi-agent worlds: every registered tool inflates every call; keep each agent’s tool list tight (< 10) and scoped to its role to avoid token bloat.
Flow, orchestration, and manager-worker are the three canonical decision patterns. Start with plain flows and graduate to orchestrators only when centralised delegation is truly required.
Choose one communication layer (shared memory, message passing, MCP, or emerging A2A protocol) per project; mixing channels multiplies debugging pain.
A production-ready agentic flow blends typed I/O, deterministic checkpoints, visual traces, scoped tool sets, and guardrails—yielding pipelines that can scale, recover, and evolve without surprise.
Agents may be coordinated using several strategies: Sequential Flow, Parallel Delegation, Hierarchical Coordination, Iterative Debate and Refinement, Voting / Best-of-N (Ensemble), Role-Playing Collaboration, Conditional Routing (Branching), and Peer-to-Peer Network.
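
As a small illustration of the tool-limit point in the summary, the sketch below audits a mapping of agent names to tool names and flags any agent at or above a limit. The function name `audit_tool_scopes`, the agent names, and the tool names are all hypothetical; only the threshold of 10 comes from the summary.

```python
def audit_tool_scopes(agent_tools: dict[str, list[str]], limit: int = 10) -> dict[str, int]:
    """Return {agent_name: tool_count} for every agent whose tool list
    has `limit` or more entries (the summary suggests staying under 10)."""
    return {
        name: len(tools)
        for name, tools in agent_tools.items()
        if len(tools) >= limit
    }


# Hypothetical registry: one tightly scoped agent, one overloaded agent.
registry = {
    "researcher": ["web_search", "fetch_page", "summarize"],
    "do_everything": [f"tool_{i}" for i in range(12)],
}
overloaded = audit_tool_scopes(registry)
```

A check like this could run in a unit test so that an agent's tool list cannot grow past its role unnoticed.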

FAQ

When should I move from a single agent to a multi‑agent flow?

Move when a single agent becomes overloaded with tools or long instructions, when specialization would improve quality, or when cost/latency rises due to excess context. Splitting into focused agents clarifies roles, reduces token overhead, and makes behavior easier to reason about. Start with the simplest possible flow and iterate.

What are the main multi‑agent architectures and how do they differ?

- Flow (assembly line): Decision‑making and control pass step‑by‑step; communication is point‑to‑point. Simple and predictable.
- Orchestrator (hub‑and‑spoke): Central agent decides and delegates; workers execute. Strong oversight, higher coordination cost.
- Collaboration (peer to peer): Agents share a channel, make decisions collectively, and coordinate without a strict leader. Flexible but harder to debug.

How do the “four Cs” (command, control, communication, and coordination) shape my design?

- Decision‑making (command): Who plans and chooses next steps.
- Control: Who can actually use tools and perform actions.
- Communication: How context moves (shared thread, selective pass‑off, protocol like MCP).
- Coordination: Execution topology (sequential, parallel, hierarchical, iterative, etc.).

Balancing these four determines predictability, cost, and speed.

Which agent communication pattern should I use?

- Shared conversation thread: All agents see the same context. Easiest, but higher token cost and potential distraction.
- Pass‑off/message passing: Call each agent with only the needed inputs. More control, more glue code.
- Protocol/tooling (e.g., MCP): Treat another agent/service as a tool. Tight scoping and clear contracts.

Choose the least permissive option that still solves your use case.
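
To make the difference in context exposure concrete, here is a minimal sketch that contrasts a shared thread with pass-off messaging. The agents are plain Python stand-ins (no LLM calls), and `make_agent`, `shared_thread`, and `pass_off` are hypothetical names chosen for illustration.

```python
from typing import Callable

Agent = Callable[[str], str]


def make_agent(name: str) -> Agent:
    """Stand-in agent that reports how many messages it was shown."""
    def agent(context: str) -> str:
        messages = context.count("\n") + 1
        return f"{name} saw {messages} message(s)"
    return agent


def shared_thread(agents: list[Agent], goal: str) -> list[str]:
    """Every agent receives the whole conversation so far."""
    thread = [goal]
    for agent in agents:
        thread.append(agent("\n".join(thread)))
    return thread


def pass_off(agents: list[Agent], goal: str) -> str:
    """Each agent receives only the previous agent's output."""
    message = goal
    for agent in agents:
        message = agent(message)
    return message


team = [make_agent("research"), make_agent("write"), make_agent("edit")]
thread = shared_thread(team, "Summarize MCP")
final = pass_off(team, "Summarize MCP")
```

In the shared thread the last agent is shown three messages, while under pass-off it is shown one. With real models that gap becomes token cost, which is why the narrower channel is usually the safer default.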

How do I refactor a single agent into a clear agent‑to‑agent flow?

Identify distinct roles and tools, create one agent per role, and pass only the minimal structured output to the next step. Constrain outputs with typed models to simplify handoffs. Keep prompts short, tool scopes small, and verify each step with simple tests before adding more agents.
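
The refactoring described above can be sketched with typed messages between stages. This is only an outline under stated assumptions: the three roles (research, write, edit), the dataclass names, and the stand-in agent bodies are hypothetical, and a real flow would replace each function body with an agent call.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ResearchNotes:
    """Minimal structured output handed from research to writing."""
    topic: str
    facts: tuple[str, ...]


@dataclass(frozen=True)
class Draft:
    """Minimal structured output handed from writing to editing."""
    topic: str
    body: str


def research_agent(topic: str) -> ResearchNotes:
    # Stand-in for an agent that only owns search tools.
    return ResearchNotes(topic=topic, facts=(f"{topic} fact A", f"{topic} fact B"))


def writer_agent(notes: ResearchNotes) -> Draft:
    # Stand-in for an agent that only turns notes into prose.
    return Draft(topic=notes.topic, body=" ".join(notes.facts))


def editor_agent(draft: Draft) -> Draft:
    # Stand-in for an agent that only polishes an existing draft.
    return Draft(topic=draft.topic, body=draft.body.strip() + ".")


def run_flow(topic: str) -> Draft:
    """One agent per role; each stage sees only the previous typed output."""
    return editor_agent(writer_agent(research_agent(topic)))
```

Because every boundary has a type, each stage can be tested on its own before another agent is added to the chain.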

What coordination strategies are available and when should I use them?

- Sequential pipeline: Ordered stages; great for linear workflows.
- Parallel delegation: Independent subtasks; faster when merging is simple.
- Hierarchical coordination: Orchestrator manages dependencies and mixtures of parallel/serial work.
- Iterative debate/refinement: Critic/gatekeeper improves quality on hard problems.
- Voting/Best‑of‑N: Ensembles for reliability; the extra runs are hard to justify when the task is simple.
- Role‑playing: Complementary perspectives to clarify and iterate.
- Conditional routing: Classify and send to experts.
- Peer‑to‑peer: Decentralized and robust, but complex.
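
Two of the strategies above, parallel delegation and voting, can be sketched with the standard library alone. The workers here are plain functions standing in for agents, and `parallel_delegate` and `majority_vote` are hypothetical helper names.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Sequence

Worker = Callable[[str], str]


def parallel_delegate(workers: Sequence[Worker], task: str) -> list[str]:
    """Run every worker on the same task concurrently.
    Results come back in the same order as `workers`."""
    with ThreadPoolExecutor(max_workers=len(workers)) as pool:
        return list(pool.map(lambda worker: worker(task), workers))


def majority_vote(answers: Sequence[str]) -> str:
    """Best-of-N by simple majority: return the most frequent answer."""
    winner, _count = Counter(answers).most_common(1)[0]
    return winner


# Hypothetical ensemble: three stand-in agents, two of which agree.
ensemble = [
    lambda task: "approve",
    lambda task: "reject",
    lambda task: "approve",
]
answers = parallel_delegate(ensemble, "Review pull request")
decision = majority_vote(answers)
```

The merge step is what makes parallel delegation cheap or expensive. A vote over short labels is trivial to merge, while combining long free-text answers usually needs another agent.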

How can I make flows more deterministic and reliable?

Externalize key decisions into code (branching on typed outputs), use strict input/output schemas, and limit shared context. Add retries with bounded attempts and fallbacks. Where appropriate, insert guardrails to validate inputs/outputs and stop or correct bad states early.
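
A minimal sketch of that advice follows: the pass/fail branch lives in code and reads a typed verdict, retries are bounded, and a fallback is returned when the attempts run out. The names `Review`, `word_count_review`, and `run_with_checkpoint`, and the fallback string, are hypothetical.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass(frozen=True)
class Review:
    """Typed verdict that the code, not an LLM, branches on."""
    approved: bool
    feedback: str


def word_count_review(draft: str) -> Review:
    # Deterministic check standing in for a schema or rule validator.
    return Review(approved=len(draft.split()) >= 5, feedback="needs at least 5 words")


def run_with_checkpoint(
    produce: Callable[[str], str],
    review: Callable[[str], Review],
    goal: str,
    max_attempts: int = 3,
    fallback: str = "ESCALATE_TO_HUMAN",
) -> str:
    """Call `produce` until `review` approves or attempts are exhausted."""
    prompt = goal
    for _ in range(max_attempts):
        draft = produce(prompt)
        verdict = review(draft)
        if verdict.approved:  # the decision point is plain code
            return draft
        prompt = f"{goal}\nFix: {verdict.feedback}"  # feed the reason back
    return fallback
```

For example, a producer that only improves after seeing feedback succeeds on its second attempt, while one that never improves ends at the fallback instead of looping indefinitely.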

What are SDK handoffs and how do they compare to manual pass‑offs?

SDK handoffs let an agent transfer control to another without glue code by declaring downstream agents and mentioning handoffs in instructions. Benefits: simpler plumbing and a single conversation thread. Trade‑offs: agents must reference each other, and complex graphs can be harder to inspect or recover if the thread breaks. Manual pass‑offs give maximum control at the cost of more code.

How do I observe, debug, and understand agent‑to‑agent behavior?

- Visualize the graph (e.g., draw a flow diagram from agent definitions).
- Inspect traces in your dashboard to see tool calls, turns, and timing.
- Wrap handoffs with callbacks to log what data moved and why the transfer occurred.
- Constrain and log typed inputs/outputs at each boundary to spot drift quickly.
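
The callback idea can be sketched as a decorator that records what moved across a boundary and why, before the receiving agent runs. This does not use any SDK hook; `logged_handoff`, `handoff_log`, and `billing_agent` are hypothetical names, and the receiving agent is a plain function.

```python
import functools
from typing import Any, Callable

# In-memory log for the sketch; a real system would use a logger or tracer.
handoff_log: list[dict[str, Any]] = []


def logged_handoff(source: str, target: str, reason: str) -> Callable:
    """Wrap a receiving agent so every transfer is recorded first."""
    def decorator(fn: Callable[[dict[str, Any]], Any]) -> Callable[[dict[str, Any]], Any]:
        @functools.wraps(fn)
        def wrapper(payload: dict[str, Any]) -> Any:
            handoff_log.append(
                {
                    "source": source,
                    "target": target,
                    "reason": reason,
                    "payload": dict(payload),  # copy so later mutation is not logged
                }
            )
            return fn(payload)
        return wrapper
    return decorator


@logged_handoff("triage", "billing", reason="invoice question")
def billing_agent(payload: dict[str, Any]) -> str:
    # Stand-in for the agent that receives control.
    return f"Billing handled ticket {payload['ticket_id']}"


result = billing_agent({"ticket_id": 42})
```

Logging the payload at the boundary, not inside the agent, keeps the record independent of whatever the receiving agent does with the data.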

What are guardrails and where should I apply them?

Guardrails validate and potentially block or correct inputs and outputs at agent boundaries. Use input guardrails to filter or reshape goals; use output guardrails to enforce quality (e.g., length, structure, safety). You can implement them with code or dedicated “guardrail agents.” Combine with retries, typed models, and selective context to keep flows robust and cost‑efficient.
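
As a rough sketch of code-based guardrails, the example below validates the goal on the way in and the answer on the way out, raising an explicit exception when a check trips so the caller can retry. It imitates the tripwire idea with plain Python and is not the SDK's own guardrail API; `GuardrailTripwire`, `guarded_run`, and the 20-word limit are assumptions.

```python
from typing import Callable, Optional


class GuardrailTripwire(Exception):
    """Raised when a guardrail rejects an input or an output."""


def input_guardrail(goal: str) -> str:
    """Reject empty goals; return the trimmed goal otherwise."""
    cleaned = goal.strip()
    if not cleaned:
        raise GuardrailTripwire("empty goal")
    return cleaned


def output_guardrail(text: str, max_words: int = 20) -> str:
    """Enforce a length limit on the agent's answer."""
    if len(text.split()) > max_words:
        raise GuardrailTripwire(f"output exceeds {max_words} words")
    return text


def guarded_run(agent: Callable[[str], str], goal: str, max_attempts: int = 2) -> str:
    """Check the input once, then retry the agent while the output trips."""
    goal = input_guardrail(goal)  # not retried: a bad input will not fix itself
    last_error: Optional[GuardrailTripwire] = None
    for _ in range(max_attempts):
        try:
            return output_guardrail(agent(goal))
        except GuardrailTripwire as err:
            last_error = err
            goal = f"{goal} (be concise: {err})"  # tell the agent what failed
    raise GuardrailTripwire(f"gave up after {max_attempts} attempts: {last_error}")
```

A guardrail agent could replace either check function, provided it returns a typed verdict that this loop can act on.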

