zkaria gamal

Posted on Jun 9

Agent vs Multi-Agent Systems: A Practical Guide with LangGraph & LangChain

#agentaichallenge #agents #opensource #ai

Hey everyone! 👋
I've been deep in Agentic AI Engineering lately, and one of the most common questions I get is:

What's the real difference between a single Agent and a Multi-Agent system? And how do you actually build them in production?

Today I'll break it down clearly and show you exactly how I implemented both in my repo:

zkzkGamal/agentic-ai-engineering

1. Single Agent (including ReAct)

A Single Agent is one LLM-powered entity that can:

Reason
Call tools
Maintain memory
Loop until the task is done

ReAct Agent is a popular implementation pattern:

Reason → Act (tool call) → Observe (tool result) → Repeat

Pros: Simple, easy to debug, great for focused tasks.

Cons: One agent has to be good at everything → can become bloated, slow, or less specialized.

In my repo:

Chapter 4 teaches classic ReAct agents with LangGraph.
Chapter 5 uses a ReAct-style tool-calling agent inside the Execute node (via create_tool_calling_agent).

2. Multi-Agent System

A Multi-Agent system = multiple specialized agents (or nodes) working together like a team.

Instead of one giant agent, you break the work into roles:

Router / Supervisor
Researcher
Executor / Tool User
Critic / Reviewer
Summarizer
etc.

Benefits:

Better specialization
More efficient (cheap model for routing, powerful model only when needed)
Easier to maintain and scale
Clear separation of concerns

How My Repo Uses Both (Real Architecture)

The crown jewel is in Chapter 5: Multi-Node LangGraph Agent with MCP Tools.

Core Architecture (Single Graph, Multi-Node = Multi-Agent flavor)

Key Nodes (Specialized "Agents"):

Router Node (agent/nodes/router.py)
- Fast intent classification (Math? Email? Just chat?)
- Acts as a Supervisor
Execute Node (agent/nodes/execute.py)
- Runs a full ReAct tool-calling agent
- Connects to tools via MCP (Model Context Protocol) server (math + email tools)
Summarize Node
- Takes raw tool output (JSON, etc.) and makes it human-friendly
Conversation Node
- Lightweight chat fallback (avoids heavy tool path)

This is not a pure peer-to-peer multi-agent (like Researcher → Writer → Critic), but a hierarchical multi-node LangGraph — which is one of the most practical and production-friendly multi-agent patterns today.

You also see pure multi-agent collaboration examples in Chapter 4 (Researcher + Writer + Critic working together).

Why This Architecture Rocks

Separation of concerns → easier debugging and testing
Efficiency → cheap routing + targeted execution
Security → Tools run in isolated MCP server (not directly in the LLM agent)
Observability → Each node is a clear step you can log/monitor
Extensibility → Want to add a "Research" intent? Just add a new node and update the router.

This is built with LangGraph + LangChain (no heavy LlamaIndex or basic OpenAI-only examples).

Key Takeaways

Aspect	Single Agent (ReAct)	Multi-Node / Multi-Agent
Complexity	Simpler	More structured
Specialization	One agent does everything	Each node/agent has a clear role
Efficiency	Can be wasteful	Optimized routing & execution
Debugging	Easier at first	Better long-term traceability
Best For	Focused tasks	Complex, real-world workflows

Modern reality (2026): Most production agent systems are multi-node LangGraph setups that combine ReAct inside specialized nodes.

Want to see it in action?

→ Clone the repo:

git clone https://github.com/zkzkGamal/agentic-ai-engineering.git

Start with Chapter 4 for fundamentals, then run the full Chapter 5 system (LangGraph assistant + separate MCP tool server).

It includes:

Pytest + GitHub Actions CI
Local Ollama support
Memory, streaming, and more

Would love your feedback or contributions! ⭐

DEV Community