From Chatbots to Autonomous Agents: The Next AI Frontier
We've all interacted with AI chatbots. You ask a question, it provides an answer. But what if your AI could not just answer, but act? What if it could browse the web, execute code, update a database, or orchestrate a multi-step workflow autonomously? This is the promise of AI agents—and building them is more accessible than you might think.
While large language models (LLMs) excel at reasoning and language, they lack the ability to take actions in the digital world. An AI agent bridges this gap by combining an LLM's reasoning with tools it can use. Think of it as giving ChatGPT hands and eyes. In this guide, we'll build a practical research agent from scratch using LangGraph, a powerful framework for creating stateful, multi-actor applications.
Why LangGraph? The Power of State Machines
Before we dive into code, let's understand the paradigm shift. Traditional chatbot interactions are stateless—each query is independent. Agents, however, maintain state and make decisions based on context. LangGraph models agent workflows as graphs, where nodes represent steps (LLM calls, tool executions) and edges define the flow based on conditions.
This graph-based approach is perfect for agents because:
- It handles cycles and loops naturally (an agent can decide to use a tool multiple times)
- It maintains memory across steps
- It enables complex branching logic
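Before introducing LangGraph's API, here is a framework-free sketch of the same idea (all names are illustrative): nodes are plain functions that update a state dict, and a router decides which node runs next, or when to stop.

```python
# A minimal, framework-free state machine: nodes update state,
# and a router chooses the next node (or ends the run).
def search(state):
    state["attempts"] += 1
    state["results"].append(f"result-{state['attempts']}")
    return state

def route(state):
    # Loop back to "search" until we have 2 results, then stop
    return "search" if len(state["results"]) < 2 else "end"

nodes = {"search": search}
state = {"attempts": 0, "results": []}
current = "search"
while current != "end":
    state = nodes[current](state)
    current = route(state)

print(state)  # {'attempts': 2, 'results': ['result-1', 'result-2']}
```

LangGraph gives you this loop-with-routing structure declaratively, plus state management, persistence, and streaming on top.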
Building Blocks: Tools, LLMs, and State
Let's set up our environment and install dependencies (you'll also need OPENAI_API_KEY and TAVILY_API_KEY set in your environment):

```bash
pip install langgraph langchain-openai langchain-community tavily-python
```
Now, let's create our core components. We'll build a research agent that can search the web and summarize findings.
```python
from typing import Annotated, List, TypedDict
import operator

from langchain_openai import ChatOpenAI
from langchain_community.tools.tavily_search import TavilySearchResults

# Initialize our LLM
llm = ChatOpenAI(model="gpt-4-turbo", temperature=0)

# Create a web search tool
search_tool = TavilySearchResults(max_results=3)

# Define our agent's state
class AgentState(TypedDict):
    question: str
    search_results: List[str]
    answer: str
    iterations: Annotated[int, operator.add]  # Updates are summed, not overwritten
```
The AgentState is crucial—it defines what information flows through our agent graph. The Annotated type with operator.add registers a reducer: whenever a node returns an iterations value, LangGraph adds it to the existing count instead of overwriting it. Fields without a reducer (like answer) are simply replaced by the latest update.
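To see what that reducer does, here's a small framework-free sketch of how reducer-based merging behaves (the merge function is illustrative, not LangGraph's internal code):

```python
import operator

def merge_update(state: dict, update: dict, reducers: dict) -> dict:
    """Apply a node's partial update: reduced keys combine, others overwrite."""
    merged = dict(state)
    for key, value in update.items():
        if key in reducers:
            merged[key] = reducers[key](merged[key], value)
        else:
            merged[key] = value
    return merged

reducers = {"iterations": operator.add}
state = {"question": "q", "answer": "", "iterations": 0}

state = merge_update(state, {"iterations": 1}, reducers)  # one search
state = merge_update(state, {"iterations": 1}, reducers)  # another search
print(state["iterations"])  # 2
```

This is why a node can return `{"iterations": 1}` and get a running count for free.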
Crafting the Agent Graph: Nodes and Edges
Now let's build our graph piece by piece. We'll create three primary nodes:
- Search Node: Uses the search tool to find information
- Analyze Node: Processes and evaluates the search results
- Generate Node: Produces the final answer
```python
from langgraph.graph import StateGraph, END
from langchain_core.messages import HumanMessage, SystemMessage

# Initialize our graph builder
workflow = StateGraph(AgentState)

# Node 1: Search the web
def search_node(state: AgentState):
    print(f"🔍 Searching for: {state['question']}")
    results = search_tool.invoke({"query": state['question']})
    # Count this search so the router can cap the number of retries
    return {"search_results": [r["content"] for r in results], "iterations": 1}

# Node 2: Analyze if we need more information
def analyze_node(state: AgentState):
    # Check if we have enough information
    messages = [
        SystemMessage(content="You are a research analyst. Determine if the search results sufficiently answer the question. Reply 'insufficient' if they do not."),
        HumanMessage(content=f"Question: {state['question']}\n\nSearch Results:\n{chr(10).join(state['search_results'])}")
    ]
    response = llm.invoke(messages)
    analysis = response.content.lower()
    # Simple decision logic: we temporarily store the routing decision in `answer`
    if "insufficient" in analysis or "need more" in analysis:
        return {"answer": "continue_searching"}
    else:
        return {"answer": "generate_final"}

# Node 3: Generate final answer
def generate_node(state: AgentState):
    messages = [
        SystemMessage(content="You are a research assistant. Provide a comprehensive answer based on the search results."),
        HumanMessage(content=f"Question: {state['question']}\n\nSearch Results:\n{chr(10).join(state['search_results'])}")
    ]
    response = llm.invoke(messages)
    return {"answer": response.content}

# Add nodes to our graph
workflow.add_node("search", search_node)
workflow.add_node("analyze", analyze_node)
workflow.add_node("generate", generate_node)
```

Note that search_node returns `{"iterations": 1}` on every run—thanks to the operator.add reducer, this gives us a running count of searches, which the router below uses to break out of the loop.
Connecting the Dots: Conditional Routing
The real magic happens in how we connect these nodes. We need conditional edges that let the agent decide its own path:
```python
# Set the entry point
workflow.set_entry_point("search")

# Define the connections
workflow.add_edge("search", "analyze")

# Conditional edge from analyze
def route_after_analyze(state: AgentState):
    if state["answer"] == "continue_searching" and state["iterations"] < 3:
        return "search"  # Go back and search again
    else:
        return "generate"  # Generate final answer

workflow.add_conditional_edges(
    "analyze",
    route_after_analyze,
    {
        "search": "search",
        "generate": "generate"
    }
)

workflow.add_edge("generate", END)

# Compile our agent
agent = workflow.compile()
```
Our agent now has autonomous decision-making! It can search, judge whether the results are sufficient, and repeat the search up to three times in total before generating a final answer.
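Because the routing function is pure, you can sanity-check it in isolation with hand-built state dicts before ever running the full graph:

```python
def route_after_analyze(state):
    # Same logic as the conditional edge in our graph
    if state["answer"] == "continue_searching" and state["iterations"] < 3:
        return "search"
    return "generate"

# Needs more info and under the retry cap → loop back
print(route_after_analyze({"answer": "continue_searching", "iterations": 1}))  # search
# Hit the retry cap → answer with what we have
print(route_after_analyze({"answer": "continue_searching", "iterations": 3}))  # generate
# Analyst says results are sufficient → answer
print(route_after_analyze({"answer": "generate_final", "iterations": 1}))  # generate
```

Unit-testing routing logic like this is cheap insurance against accidental infinite loops.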
Putting Our Agent to Work
Let's test our creation with a complex query:
```python
# Initialize state
initial_state = AgentState(
    question="What are the latest advancements in quantum machine learning as of 2024, and what practical applications are emerging?",
    search_results=[],
    answer="",
    iterations=0
)

# Run the agent
result = agent.invoke(initial_state)

print(f"🤖 Final Answer ({result['iterations']} iterations):")
print("=" * 50)
print(result['answer'])
```
The agent will autonomously:
- Search for quantum ML advancements
- Analyze if results are sufficient
- Potentially loop back and search again if the results fall short (refining the query between attempts is a natural extension)
- Generate a comprehensive answer
Advanced Pattern: Multi-Agent Collaboration
Single agents are powerful, but real-world tasks often require specialization. Let's extend our system with a reviewer agent that validates answers:
```python
class MultiAgentState(TypedDict):
    question: str
    search_results: List[str]
    draft_answer: str
    reviewed_answer: str
    feedback: str

# Create a drafting node. (Our earlier generate_node writes to `answer`,
# which doesn't exist in this state schema, so we define a variant that
# writes to `draft_answer` instead.)
def draft_node(state: MultiAgentState):
    messages = [
        SystemMessage(content="You are a research assistant. Provide a comprehensive answer based on the search results."),
        HumanMessage(content=f"Question: {state['question']}\n\nSearch Results:\n{chr(10).join(state['search_results'])}")
    ]
    response = llm.invoke(messages)
    return {"draft_answer": response.content}

# Create a reviewer node
def review_node(state: MultiAgentState):
    messages = [
        SystemMessage(content="You are a critical reviewer. Identify gaps, biases, or inaccuracies in the answer."),
        HumanMessage(content=f"Question: {state['question']}\n\nDraft Answer: {state['draft_answer']}")
    ]
    response = llm.invoke(messages)
    return {"feedback": response.content}

# Create a refinement node
def refine_node(state: MultiAgentState):
    messages = [
        SystemMessage(content="Improve the answer based on the reviewer's feedback."),
        HumanMessage(content=f"Original Answer: {state['draft_answer']}\n\nReviewer Feedback: {state['feedback']}")
    ]
    response = llm.invoke(messages)
    return {"reviewed_answer": response.content}

# Build multi-agent workflow
multi_workflow = StateGraph(MultiAgentState)
multi_workflow.add_node("generate", draft_node)
multi_workflow.add_node("review", review_node)
multi_workflow.add_node("refine", refine_node)

multi_workflow.set_entry_point("generate")
multi_workflow.add_edge("generate", "review")
multi_workflow.add_edge("review", "refine")
multi_workflow.add_edge("refine", END)

multi_agent = multi_workflow.compile()
```
This creates a pipeline where one agent generates content and another reviews it—mimicking real-world collaborative workflows.
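To see the data flow without making any LLM calls, here's a dry run of the same three-step pipeline with stubbed model responses (the stub functions stand in for llm.invoke):

```python
# Stubbed versions of the three pipeline stages: each reads from state
# and writes the field the next stage expects, just like the real nodes.
def stub_generate(state):
    return {**state, "draft_answer": f"Draft answer to: {state['question']}"}

def stub_review(state):
    return {**state, "feedback": "Add a concrete example."}

def stub_refine(state):
    return {**state, "reviewed_answer": state["draft_answer"] + " (revised: " + state["feedback"] + ")"}

state = {"question": "What is an AI agent?"}
for step in (stub_generate, stub_review, stub_refine):
    state = step(state)

print(state["reviewed_answer"])
# Draft answer to: What is an AI agent? (revised: Add a concrete example.)
```

Swapping stubs for real nodes is also a handy pattern for testing graph wiring cheaply in CI.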
Production Considerations and Best Practices
Building agents is exciting, but deploying them requires care:
- Add Validation: Always validate tool inputs/outputs
- Implement Timeouts: Prevent infinite loops
- Add Observability: Log decisions and tool usage
- Handle Errors Gracefully: Wrap tool and LLM calls in try/except so a single failure doesn't crash the whole run
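As a sketch of that last point, a node can catch tool failures and record them in state instead of letting the exception kill the graph (the flaky tool here is a stand-in for a real search call):

```python
def flaky_tool(query):
    # Stand-in for a real tool call that can fail
    raise TimeoutError("search backend timed out")

def safe_search_node(state):
    try:
        results = flaky_tool(state["question"])
        return {"search_results": results}
    except Exception as exc:
        # Record the failure so a downstream node can route around it
        return {"search_results": [], "error": f"search failed: {exc}"}

out = safe_search_node({"question": "What is LangGraph?"})
print(out["error"])  # search failed: search backend timed out
```

A conditional edge can then inspect the error field and decide whether to retry, fall back to another tool, or surface the failure to the user.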
Persistence is another production essential. Adding a checkpointer lets the agent keep conversation state across invocations:

```python
from langgraph.checkpoint.memory import MemorySaver

# Add persistence to our agent
agent_with_memory = workflow.compile(
    checkpointer=MemorySaver()
)

# Now we can run with a thread_id for continuity
config = {"configurable": {"thread_id": "user_123"}}

result1 = agent_with_memory.invoke(
    {"question": "What is LangGraph?"},
    config
)

# Later, continue the conversation on the same thread
result2 = agent_with_memory.invoke(
    {"question": "How does it compare to LangChain?"},
    config
)
```
Your Next Steps in Agent Development
We've built a functional research agent, but this is just the beginning. Consider enhancing it with:
- Specialized Tools: Add database queries, API calls, or code execution
- Human-in-the-Loop: Add nodes that pause for human approval on critical decisions
- Long-term Memory: Integrate vector databases for context across sessions
- Evaluation Metrics: Track success rates and tool usage patterns
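For the evaluation-metrics idea, even a simple counter wrapped around your node functions goes a long way (this is a hypothetical helper, not a LangGraph feature):

```python
from collections import Counter

class NodeMetrics:
    """Wrap node functions to count how often each one runs."""
    def __init__(self):
        self.calls = Counter()

    def track(self, name, fn):
        def wrapped(state):
            self.calls[name] += 1
            return fn(state)
        return wrapped

metrics = NodeMetrics()
# Register a wrapped node, e.g. workflow.add_node("search", metrics.track("search", search_node))
noop = metrics.track("search", lambda state: state)
for _ in range(3):
    noop({})

print(metrics.calls["search"])  # 3
```

In production you'd likely reach for a tracing tool instead, but counters like this make loop behavior and tool usage visible with almost no effort.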
The most exciting aspect of AI agents isn't just what they can do today, but how they're redefining human-computer interaction. Instead of giving precise commands, we're moving toward describing outcomes and letting the agent figure out the steps.
Ready to build your own? Start by forking the complete code from this gist and try adding a new tool. Share what you create—the agent ecosystem grows through community innovation.
Remember: The best agents aren't just smart—they're reliable, transparent, and know their limits. Build with those principles, and you'll create AI that doesn't just answer questions, but gets things done.
Want to dive deeper? Check out the LangGraph documentation for advanced patterns like streaming, persistence, and human-in-the-loop workflows. Have questions or improvements? Share your agent projects in the comments below!