Payal Baggad for Techstuff Pvt Ltd
Building Long-Term Memory in Agentic AI: A Practical Guide 🚀

Modern AI agents need more than just conversation skills → they need memory. This guide explores how to build production-grade long-term memory systems using LangGraph, enabling AI agents to learn and improve from user feedback over time.

🧩 The Two-Layer Memory System

Agentic AI systems rely on two distinct memory layers:

✦ Short-Term Memory (Thread-Level): Works within a single conversation, tracking messages, files, and interactions. Think of it as the agent's "working memory" that gets cleared after each session.

✦ Long-Term Memory (Cross-Thread): Persists across multiple sessions, storing user preferences, decisions, and learned behaviors. This memory enables consistency and personalization over time.
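The split between the two layers can be sketched with plain dicts standing in for LangGraph's checkpointer (short-term) and store (long-term). All names below are illustrative, not LangGraph APIs:

```python
# Minimal sketch of the two memory layers: working memory is scoped to a
# thread, while long-term memory is keyed by user and persists across threads.

short_term = {}   # keyed by thread_id: cleared when a conversation ends
long_term = {}    # keyed by (user_id, category): persists across threads

def record_message(thread_id: str, message: str) -> None:
    """Append a message to the current conversation's working memory."""
    short_term.setdefault(thread_id, []).append(message)

def save_preference(user_id: str, category: str, rule: str) -> None:
    """Persist a learned preference so every future thread can use it."""
    long_term[(user_id, category)] = rule

record_message("thread-1", "Schedule a call for Friday")
save_preference("alice", "calendar", "Prefer 30-minute meetings")

# A new thread starts with empty working memory but full long-term memory.
assert "thread-2" not in short_term
assert long_term[("alice", "calendar")] == "Prefer 30-minute meetings"
```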

👉 LangGraph Memory Storage Options

LangGraph provides three storage implementations suited for different development stages:

  1. InMemoryStore: Perfect for notebooks and testing. Runs entirely in memory with no persistence → ideal for quick experiments.

  2. Local Development Store: Automatically saves data to your file system when using langgraph dev. Provides basic persistence without external databases.

  3. Production Store: Uses PostgreSQL with pgvector for scalable, reliable storage with built-in semantic search capabilities.
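All three stores share the same interface: values are dicts filed under a namespace tuple plus a key. Here is a dict-backed stand-in that mirrors the put/get semantics without requiring the langgraph package (the real class lives in langgraph.store.memory):

```python
# TinyStore mimics the namespace-tuple + key + dict-value interface of
# LangGraph's stores, so the access pattern can be seen in isolation.
from typing import Any, Optional

class TinyStore:
    """Dict-backed store mirroring the namespace/key/value interface."""
    def __init__(self) -> None:
        self._data: dict[tuple, dict[str, dict]] = {}

    def put(self, namespace: tuple, key: str, value: dict) -> None:
        self._data.setdefault(namespace, {})[key] = value

    def get(self, namespace: tuple, key: str) -> Optional[dict]:
        return self._data.get(namespace, {}).get(key)

store = TinyStore()
store.put(("alice", "triage"), "rules", {"ignore": ["newsletters"]})
assert store.get(("alice", "triage"), "rules") == {"ignore": ["newsletters"]}
assert store.get(("bob", "triage"), "rules") is None
```

Swapping TinyStore for InMemoryStore (testing) or the PostgreSQL-backed production store changes durability, not the access pattern.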

📩 Building an Email Assistant with Memory

Let's build a practical example: an AI email assistant that learns from your feedback.

🔹 Core Architecture Components
➥ Schemas: Define data structures using Pydantic models to ensure type safety and predictable outputs. The RouterSchema classifies emails as "ignore," "respond," or "notify."
➥ Prompts: Craft instructions for different agent roles → triage classification, email composition, and memory updates. Default preferences provide baseline behavior.
➥ Tools: Functions like write_email, schedule_meeting, and check_calendar_availability give the agent action capabilities.
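The article's RouterSchema is a Pydantic model; the stdlib dataclass below sketches the same idea (a classification constrained to the three labels), with field names that are assumptions:

```python
# Illustrative stand-in for a Pydantic RouterSchema: the classification
# field may only take one of the three triage labels.
from dataclasses import dataclass

ALLOWED_LABELS = {"ignore", "respond", "notify"}

@dataclass
class RouterSchema:
    reasoning: str       # why the email received this label
    classification: str  # one of "ignore" / "respond" / "notify"

    def __post_init__(self) -> None:
        if self.classification not in ALLOWED_LABELS:
            raise ValueError(f"invalid label: {self.classification}")

decision = RouterSchema(
    reasoning="Meeting request from a known colleague",
    classification="respond",
)
assert decision.classification == "respond"
```

Constraining the output schema this way is what makes the triage step predictable: the LLM must commit to one of three actions rather than free text.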


🔹 Memory Functions
Two critical functions manage the learning process:

get_memory(): Retrieves preferences from storage or initializes them with defaults on first run. This ensures the agent always has rules to follow.

update_memory(): Uses a specialized LLM to intelligently update stored knowledge based on user feedback. It follows strict rules: never overwrite the profile entirely; only add new information or update contradicted facts, preserving everything else.
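The two functions can be sketched as follows. The store is a plain dict and the "merge" is a simple append standing in for the memory-manager LLM call; all names are illustrative, not LangGraph APIs:

```python
# get_memory seeds defaults on first access; update_memory merges feedback
# into existing memory instead of overwriting it (a real implementation
# would delegate the merge to a 'memory manager' LLM).

DEFAULTS = {"calendar": "Default to 45-minute meetings."}
_store: dict[tuple, str] = {}

def get_memory(namespace: tuple, category: str) -> str:
    """Return stored preferences, initializing defaults on first run."""
    key = namespace + (category,)
    if key not in _store:
        _store[key] = DEFAULTS.get(category, "")
    return _store[key]

def update_memory(namespace: tuple, category: str, feedback: str) -> None:
    """Append a learned rule while preserving existing knowledge."""
    current = get_memory(namespace, category)
    _store[namespace + (category,)] = current + " " + feedback

assert get_memory(("alice",), "calendar") == "Default to 45-minute meetings."
update_memory(("alice",), "calendar", "User prefers 30-minute meetings.")
assert "30-minute" in get_memory(("alice",), "calendar")
assert "45-minute" in get_memory(("alice",), "calendar")  # nothing deleted
```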

🔹 The Learning Workflow
The system operates through a graph-based workflow:

  1. Triage Router: Classifies incoming emails using stored triage preferences
  2. Response Agent: Drafts responses following learned writing styles
  3. Human-in-the-Loop (HITL): Pauses for user review and captures feedback
  4. Memory Update: Processes feedback to refine future behavior
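The four nodes above can be sketched as plain functions wired in sequence; in LangGraph each would be registered as a node in a StateGraph. The node logic here is stubbed for illustration:

```python
# State flows through the four workflow nodes as a dict; each node reads
# what it needs and adds its own fields.

def triage_router(state: dict) -> dict:
    # Classify using stored triage preferences (stubbed as a keyword check).
    label = "respond" if "meeting" in state["email"].lower() else "notify"
    return {**state, "classification": label}

def response_agent(state: dict) -> dict:
    # Draft a reply following learned writing style (stubbed).
    return {**state, "draft": "Happy to meet — does 30 minutes work?"}

def human_review(state: dict) -> dict:
    # Pause for user review; here the feedback is injected directly.
    return {**state, "feedback": state.get("feedback", "accept")}

def memory_update(state: dict) -> dict:
    # Only edits/responses change memory; a plain 'accept' is a no-op.
    return {**state, "memory_changed": state["feedback"] != "accept"}

state = {"email": "Can we set up a meeting next week?"}
for node in (triage_router, response_agent, human_review, memory_update):
    state = node(state)
assert state["classification"] == "respond"
assert state["memory_changed"] is False  # user accepted without changes
```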

🎥 Capturing Feedback Effectively

The system learns from three types of user interactions:

● Accept: No memory update → simple approval provides no new information.
● Edit: Direct modifications create clear before/after comparisons. For example, changing a meeting duration from 45 to 30 minutes teaches the agent to prefer shorter meetings.
● Response: Natural language feedback like "make it less formal" gets generalized into reusable rules.
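Dispatching the three feedback types into memory actions might look like this; the function and payload field names are assumptions for illustration:

```python
# Each feedback type maps to a different memory action: accepts are no-ops,
# edits yield concrete before/after rules, responses store free-text
# feedback for later generalization.
from typing import Optional

def process_feedback(kind: str, payload: dict) -> Optional[str]:
    if kind == "accept":
        return None  # approval carries no new information
    if kind == "edit":
        # A before/after diff becomes a concrete, reusable rule.
        return (f"User changed {payload['field']} from "
                f"{payload['before']} to {payload['after']}.")
    if kind == "response":
        # Natural-language feedback is stored to be generalized into a rule.
        return f"User feedback: {payload['text']}"
    raise ValueError(f"unknown feedback kind: {kind}")

assert process_feedback("accept", {}) is None
rule = process_feedback("edit", {"field": "meeting duration",
                                 "before": "45 minutes",
                                 "after": "30 minutes"})
assert "30 minutes" in rule
```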

🌀 The Four-Step Learning Loop

Here's how feedback transforms into learned behavior:

Step 1 - Trigger: Learning only activates when you provide explicit feedback through edits or responses.
Step 2 - Manager Call: A dedicated "memory manager" LLM analyzes feedback using strict update instructions.
Step 3 - Surgical Update: The manager makes targeted changes, integrating new rules without deleting existing knowledge.
Step 4 - Injection: Updated preferences are fetched on the next run, automatically changing agent behavior.
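The "strict update instructions" in Step 2 amount to a carefully worded prompt for the memory-manager LLM. The wording below is an assumption, not the article's exact prompt:

```python
# A sketch of the surgical-update instruction handed to the memory-manager
# LLM, encoding the rules described above: add, don't overwrite.
MEMORY_MANAGER_PROMPT = """You maintain the user's stored preferences.
Rules:
1. Never rewrite the preference profile from scratch.
2. Add any new rules implied by the feedback.
3. Update a fact only if the feedback directly contradicts it.
4. Preserve everything else verbatim.

Current preferences:
{current}

New feedback:
{feedback}
"""

filled = MEMORY_MANAGER_PROMPT.format(
    current="Default to 45-minute meetings.",
    feedback="User changed meeting duration to 30 minutes.",
)
assert "45-minute" in filled and "30 minutes" in filled
```

The key design choice is that the manager sees both the current profile and the feedback, so it can make a targeted diff rather than a wholesale rewrite.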

⚖ Practical Example: Learning from Edits

When testing the email assistant:

First Run (Accept): The user approves all actions → memory remains unchanged.

Second Run (Edit): User changes the meeting duration from 45 to 30 minutes and shortens the email subject. The system learns:
➥ Calendar preference: "30-minute meetings are preferred."
➥ Response preference: "Use concise subject lines and brief confirmations."

These learned preferences persist and influence all future interactions.

🔑 Key Implementation Insights

➧ Memory Organization: Use namespaces (like tuples) to organize different types of preferences → triage rules, writing style, calendar settings.
➧ Controlled Updates: The memory manager LLM prevents corruption by following strict rules about preserving existing information.
➧ Contextual Learning: Different feedback types update different memory namespaces → meeting edits update calendar preferences, and email edits update response style.
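Namespaces are just tuples, so keeping one per preference category isolates updates from one another. The names below are illustrative:

```python
# One namespace tuple per preference category means a meeting edit can
# never clobber triage rules or writing style.
triage_ns   = ("email_assistant", "alice", "triage_preferences")
style_ns    = ("email_assistant", "alice", "response_preferences")
calendar_ns = ("email_assistant", "alice", "cal_preferences")

memory: dict[tuple, str] = {}

# A meeting-duration edit touches only the calendar namespace...
memory[calendar_ns] = "Prefer 30-minute meetings."
# ...while an email edit touches only the response-style namespace.
memory[style_ns] = "Use concise subject lines."

assert triage_ns not in memory          # triage rules untouched
assert "30-minute" in memory[calendar_ns]
```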

🔎 Why This Matters

This approach transforms generic AI agents into personalized assistants that:
● Adapt to individual preferences without manual reconfiguration
● Improve continuously through natural user interactions
● Maintain consistency across sessions and conversations
● Scale from prototypes to production systems

🚀 Getting Started

The complete implementation is available on GitHub (search for "langgraph-long-memory"). Start with InMemoryStore for local testing, then migrate to PostgreSQL for production deployment.

By implementing long-term memory, you're not just building an AI tool → you're creating an assistant that grows smarter with every interaction, truly understanding and adapting to its users' needs.
