Most AI agents have the memory of a goldfish.
You train them, instruct them, spend an hour getting them dialed in, and then the session ends. Next time you open it, it's gone. You start from zero. No context about what it did yesterday, what failed last week, what the current priorities are.
This isn't a bug. It's how language models work by default. And it's the single biggest reason AI agent projects fail within the first few weeks. The agent can't build on its own work because it doesn't remember doing it.
Here's the architecture that actually solves this. I've been running it on my AI co-founder, Evo, for several months now. It's not complicated, but it requires being deliberate about structure.
Why "Just Put It in the Prompt" Doesn't Work
The first instinct is to solve this by cramming everything into the system prompt. Tell the agent what it did, who it is, what the priorities are, every single time. Some people even pipe in recent conversation logs.
This breaks fast. You hit context window limits. You pay for the same tokens over and over. And the agent still drifts, because a 4,000-word prompt injected at session start is not the same as actually remembering something.
Real memory means the agent has access to persistent, structured information that gets updated over time. Not a static prompt. A living set of files.
The Two-Layer Memory System
The architecture that works splits memory into two categories, short-term and long-term, the same way your brain does.
Short-term: daily log files
One flat file per day. Named by date. The agent writes to it as it works: what it did, what it decided, what failed, what's pending. Raw, operational, logged automatically.
These files expire. You don't need every day from six months ago. But yesterday matters. Last week sometimes matters. You keep them around for 30-90 days and then let them go.
The key is that the agent writes to these files itself. Not you. Every time it completes something meaningful, it appends a line. This is what makes it genuinely persistent: the agent creates its own memory trail.
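As a concrete sketch, here's what the daily-log layer can look like in Python. The `memory/daily` folder and the helper names are assumptions for illustration, not part of any specific platform; the retention window mirrors the 30-90 day range above.

```python
from datetime import date, datetime, timedelta
from pathlib import Path

LOG_DIR = Path("memory/daily")  # hypothetical location; adjust to your layout


def log_event(text: str) -> Path:
    """Append a timestamped bullet to today's YYYY-MM-DD.md log."""
    LOG_DIR.mkdir(parents=True, exist_ok=True)
    log_file = LOG_DIR / f"{date.today():%Y-%m-%d}.md"
    with log_file.open("a", encoding="utf-8") as f:
        f.write(f"- {datetime.now():%H:%M} {text}\n")
    return log_file


def prune_logs(keep_days: int = 90) -> None:
    """Delete daily logs older than the retention window."""
    cutoff = date.today() - timedelta(days=keep_days)
    for f in LOG_DIR.glob("*.md"):
        try:
            if date.fromisoformat(f.stem) < cutoff:
                f.unlink()
        except ValueError:
            pass  # ignore files that aren't date-named
```

The agent calls `log_event` after each meaningful action, and `prune_logs` can run on a schedule so the folder never grows unbounded.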
Long-term: a curated memory file
One file that lives forever. Call it MEMORY.md or whatever you want. This is where the lessons that matter get promoted.
Not everything belongs here. Not "posted tweet at 3pm" or "checked analytics." What belongs here: strategic decisions, lessons learned, things that changed how the system works, facts about the business that need to persist. The stuff that, if it disappeared, would cause the agent to make the same mistakes again.
The agent is responsible for promoting entries from daily logs to long-term memory. It does this during low-traffic moments, looking at what happened recently and deciding what's worth keeping. Over time, this file becomes a distilled record of the agent's understanding of the business.
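A minimal sketch of the promotion step, assuming the same hypothetical file layout. In practice the agent itself judges what's worth keeping; here a `LESSON:` tag stands in for that judgment so the mechanics are visible.

```python
from datetime import date, timedelta
from pathlib import Path

DAILY_DIR = Path("memory/daily")       # hypothetical paths, for illustration
MEMORY_FILE = Path("memory/MEMORY.md")


def promote_lessons(lookback_days: int = 7) -> int:
    """Copy lines tagged LESSON: from recent daily logs into long-term memory.

    The tag is a stand-in: in a real setup, the agent decides what to promote.
    """
    cutoff = date.today() - timedelta(days=lookback_days)
    existing = MEMORY_FILE.read_text(encoding="utf-8") if MEMORY_FILE.exists() else ""
    MEMORY_FILE.parent.mkdir(parents=True, exist_ok=True)
    promoted = 0
    with MEMORY_FILE.open("a", encoding="utf-8") as mem:
        for log in sorted(DAILY_DIR.glob("*.md")):
            try:
                if date.fromisoformat(log.stem) < cutoff:
                    continue  # outside the lookback window
            except ValueError:
                continue  # not a date-named file
            for line in log.read_text(encoding="utf-8").splitlines():
                if "LESSON:" in line and line not in existing:
                    mem.write(line + "\n")
                    promoted += 1
    return promoted
```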
The Boot Sequence
Memory is only useful if the agent actually reads it at the start of each session.
The way this works in practice: when Evo starts a session, it automatically loads today's daily log, the long-term MEMORY.md, and a few other context files. It reads them before doing anything else.
This boot sequence is what turns a stateless language model into something that feels like it remembers. It's not magic. It's just: read your notes before you start working.
Most agent setups skip this, or make it optional, or load the memory only when the agent "needs" it. That's wrong. Load it every time. The cost is minimal (a few thousand tokens). The benefit is continuity.
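The boot sequence itself can be a few lines: concatenate the memory files into one block and inject it at session start. The file names below are assumptions matching the layout sketched earlier.

```python
from datetime import date
from pathlib import Path

# Hypothetical file locations; adjust to your own layout.
CONTEXT_FILES = [
    Path("memory/MEMORY.md"),                         # long-term memory
    Path("memory/SOURCE_OF_TRUTH.md"),                # canonical facts
    Path(f"memory/daily/{date.today():%Y-%m-%d}.md"), # today's log
]


def boot_context() -> str:
    """Concatenate memory files into one block, injected at session start."""
    parts = []
    for f in CONTEXT_FILES:
        body = f.read_text(encoding="utf-8") if f.exists() else "(empty)"
        parts.append(f"## {f.name}\n{body}")
    return "\n\n".join(parts)
```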
What Goes Wrong Without Structure
If you run an agent without this architecture, a few things happen:
The agent invents facts. It doesn't know what the current product prices are, so it makes something up based on what seems plausible. The more it does this, the more its internal model of the business diverges from reality.
The agent repeats work. It has no memory of attempting the same task two weeks ago and learning it doesn't work. It tries again with the same approach and fails the same way.
The agent loses context on priorities. You told it last week that the #1 goal is newsletter growth, not TikTok analytics. It doesn't remember that. It treats everything as equally important.
These aren't model failures. They're architecture failures. A better model won't fix them.
One File That Changes Everything: Source of Truth
Alongside daily and long-term memory, there's a third file that matters: a canonical facts document.
This file contains things that are objectively true about the business: product names, current prices, active URLs, what's live vs. what's in development, what the agent is and isn't allowed to do. It's not a memory file. It's a reference file. The agent checks it before making any claim about the business.
Without this, agents drift. They'll tell you a product costs $49 when you changed the price to $29 last month. They'll link to a URL that no longer exists. They'll describe features that were cut.
The source of truth file is the canonical record. One file. The agent reads it. What's in it is what's true.
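One simple way to make this checkable, assuming the facts are kept as `key: value` bullets in a hypothetical `SOURCE_OF_TRUTH.md`: parse them into a dict so the agent (or a guard script) can look up a fact before stating it.

```python
from pathlib import Path

TRUTH_FILE = Path("memory/SOURCE_OF_TRUTH.md")  # hypothetical name and location


def load_facts() -> dict[str, str]:
    """Parse simple '- key: value' bullets from the source-of-truth file."""
    facts: dict[str, str] = {}
    if TRUTH_FILE.exists():
        for line in TRUTH_FILE.read_text(encoding="utf-8").splitlines():
            if ":" in line and not line.startswith("#"):
                key, _, value = line.partition(":")
                facts[key.strip().lstrip("- ")] = value.strip()
    return facts
```

With this in place, "what does the product cost?" is a dictionary lookup, not a guess.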
The Practical Setup
If you want to implement this today, you need four things:
A folder for daily log files, one per day, formatted as YYYY-MM-DD.md. The agent appends to the current day's file as it works.
A single long-term MEMORY.md file with a clear structure: what's important, what decisions have been made, what lessons have been learned. Keep it under 10KB. It loads every session, so you want it dense, not sprawling.
A source of truth document with canonical business facts. Update it manually when things change. Treat it like a contract with your agent.
A boot sequence that loads all three automatically when a session starts. Most agent platforms have a way to inject files at session start. Use it.
That's the whole system. No vector databases. No embeddings. No complex retrieval pipelines. Just structured text files that the agent reads and writes consistently.
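The four pieces above can be scaffolded in one short script. The folder name and seed headers are assumptions, not a prescribed convention:

```python
from pathlib import Path

ROOT = Path("memory")  # hypothetical project location


def scaffold() -> None:
    """Create the folder and seed files described above, if missing."""
    (ROOT / "daily").mkdir(parents=True, exist_ok=True)
    for name, header in [
        ("MEMORY.md", "# Long-term memory\n"),
        ("SOURCE_OF_TRUTH.md", "# Canonical facts\n"),
    ]:
        f = ROOT / name
        if not f.exists():
            f.write_text(header, encoding="utf-8")


scaffold()
```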
What This Looks Like Running
Evo has been running this architecture for months now. Every morning, it picks up where it left off. It knows what the active cron jobs are, what products are live, what the current revenue numbers are, what failed last week and why.
When something significant happens, it writes it down. When a lesson is worth keeping, it promotes it to long-term memory. When I log in after a day at work, it has context. Not perfect context. Not human-level continuity. But enough to keep working without starting over.
The result isn't an agent with perfect memory. It's an agent that degrades gracefully, has enough context to do useful work, and doesn't require you to re-explain everything every single time.
For anyone building autonomous systems that need to run reliably without constant supervision, that's the real goal.
If you want to go deeper on the architecture behind Evo, Book 1 walks through the full system: identity files, memory layers, source of truth documents, verification loops, and how they fit together. It's at xeroaiagency.com/learn/build-an-ai-cofounder.
Originally published at xeroaiagency.com