<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: lweiss01</title>
    <description>The latest articles on DEV Community by lweiss01 (@lweiss01).</description>
    <link>https://dev.to/lweiss01</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F91372%2F347ebd17-a84d-489b-b376-857af9a868e9.png</url>
      <title>DEV Community: lweiss01</title>
      <link>https://dev.to/lweiss01</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/lweiss01"/>
    <language>en</language>
    <item>
      <title>The Agent-to-Agent Continuity Gap Nobody Is Talking About</title>
      <dc:creator>lweiss01</dc:creator>
      <pubDate>Wed, 27 May 2026 13:55:44 +0000</pubDate>
      <link>https://dev.to/lweiss01/the-agent-to-agent-continuity-gap-nobody-is-talking-about-2bgh</link>
      <guid>https://dev.to/lweiss01/the-agent-to-agent-continuity-gap-nobody-is-talking-about-2bgh</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR:&lt;/strong&gt; Most AI agent memory discussions still assume one agent talking to itself across sessions. But real coding workflows already involve Claude, Codex, Cursor, and Gemini touching the same repo in the same week. The hard problem is not "how does an agent remember." It is "how do multiple agents stay coordinated on the same project without stepping on each other." That problem does not live inside any one agent. It lives in the repo.&lt;/p&gt;




&lt;p&gt;I wrote a post last week arguing that AI coding agent memory belongs in the repository, not the chat window. Checkpoints, not transcripts.&lt;/p&gt;

&lt;p&gt;Sitting with that argument for a few days, I realized it is actually downstream of a bigger one I had not made explicitly yet. The checkpoint primitive only matters because of a problem the current agent stack does not have a name for.&lt;/p&gt;

&lt;p&gt;So here it is.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Industry Map Has A Blind Spot
&lt;/h2&gt;

&lt;p&gt;There is a really good 2026 agent stack map going around right now from Paolo Perrone. Six layers. Models, protocols, memory, frameworks, eval, guardrails. It is a useful map.&lt;/p&gt;

&lt;p&gt;But read the memory layer carefully and you notice something.&lt;/p&gt;

&lt;p&gt;Every memory tier on that map assumes one agent.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;In-context state lives inside one agent's context window&lt;/li&gt;
&lt;li&gt;Vector retrieval lives inside one agent's RAG pipeline&lt;/li&gt;
&lt;li&gt;Persistent memory services like Letta, Zep, and Mem0 are designed for one agent learning across sessions&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is a real problem and worth solving. But it is not the problem most coding workflows actually have.&lt;/p&gt;

&lt;h2&gt;
  
  
  Most Coding Workflows Already Use Multiple Agents
&lt;/h2&gt;

&lt;p&gt;Look at how anyone serious is shipping code right now.&lt;/p&gt;

&lt;p&gt;Claude for architecture and review.&lt;br&gt;
Codex for implementation.&lt;br&gt;
Cursor for inline iteration.&lt;br&gt;
Gemini for exploration.&lt;br&gt;
A human approving and editing all of it.&lt;/p&gt;

&lt;p&gt;That is not a future scenario. That is a Tuesday.&lt;/p&gt;

&lt;p&gt;And every single one of those agents has its own context window, its own session, its own memory, its own opinions about the codebase. None of them know what the other ones did an hour ago.&lt;/p&gt;

&lt;p&gt;The continuity problem is not "Claude forgot what we discussed yesterday."&lt;/p&gt;

&lt;p&gt;The continuity problem is "Claude does not know what Codex implemented this morning, Cursor reverted half of it at lunch, and the human merged something different from a different branch."&lt;/p&gt;

&lt;p&gt;That is a coordination problem dressed up as a memory problem.&lt;/p&gt;

&lt;h2&gt;
  
  
  Agent-to-Agent Memory Does Not Exist Yet
&lt;/h2&gt;

&lt;p&gt;Perrone's map notes this honestly. MCP standardized how agents call tools. It says nothing about how agents talk to each other. IBM has ACP. Google has A2A. Neither is a standard. Neither is widely adopted. Neither solves the coding workflow case.&lt;/p&gt;

&lt;p&gt;So in practice, every team running a multi-agent coding workflow is solving this themselves. Usually badly. Usually by re-explaining context to every new session by hand.&lt;/p&gt;

&lt;p&gt;The dedicated memory vendors do not solve this either, because they are designed to give one agent a longer memory. Plugging Cursor and Claude Code into the same Mem0 instance and hoping they coordinate is not a thing that works today.&lt;/p&gt;

&lt;p&gt;Memory infrastructure is single-agent infrastructure. The multi-agent coordination layer is missing.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Repo Is The Only Shared Surface
&lt;/h2&gt;

&lt;p&gt;Here is the thing that kept hitting me.&lt;/p&gt;

&lt;p&gt;When Claude, Codex, Cursor, and Gemini are all working on the same project, there is exactly one piece of infrastructure all of them already see.&lt;/p&gt;

&lt;p&gt;The repository.&lt;/p&gt;

&lt;p&gt;They all read it. They all write to it. They all already have file system access through MCP or equivalent. Git already tracks who changed what and when.&lt;/p&gt;

&lt;p&gt;The repo is the shared substrate. It is the only shared substrate. Everything else is per-agent.&lt;/p&gt;

&lt;p&gt;So if you want continuity across agents, the continuity artifacts have to live in the repo. Not in a vector database that one agent is plugged into. Not in a hosted memory service that another agent does not know about. In the repo. In files. Versioned. Auditable. Diffable. Visible to every agent that can read the file system.&lt;/p&gt;

&lt;p&gt;That is what makes checkpoints the right primitive. Not because vector search is bad. Vector search is great for what it does. But you cannot retrieve from a vector store that the next agent has never heard of.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Reframe
&lt;/h2&gt;

&lt;p&gt;When you stop framing the problem as "agent memory" and start framing it as "multi-agent coordination on a shared artifact," a lot of the tooling debates collapse.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Bigger context windows do not help. The next agent has a different context window.&lt;/li&gt;
&lt;li&gt;Better RAG does not help. The next agent has a different RAG pipeline, or no RAG at all.&lt;/li&gt;
&lt;li&gt;Hosted memory services do not help unless every agent in your workflow is plugged into the same one, which they are not.&lt;/li&gt;
&lt;li&gt;Transcripts do not help, because they are noise and the next agent does not have your transcript.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;What does help is a small, structured, versioned record of what was decided, what is in progress, what is at risk, and what the next agent should pick up. Sitting in the repo. Where everyone can see it.&lt;/p&gt;

&lt;p&gt;That is the continuity primitive the current stack map does not have a slot for.&lt;/p&gt;

&lt;h2&gt;
  
  
  What I Am Building
&lt;/h2&gt;

&lt;p&gt;Holistic is an open source CLI exploring this idea. Repo-native checkpoints. Agent-agnostic. No vendor account, no hosted service, no per-agent SDK. Just files in your repo that any agent can read and any agent can update.&lt;/p&gt;

&lt;p&gt;It is still early. The thesis is what I am most interested in pressure testing right now.&lt;/p&gt;

&lt;p&gt;If you are running a multi-agent coding workflow and you have your own answer to the coordination problem, I want to hear it. If you think I am wrong about the repo being the right substrate, I really want to hear that.&lt;/p&gt;

&lt;p&gt;Repo: &lt;a href="https://github.com/lweiss01/holistic" rel="noopener noreferrer"&gt;https://github.com/lweiss01/holistic&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The repo remembers, not the window. And no single agent remembers for the others.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>webdev</category>
      <category>programming</category>
      <category>productivity</category>
    </item>
    <item>
      <title>Checkpoints, Not Transcripts: Rethinking AI Coding Agent Memory</title>
      <dc:creator>lweiss01</dc:creator>
      <pubDate>Sun, 24 May 2026 18:40:21 +0000</pubDate>
      <link>https://dev.to/lweiss01/checkpoints-not-transcripts-rethinking-ai-coding-agent-memory-21pj</link>
      <guid>https://dev.to/lweiss01/checkpoints-not-transcripts-rethinking-ai-coding-agent-memory-21pj</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR:&lt;/strong&gt; AI coding agent memory should live in the repository, not the chat window. Bigger context windows and vector databases are solving the wrong problem. Here is the case for treating the repo itself as the durable cognitive surface.&lt;/p&gt;




&lt;p&gt;Everyone is trying to solve AI agent memory right now.&lt;/p&gt;

&lt;p&gt;Longer context windows.&lt;br&gt;
Vector databases.&lt;br&gt;
Conversation replay.&lt;br&gt;
Semantic retrieval.&lt;br&gt;
Infinite transcripts.&lt;/p&gt;

&lt;p&gt;But after spending months building workflows across Claude, Codex, Gemini, Cursor, and other coding agents, I've started to think we may be treating the wrong thing as the source of truth.&lt;/p&gt;

&lt;p&gt;The problem is not:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;"How do we make the model remember everything forever?"&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The problem is:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;"How does a software project remain cognitively coherent across sessions, compaction, agent switches, and time?"&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Those are very different problems.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Context Window Is Not Durable Infrastructure
&lt;/h2&gt;

&lt;p&gt;Modern AI coding workflows are surprisingly fragile.&lt;/p&gt;

&lt;p&gt;An agent works for hours. The context window fills up. Compaction happens. Then suddenly:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;architectural reasoning disappears&lt;/li&gt;
&lt;li&gt;unresolved work gets forgotten&lt;/li&gt;
&lt;li&gt;regressions come back&lt;/li&gt;
&lt;li&gt;agents undo each other&lt;/li&gt;
&lt;li&gt;humans re-explain the same context repeatedly&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The industry response so far has mostly been: store more. Bigger context windows, vector databases, hosted memory services, semantic retrieval over giant transcripts.&lt;/p&gt;

&lt;p&gt;But transcripts are not understanding.&lt;/p&gt;

&lt;p&gt;And replaying giant chat histories is not the same thing as preserving operational continuity.&lt;/p&gt;

&lt;p&gt;In practice, most coding workflows do not fail because information disappeared entirely. They fail because the important state was never extracted from the conversation in the first place.&lt;/p&gt;

&lt;h2&gt;
  
  
  Checkpoints, Not Transcripts
&lt;/h2&gt;

&lt;p&gt;The idea I have been exploring is pretty simple:&lt;/p&gt;

&lt;p&gt;Instead of preserving entire conversations forever, preserve structured checkpoints at meaningful moments.&lt;/p&gt;

&lt;p&gt;Not:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;every token&lt;/li&gt;
&lt;li&gt;every thought&lt;/li&gt;
&lt;li&gt;every conversational detour&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But the things that actually matter:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;current state&lt;/li&gt;
&lt;li&gt;architectural decisions&lt;/li&gt;
&lt;li&gt;unresolved threads&lt;/li&gt;
&lt;li&gt;regression risks&lt;/li&gt;
&lt;li&gt;next recommended actions&lt;/li&gt;
&lt;li&gt;implementation reasoning&lt;/li&gt;
&lt;li&gt;handoff context&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The checkpoint becomes the durable source of truth.&lt;/p&gt;

&lt;p&gt;The live context window becomes disposable working memory.&lt;/p&gt;

&lt;p&gt;That distinction changes a lot.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Repo Should Remember
&lt;/h2&gt;

&lt;p&gt;One realization that kept hitting me while working across multiple coding agents:&lt;/p&gt;

&lt;p&gt;The repository itself is the only thing that actually persists.&lt;/p&gt;

&lt;p&gt;Agents change.&lt;br&gt;
Models change.&lt;br&gt;
Sessions end.&lt;br&gt;
Windows compact.&lt;/p&gt;

&lt;p&gt;But the repo stays.&lt;/p&gt;

&lt;p&gt;So instead of treating continuity as something trapped inside a chat session, I started treating continuity as a repo-native concern.&lt;/p&gt;

&lt;p&gt;That means:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;continuity artifacts live in the repo&lt;/li&gt;
&lt;li&gt;handoffs live in the repo&lt;/li&gt;
&lt;li&gt;operational state lives in the repo&lt;/li&gt;
&lt;li&gt;regression memory lives in the repo&lt;/li&gt;
&lt;li&gt;checkpoints live in the repo&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The repo remembers, not the window.&lt;/p&gt;

&lt;h2&gt;
  
  
  Multi-Agent Development Is Already Here
&lt;/h2&gt;

&lt;p&gt;A lot of tooling still assumes:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;one human, one agent, one session.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;That is not how many people are actually working anymore.&lt;/p&gt;

&lt;p&gt;Real workflows increasingly look like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Claude for architecture&lt;/li&gt;
&lt;li&gt;Codex for implementation&lt;/li&gt;
&lt;li&gt;Cursor for iteration&lt;/li&gt;
&lt;li&gt;Gemini for exploration&lt;/li&gt;
&lt;li&gt;a human reviewing all of it&lt;/li&gt;
&lt;li&gt;another session tomorrow continuing the work&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Continuity is no longer just memory. It is coordination across interchangeable execution surfaces. And once you frame it that way, the chat window stops looking like the right place to store anything important.&lt;/p&gt;

&lt;h2&gt;
  
  
  AI Agents Are Temporary. Repositories Persist.
&lt;/h2&gt;

&lt;p&gt;I think we are entering a phase where software repositories themselves become cognitive systems:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;accumulating decisions&lt;/li&gt;
&lt;li&gt;preserving continuity&lt;/li&gt;
&lt;li&gt;coordinating work&lt;/li&gt;
&lt;li&gt;surviving agent turnover&lt;/li&gt;
&lt;li&gt;carrying operational memory forward over time&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Not because the models became infinitely smart.&lt;/p&gt;

&lt;p&gt;But because the continuity stopped depending entirely on the model session.&lt;/p&gt;

&lt;p&gt;That is the direction I have been exploring with Holistic, an open-source CLI for repo-native continuity across agents: &lt;a href="https://github.com/lweiss01/holistic" rel="noopener noreferrer"&gt;https://github.com/lweiss01/holistic&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Still early. Still evolving quickly. If you are working across multiple coding agents and running into the continuity problem, I would genuinely love feedback, critiques, or just a conversation about how you are solving it.&lt;/p&gt;

&lt;p&gt;The repo remembers, not the window.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>productivity</category>
      <category>tooling</category>
    </item>
  </channel>
</rss>
