<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: lweiss01</title>
    <description>The latest articles on DEV Community by lweiss01 (@lweiss01).</description>
    <link>https://dev.to/lweiss01</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F91372%2F347ebd17-a84d-489b-b376-857af9a868e9.png</url>
      <title>DEV Community: lweiss01</title>
      <link>https://dev.to/lweiss01</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/lweiss01"/>
    <language>en</language>
    <item>
      <title>Maybe We're Using the Wrong Metaphor for AI</title>
      <dc:creator>lweiss01</dc:creator>
      <pubDate>Sat, 20 Jun 2026 17:39:08 +0000</pubDate>
      <link>https://dev.to/lweiss01/maybe-were-using-the-wrong-metaphor-for-ai-649</link>
      <guid>https://dev.to/lweiss01/maybe-were-using-the-wrong-metaphor-for-ai-649</guid>
      <description>&lt;p&gt;A friend asked me recently what I'd been spending so much time working on. I told her I'd been experimenting with AI agents. She nodded, thought about it for a second, and then asked the obvious question:&lt;/p&gt;

&lt;p&gt;"So... ChatGPT?"&lt;/p&gt;

&lt;p&gt;At first, that felt like an easy question to answer. I started explaining that an agent can use tools, work through tasks over time, read files, modify code, and interact with systems around it. Some frameworks even coordinate multiple agents, each with different roles and responsibilities. The explanation wasn't wrong, but the longer I talked, the less satisfied I became with it. I realized I wasn't struggling to explain the technology. I was struggling with the metaphor.&lt;/p&gt;

&lt;p&gt;Most of the language we use to describe modern AI systems comes from organizations. We talk about workers, managers, planners, reviewers, and teams. We discuss delegation, coordination, communication, and hierarchy. If you spend enough time reading about agent architectures, it starts to sound less like software engineering and more like organizational design. At first that seemed perfectly reasonable. Agents behave enough like people that the comparison feels natural.&lt;/p&gt;

&lt;p&gt;Lately, though, I've started wondering whether that metaphor is quietly shaping how we think about the problem itself.&lt;/p&gt;

&lt;h2&gt;
  
  
  Looking at the Work
&lt;/h2&gt;

&lt;p&gt;Most conversations about AI focus on the worker. Which model should we use? Which framework is best? How many agents should be involved? What roles should they have? How should they coordinate? These are useful questions, and it's easy to understand why they've become the center of the discussion. After all, the workers are the visible part of the system.&lt;/p&gt;

&lt;p&gt;What I've found myself paying attention to, however, is not the worker but &lt;strong&gt;the work itself&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;When I step back and look at what is actually happening, I don't really see an organization. I see work moving through a process. Information enters the system, decisions get made, tasks move forward, stall, loop, branch, and occasionally have to be done again. The thing that increasingly captures my attention isn't the individual agent performing a task. It's the flow of the work itself and the way that flow changes over time.&lt;/p&gt;

&lt;p&gt;That distinction may sound subtle, but I think it leads to a different set of questions.&lt;/p&gt;

&lt;p&gt;If we view AI primarily as a workforce problem, then the obvious goal is to build better workers. Smarter models, better prompting techniques, improved coordination between agents, and more sophisticated frameworks all become natural areas of focus. Much of the current AI ecosystem is focused on exactly those things.&lt;/p&gt;

&lt;p&gt;What I find interesting is that many of the frustrations people encounter with AI don't actually feel like failures of intelligence. The models are already astonishingly capable. They can write software, summarize research, analyze documents, generate designs, and solve problems that would have seemed extraordinary only a few years ago. Yet despite all of that capability, the same kinds of problems continue to appear. Context gets lost. Work gets repeated. Requirements drift. Effort accumulates without producing meaningful progress. Teams of agents sometimes create more complexity than clarity.&lt;/p&gt;

&lt;p&gt;Those don't strike me as intelligence problems. They strike me as process problems.&lt;/p&gt;

&lt;h2&gt;
  
  
  A Different Tradition of Thinking
&lt;/h2&gt;

&lt;p&gt;Once I started looking at AI through that lens, I found myself thinking less about artificial intelligence and more about systems. Manufacturing, operations, quality management, and reliability engineering have spent decades studying how work moves through complex systems. They ask questions about waste, variation, bottlenecks, feedback loops, and early warning signals. When problems emerge, the goal is not simply to identify who made a mistake. The goal is to understand what the system is producing and why.&lt;/p&gt;

&lt;p&gt;The more I think about AI, the more relevant those questions seem.&lt;/p&gt;

&lt;p&gt;That's why many discussions about agents feel incomplete to me. We spend enormous amounts of time debating models, frameworks, team structures, and architectures. Those discussions matter, but they all tend to assume that the worker is the center of the story. Increasingly, I'm convinced that the system deserves at least as much attention.&lt;/p&gt;

&lt;p&gt;A factory can employ brilliant workers and still generate waste if the process is poorly designed. A software team can be filled with talented engineers and still struggle if the workflow is unhealthy. And an AI system can contain remarkably capable models while still produce disappointing outcomes if the surrounding process is fragile.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Factory Behind the Workers
&lt;/h2&gt;

&lt;p&gt;Maybe that's why I keep finding myself drawn toward concepts like continuity, feedback loops, drift, supervision, operational signals, and process health. Those ideas feel less like questions about intelligence and more like questions about systems. They focus less on the capabilities of individual workers and more on the health of the environment those workers operate within.&lt;/p&gt;

&lt;p&gt;To be clear, I don't think the workers stop mattering. Better models matter. Better tools matter. Better frameworks matter. But I increasingly suspect we're reaching a point where making the workers smarter is only part of the story.&lt;/p&gt;

&lt;p&gt;The system matters too.&lt;/p&gt;

&lt;p&gt;I don't know whether the factory metaphor is ultimately the right one. What I do know is that the more I work with AI, the more I find myself paying attention to the movement of the work rather than the characteristics of the worker. And if that's the right direction, then some of the next breakthroughs in AI may come not from making the workers smarter, but from understanding the systems they're working within.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>agents</category>
      <category>discuss</category>
      <category>systems</category>
    </item>
    <item>
      <title>AI Agents Don't Work. They Take Shifts.</title>
      <dc:creator>lweiss01</dc:creator>
      <pubDate>Wed, 10 Jun 2026 14:00:00 +0000</pubDate>
      <link>https://dev.to/lweiss01/ai-agents-dont-work-they-take-shifts-8of</link>
      <guid>https://dev.to/lweiss01/ai-agents-dont-work-they-take-shifts-8of</guid>
      <description>&lt;p&gt;I think I've been looking at AI projects backwards.&lt;/p&gt;

&lt;p&gt;For months, agents were the center of everything I thought about. Agent memory, agent orchestration, agent frameworks, agent autonomy. Like most people working in this space, I put the agent at the center of the picture because the agent was the thing doing the work. Optimizing it seemed obvious.&lt;/p&gt;

&lt;p&gt;Then I noticed something I haven't been able to stop thinking about since: every time I looked at a real project, the agent kept changing. Claude would spend a while on a feature. Later, Codex would pick it up. Cursor would get involved. Sometimes I'd step in myself, and then another model would take over. The work kept moving forward, but the participants kept rotating in and out.&lt;/p&gt;

&lt;p&gt;At first I read this as a memory problem. Context seemed to disappear between sessions. Decisions had to be rediscovered, failed approaches retraced, the same explanations given over and over. The obvious conclusion was that agents were forgetting.&lt;/p&gt;

&lt;p&gt;But the more I watched these transitions, the less convincing that explanation became. Nothing had actually been forgotten. Claude hadn't forgotten anything. Neither had Codex. The next participant simply hadn't been there when the previous decisions were made. What I was seeing wasn't memory loss. It was the friction that appears whenever work has to survive a participant change.&lt;/p&gt;

&lt;p&gt;That distinction sent me down a very different path.&lt;/p&gt;

&lt;p&gt;We've become accustomed to putting the wrong thing at the center of the diagram. We talk about agents the way a factory might talk about workers. We measure their capabilities, compare their strengths and weaknesses, and pour enormous energy into making them more effective. Those are useful conversations, but they left me wondering whether we were focusing on the participant instead of the thing being produced.&lt;/p&gt;

&lt;p&gt;A factory doesn't exist for its workers. It exists so cars can roll off the line.&lt;/p&gt;

&lt;p&gt;That sounds almost too obvious to say out loud, but it's worth sitting with for a moment. When you walk through a factory, nobody mistakes the worker for the product. The workers matter. They contribute. Some are more skilled than others, some faster. But none of them are the thing moving through the system. The car is. The entire factory is organized around helping that car move from one stage of production to the next. Workers participate in the process, but the car is what persists. It survives shift changes, vacations, retirements, the constant replacement of individual participants.&lt;/p&gt;

&lt;p&gt;Factories have spent decades learning how to handle that reality: they don't build continuity systems because workers are forgetful. They build continuity systems because workers leave. The day shift goes home, the night shift arrives, and the work continues.&lt;/p&gt;

&lt;p&gt;Once I saw that, I couldn't stop seeing the same pattern in AI development. Projects outlive agents. Projects outlive sessions. Projects outlive models. The project I'm working on today will almost certainly be touched by systems that don't exist yet. Models will improve, frameworks will come and go, entire categories of tooling will appear and disappear. The project remains.&lt;/p&gt;

&lt;p&gt;That's what finally made the idea click.&lt;/p&gt;

&lt;p&gt;I had been centering the wrong object. I was treating the agent as the thing moving through the system and the project as a container for the work. But the longer I looked at real projects, the more backwards that seemed. Everything else rotated. The project was the only thing that stayed.&lt;/p&gt;

&lt;p&gt;Claude isn't the thing. Codex isn't the thing. Cursor isn't the thing. I'm not even the thing. We're all participants acting on the thing.&lt;/p&gt;

&lt;p&gt;The project is the thing.&lt;/p&gt;

&lt;p&gt;I want to sit with that for a second, because it's easy to nod along and miss what it actually implies.&lt;/p&gt;

&lt;p&gt;It implies that the agent I'm using today is a shift worker. A skilled one, maybe my favorite one, but a shift worker. When the session ends, the shift ends. Whatever that agent understood about the work either made it into the project or it didn't. If it didn't, it's gone, and no amount of model improvement fixes that, because the next participant wasn't there.&lt;/p&gt;

&lt;p&gt;It also implies that I'm a shift worker, which was the part that took longest to accept. I close the laptop at night, and tomorrow a version of me shows up with most of the context evaporated, asking the same questions a fresh agent would ask. The gap between me and Claude at the start of a session is smaller than I'd like to admit.&lt;/p&gt;

&lt;p&gt;And strangely, the demotion is freeing. If the project is the thing, I can stop asking which agent is smartest and start asking what each participant leaves behind in the project when their shift ends. That's a different question with different answers. The smartest agent that leaves nothing behind is worth less to the project than an average one that writes everything down.&lt;/p&gt;

&lt;p&gt;That realization changed how I think about continuity. A checkpoint is not memory for an agent. A checkpoint is continuity for a project. A handoff isn't helping an agent remember what happened, it's helping the work survive a shift change. The purpose of continuity was never to preserve the participant. It's to preserve the work.&lt;/p&gt;

&lt;p&gt;Which is why I've become less interested in perfect memory and more interested in continuity. Perfect memory helps a participant. Continuity helps a project. And projects are the things that actually have to survive.&lt;/p&gt;

&lt;p&gt;The funny thing is that the title of this post isn't really the conclusion. It's just the observation that got me here. AI agents don't work. They take shifts. The real conclusion is something else entirely:&lt;/p&gt;

&lt;p&gt;The project is the unit of continuity. The agent is the unit of execution.&lt;/p&gt;

&lt;p&gt;Once I started looking at AI systems through that lens, a lot of ideas that had previously felt separate (checkpoints, handoffs, project memory, supervision, even Andon) started looking like different parts of the same system. They're all answering the same question.&lt;/p&gt;

&lt;p&gt;How does the work survive the shift change?&lt;/p&gt;

</description>
      <category>programming</category>
      <category>discuss</category>
      <category>agents</category>
      <category>architecture</category>
    </item>
    <item>
      <title>I Built a Prediction Market Insider Trading Detector in One Afternoon (Because George Santos)</title>
      <dc:creator>lweiss01</dc:creator>
      <pubDate>Wed, 03 Jun 2026 16:58:13 +0000</pubDate>
      <link>https://dev.to/lweiss01/i-built-a-prediction-market-insider-trading-detector-in-one-afternoon-because-george-santos-4dkk</link>
      <guid>https://dev.to/lweiss01/i-built-a-prediction-market-insider-trading-detector-in-one-afternoon-because-george-santos-4dkk</guid>
      <description>&lt;p&gt;This morning I woke up to news that George Santos is under federal investigation for insider trading on Kalshi. Within a few hours, I had a working anomaly detector monitoring 410 politically-sensitive prediction markets with a live dashboard.&lt;/p&gt;

&lt;p&gt;Here's how it happened, what I found, and how you can run it yourself.&lt;/p&gt;




&lt;h2&gt;
  
  
  What Santos Actually Did
&lt;/h2&gt;

&lt;p&gt;The short version is almost embarrassingly simple. The day before Trump's State of the Union address, Santos posted a video on X saying he'd be in the gallery. Traders on Kalshi piled into "yes he'll attend" contracts and the odds shot up.&lt;/p&gt;

&lt;p&gt;What Santos didn't mention was that he'd already placed bets that he &lt;em&gt;wouldn't&lt;/em&gt; attend. When he posted "Watching SOTU from an airport tv was not part of the plan! FML" while Trump was speaking, those odds cratered and Santos walked away with tens of thousands of dollars.&lt;/p&gt;

&lt;p&gt;Kalshi caught it because they have internal account data. They knew which account placed those trades. They froze it, referred the matter to the DOJ and the CFTC, and here we are.&lt;/p&gt;

&lt;p&gt;The thing that stuck with me reading the story was a simpler question: &lt;strong&gt;what could you see from the outside?&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  The Public API Question
&lt;/h2&gt;

&lt;p&gt;Kalshi has a solid public REST API. No auth required for market data. I'd poked at it before for other projects, so I knew the shape of it. The question was whether the historical trade data was rich enough to detect the Santos pattern after the fact.&lt;/p&gt;

&lt;p&gt;The Santos pattern, stripped down, is:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Someone with private knowledge establishes a position&lt;/li&gt;
&lt;li&gt;A public statement moves the market significantly&lt;/li&gt;
&lt;li&gt;Their position was already on the right side of that move&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;You can't see who placed trades from the public API. But you can see &lt;em&gt;that&lt;/em&gt; unusual volume concentration happened before a market moved. That's the tipping-point signal -- not proof of anything, but a flag worth examining.&lt;/p&gt;

&lt;p&gt;So I started pulling on the thread.&lt;/p&gt;




&lt;h2&gt;
  
  
  What I Found in the API
&lt;/h2&gt;

&lt;p&gt;After some exploration (the historical endpoint pagination is a bit of an adventure), I found that Kalshi's public API exposes:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Full trade history per market -- ticker, price, contract size, timestamp, block trade flag&lt;/li&gt;
&lt;li&gt;Candlestick data at 1-minute, 1-hour, and 1-day intervals&lt;/li&gt;
&lt;li&gt;Market metadata including series, category, open/close time, volume&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;What it doesn't expose: user identity, order placement timestamps (only fill time), or anything that would let you name a specific trader.&lt;/p&gt;

&lt;p&gt;The Santos SOTU markets themselves turned out to be inaccessible -- they were structured as multivariate markets with a different ticker format that doesn't surface cleanly through the standard search. But that's fine. The Santos case is the argument for &lt;em&gt;why&lt;/em&gt; this tool should exist, not the data it runs on.&lt;/p&gt;

&lt;p&gt;What I found instead was more interesting.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Watchlist
&lt;/h2&gt;

&lt;p&gt;I spent about an hour mapping out which Kalshi series carry genuine MNPI risk -- markets where someone with access to non-public government information would have a meaningful trading edge.&lt;/p&gt;

&lt;p&gt;The list got long fast:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Executive Actions&lt;/strong&gt; -- Cabinet departures (&lt;code&gt;KXCABOUT&lt;/code&gt;), next AG (&lt;code&gt;KXNEXTAG&lt;/code&gt;), next SecDef, next DNI, pardons, insurrection act invocation, martial law. Anyone in the White House personnel office or inner circle knows these things before markets do.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;SCOTUS&lt;/strong&gt; -- resignation markets, court size change, next justice confirmation. Clerks and justices themselves know when a retirement is coming.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Economic Data&lt;/strong&gt; -- Fed rate decisions, CPI, GDP. BLS and BEA staff have the data weeks before release.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Geopolitical&lt;/strong&gt; -- Greenland acquisition, Panama Canal, Taiwan recognition, Zelensky-Putin talks. NSC and State Department staff work these negotiations in real time.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Congressional&lt;/strong&gt; -- impeachment, house/senate control, government shutdown, veto override. Congressional whips know vote counts before anyone else.&lt;/p&gt;

&lt;p&gt;That's 33 series, 410 open markets, and a combined trade history of over 123,000 records from just the first collection run.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Anomaly Scorer
&lt;/h2&gt;

&lt;p&gt;With data flowing in, I built a three-signal scorer:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Volume Z-Score&lt;/strong&gt; -- compares recent trade volume against a rolling baseline. A spike of 8+ standard deviations above normal is a strong signal. This is the Santos pattern in statistical form: someone piling into a position before a market-moving event.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Block Trade Ratio&lt;/strong&gt; -- the API flags block trades (large privately-negotiated contracts) separately. Heavy block trade concentration on a political market before an announcement is suspicious in a way that retail chatter isn't.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Price Divergence&lt;/strong&gt; -- detects sudden directional price movement inconsistent with gradual drift. The signature of a market that already knows something.&lt;/p&gt;

&lt;p&gt;The compound score combines all three with a block trade modifier. Thresholds: yellow at 25, red at 60.&lt;/p&gt;




&lt;h2&gt;
  
  
  What Flagged on Day One
&lt;/h2&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fab1f8tc1ymtaeshg36tu.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fab1f8tc1ymtaeshg36tu.png" alt="pmwatch dasbhoard showing market signals" width="799" height="451"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Four markets scored above threshold within hours of the first collection run:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Market&lt;/th&gt;
&lt;th&gt;Score&lt;/th&gt;
&lt;th&gt;What it is&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;KXNEXTAG-29-TBLA&lt;/td&gt;
&lt;td&gt;167&lt;/td&gt;
&lt;td&gt;Todd Blanche for AG, 22% price move&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;KXNEXTODNI-29-RCRA&lt;/td&gt;
&lt;td&gt;156&lt;/td&gt;
&lt;td&gt;Rick Crawford for DNI, 129% price move&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;KXGREENTERRITORY-29&lt;/td&gt;
&lt;td&gt;152&lt;/td&gt;
&lt;td&gt;US acquires Greenland, vol z=11.6&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;KXCPI-26MAY-T0.5&lt;/td&gt;
&lt;td&gt;105&lt;/td&gt;
&lt;td&gt;CPI above 0.5%, moving DOWN one week before June 10 release&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;To be clear: high scores don't mean insider trading happened. They mean unusual market activity that warrants a closer look.&lt;/p&gt;

&lt;p&gt;TBLA is Todd Blanche -- Trump's former personal attorney who represented him through the criminal trials. RCRA is Rick Crawford, the Arkansas congressman who has been floated for intelligence roles. Both markets were flagging compound anomalies with volume z-scores above 8 on pmwatch's first run.&lt;/p&gt;

&lt;p&gt;The Todd Blanche signal is worth noting specifically. It strengthened from 118 to 167 between the first and second hourly collection runs, which is the pattern you'd expect from a position being built ahead of an announcement rather than random noise. And the CPI market moving down with unusual volume a week before BLS publishes the May number? That's a flag worth watching.&lt;/p&gt;

&lt;p&gt;I don't know what any of this means. That's kind of the point. pmwatch surfaces candidates. Investigators with subpoena power figure out the rest.&lt;/p&gt;




&lt;h2&gt;
  
  
  Day Two Update (June 4, 2026)
&lt;/h2&gt;

&lt;p&gt;Overnight the signal count grew from 6 to 21. A few notable developments:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Todd Blanche (KXNEXTAG-29-TBLA) is now at 377&lt;/strong&gt; -- up from 167 at launch. That's the strongest and most persistent signal on the board, strengthening across every hourly collection run since yesterday.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;New: Ghislaine Maxwell pardon market flagged at 346&lt;/strong&gt; &lt;br&gt;
(KXTRUMPPARDONS-29JAN21-GMAX) -- vol z-score of 24.6, the highest single &lt;br&gt;
reading pmwatch has produced. The price hasn't moved yet, which means large volume is accumulating without public market-moving information to explain it. Detected at 11:46 AM ET, June 4. Screenshot timestamped and saved.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Sean Combs pardon market also appeared&lt;/strong&gt; (KXTRUMPPARDONS-29JAN21-SCOM) &lt;br&gt;
as a lower-score signal in the same collection run.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;CPI cluster is now 4 signals strong&lt;/strong&gt; ahead of the June 10 BLS release -- multiple CPI strike markets flagging simultaneously with scores ranging from 99 to 267.&lt;/p&gt;

&lt;p&gt;To be clear: none of this proves anything. pmwatch flags unusual market &lt;br&gt;
activity. Whether any of it represents insider trading is for investigators with subpoena power to determine. But the Maxwell signal at 346 with a 24.6 z-score is the kind of reading this tool was designed to surface.&lt;/p&gt;

&lt;p&gt;The repo is live and the scheduler is running. Come back after the June 10 &lt;br&gt;
CPI release to see if the markets were right.&lt;/p&gt;


&lt;h2&gt;
  
  
  The Stack
&lt;/h2&gt;

&lt;p&gt;Nothing exotic here:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Python&lt;/strong&gt; for the collector and scorer&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;SQLite&lt;/strong&gt; for storage (plenty for this use case)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;APScheduler&lt;/strong&gt; to run collection every 60 minutes&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;FastAPI&lt;/strong&gt; for the API layer&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Plain HTML + JS&lt;/strong&gt; for the dashboard -- no build step, deploys anywhere&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The entire thing runs on the public Kalshi API with no authentication. Setup is three commands:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;git clone https://github.com/lweiss01/pmwatch.git
&lt;span class="nb"&gt;cd &lt;/span&gt;pmwatch
pip &lt;span class="nb"&gt;install&lt;/span&gt; &lt;span class="nt"&gt;-r&lt;/span&gt; requirements.txt
python scheduler.py
uvicorn api:app &lt;span class="nt"&gt;--port&lt;/span&gt; 8000
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;






&lt;h2&gt;
  
  
  What This Can and Can't Do
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Can detect:&lt;/strong&gt; unusual volume concentration before events, block trade anomalies, price divergence inconsistent with public information flow.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Can't do:&lt;/strong&gt; identify specific traders, prove intent, access order placement timestamps (only fill time is public), or monitor Polymarket (offshore, separate API).&lt;/p&gt;

&lt;p&gt;This is a tipping-point detector. It finds markets that look wrong. Kalshi and the CFTC have the identity layer to take it further.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why It Matters Right Now
&lt;/h2&gt;

&lt;p&gt;The CFTC issued an advisory on prediction market insider trading in February 2026. H.R. 7004, the Public Integrity in Financial Prediction Markets Act, was introduced in January after a trader made $400,000 on the Polymarket Maduro capture market right before the US military operation. The Senate passed a resolution in April preventing its own members from using prediction markets.&lt;/p&gt;

&lt;p&gt;The regulatory infrastructure is being built right now. Public monitoring tools that work from open data are part of that ecosystem -- the same role that STOCK Act trackers like Quiver Quantitative play for congressional stock trading, but for the prediction market era.&lt;/p&gt;

&lt;p&gt;GovGreed tracks 190,000 congressional stock trades and scores them with a 7-layer AI model. Nobody is doing that for prediction markets yet.&lt;/p&gt;

&lt;p&gt;pmwatch is a start.&lt;/p&gt;




&lt;h2&gt;
  
  
  What's Next
&lt;/h2&gt;

&lt;p&gt;The immediate roadmap:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Social media correlation -- cross-reference X post timestamps with trade anomaly windows&lt;/li&gt;
&lt;li&gt;Government calendar integration -- BLS release schedule, FOMC blackout periods&lt;/li&gt;
&lt;li&gt;Polymarket support&lt;/li&gt;
&lt;li&gt;Webhook alerts for high-score signals&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The repo is at &lt;strong&gt;&lt;a href="https://github.com/lweiss01/pmwatch" rel="noopener noreferrer"&gt;https://github.com/lweiss01/pmwatch&lt;/a&gt;&lt;/strong&gt;. Issues and PRs welcome. If you work in fintech, policy, or prediction markets and want to talk about where this goes, I'm easy to find.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Built June 3, 2026. The same day the Santos story broke. Sometimes timing is everything.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>python</category>
      <category>opensource</category>
      <category>webdev</category>
      <category>beginners</category>
    </item>
    <item>
      <title>I Thought My AI Agents Had a Memory Problem. They Didn't.</title>
      <dc:creator>lweiss01</dc:creator>
      <pubDate>Tue, 02 Jun 2026 17:15:00 +0000</pubDate>
      <link>https://dev.to/lweiss01/i-thought-my-ai-agents-had-a-memory-problem-they-didnt-5703</link>
      <guid>https://dev.to/lweiss01/i-thought-my-ai-agents-had-a-memory-problem-they-didnt-5703</guid>
      <description>&lt;p&gt;When I started building Holistic, I thought I was working on a memory problem.&lt;/p&gt;

&lt;p&gt;Like many people using AI coding agents, I kept running into the same frustration. A session would end, a new one would begin, and suddenly the project was paying a tax. Decisions had to be rediscovered. Prior attempts had to be reconstructed. The same explanations kept getting repeated because the next participant didn't seem to know what the previous one had already learned.&lt;/p&gt;

&lt;p&gt;The obvious explanation was memory. The agent forgot. So I started building checkpoints. Not transcripts. Not giant chat histories. Not larger context windows. I wanted durable project state: a record of what had changed, what had been tried, what had been decided, and what still needed to happen.&lt;/p&gt;

&lt;p&gt;That eventually became Holistic. At the time, I thought I was building an agent memory tool. Looking back, I think that description was wrong. Not completely wrong, but incomplete. The clue was a contradiction I couldn't explain.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Contradiction
&lt;/h2&gt;

&lt;p&gt;As I started using multiple agents more frequently, the memory explanation stopped fitting the evidence. Claude would work on a task, then Codex, then Cursor, and sometimes back to Claude again. Each transition seemed to produce the same failure mode. Context disappeared. Decisions became unclear. Progress slowed while the next participant figured out where things stood.&lt;/p&gt;

&lt;p&gt;I kept calling this memory loss until I realized that wasn't actually what I was observing. Claude hadn't forgotten anything. Codex hadn't forgotten anything. Cursor hadn't forgotten anything either.&lt;/p&gt;

&lt;p&gt;The next participant simply hadn't been there.&lt;/p&gt;

&lt;p&gt;That sounds like a small distinction, but it completely changed how I thought about the problem. The information wasn't being forgotten. It was failing to survive the handoff. Memory and handoff are not the same thing. One is about preserving knowledge inside a participant. The other is about preserving progress across participants.&lt;/p&gt;

&lt;p&gt;The more I looked, the more I found examples where memory wasn't the bottleneck at all. The issue appeared precisely when work crossed a boundary: from one agent to another, from one session to another, or from an agent back to a human. The failure wasn't forgetting. The failure was continuity. I just didn't have that word yet.&lt;/p&gt;

&lt;h2&gt;
  
  
  Following The Wrong Theory
&lt;/h2&gt;

&lt;p&gt;Once the memory explanation started breaking down, I moved to the next obvious theory. Maybe the problem wasn't memory. Maybe it was coordination. Agents needed better ways to communicate. Better handoffs. Better shared artifacts.&lt;/p&gt;

&lt;p&gt;That explanation got further because it described more of what I was seeing, but it still didn't explain everything. What caught my attention was that people kept arriving at remarkably similar solutions from completely different directions. Some were using shared files. Others relied on conversation logs. Some created structured handoff records. Others experimented with append-only histories. The technologies varied, but the pattern didn't.&lt;/p&gt;

&lt;p&gt;Nobody was arguing about whether the problem existed. They were arguing about how to solve it. That was an important clue, because it suggested I wasn't looking at a personal annoyance or a workflow quirk. I was looking at a category of problem that was beginning to emerge wherever people used multiple agents over time.&lt;/p&gt;

&lt;h2&gt;
  
  
  Then Andon Happened
&lt;/h2&gt;

&lt;p&gt;Around the same time, I started designing Andon. At first it felt unrelated. Holistic was about preserving state. Andon was about supervision. I wanted a way to answer operational questions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;What is my agent doing?&lt;/li&gt;
&lt;li&gt;Is it healthy?&lt;/li&gt;
&lt;li&gt;Is it stuck?&lt;/li&gt;
&lt;li&gt;Is it drifting?&lt;/li&gt;
&lt;li&gt;Do I need to intervene?&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The name came from Toyota's andon system, where workers can signal abnormal conditions before defects continue down the production line. I liked the metaphor immediately because it aligned with something I had spent years thinking about professionally: systems, quality, process improvement, and how work flows through organizations. But it wasn't until much later that I realized why the metaphor felt so natural.&lt;/p&gt;

&lt;p&gt;The interesting part of Toyota's andon system isn't the board, the lights, or even the cord itself. The interesting part is the assumption underneath it all: the work continues even when the workers change. Factories operate across shifts. Hospitals operate across shifts. Airlines operate across shifts. Large software systems operate across shifts. Participants come and go, but the work remains.&lt;/p&gt;

&lt;p&gt;Once I saw that, everything else started to click into place.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Work Must Survive The Worker
&lt;/h2&gt;

&lt;p&gt;That is the idea I had been circling all along. Not memory. Not coordination. Continuity. Every mature operational system eventually discovers the same requirement: work must survive participant replacement.&lt;/p&gt;

&lt;p&gt;That's why organizations create runbooks, status boards, shift reports, architecture records, incident logs, and operating procedures. Their purpose is not simply to help someone remember. Their purpose is to help the work continue when somebody else takes over.&lt;/p&gt;

&lt;p&gt;Seen through that lens, Holistic and Andon stopped looking like separate ideas. Holistic preserves continuity. Andon protects continuity. Same problem, different layer. &lt;/p&gt;

&lt;p&gt;Holistic answers the question:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;What happened before?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Andon answers the question:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;What is happening now?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Both exist because the work cannot depend on any single participant, whether that participant is a human, Claude, Codex, Cursor, or whatever comes next. The project persists. The participants do not.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Thesis Revealed Itself
&lt;/h2&gt;

&lt;p&gt;For a long time I thought I was building tools. Only later did I realize the tools were pulling me toward a theory: the project persists, the participants do not. Once that becomes clear, the question changes. It is no longer "How do we make agents remember?" It becomes "How does the work continue when the worker changes?"&lt;/p&gt;

&lt;p&gt;That's a different category of problem. It's not memory, it's continuity. And eventually the thesis condensed into two sentences:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Memory belongs to participants.&lt;/p&gt;

&lt;p&gt;Continuity belongs to projects.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;I didn't start with that idea. I backed into it through a series of observations that refused to fit my original explanation. Checkpoint by checkpoint, handoff by handoff, and tool by tool, the same pattern kept reappearing until it was impossible to ignore.&lt;/p&gt;

&lt;h2&gt;
  
  
  Looking Ahead
&lt;/h2&gt;

&lt;p&gt;The future of AI development is probably not one agent with perfect recall. It's many agents, many sessions, many models, many handoffs. Claude this morning. Codex this afternoon. Cursor tonight. A human reviewer tomorrow. A completely different model six months from now.&lt;/p&gt;

&lt;p&gt;When I started building Holistic, I thought I was building an agent memory tool. When I started building Andon, I thought I was building a supervision system. Looking back, both were pointing at the same thing. The tools came first. The thesis took longer to reveal itself. But I think I've finally found the idea I was actually building toward:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;The work must survive the worker.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;p&gt;Holistic is open source if you want to see how the handoff layer works in practice: &lt;a href="https://github.com/lweiss01/holistic" rel="noopener noreferrer"&gt;github.com/lweiss01/holistic&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>programming</category>
      <category>architecture</category>
      <category>discuss</category>
    </item>
    <item>
      <title>The Agent-to-Agent Continuity Gap Nobody Is Talking About</title>
      <dc:creator>lweiss01</dc:creator>
      <pubDate>Wed, 27 May 2026 13:55:44 +0000</pubDate>
      <link>https://dev.to/lweiss01/the-agent-to-agent-continuity-gap-nobody-is-talking-about-2bgh</link>
      <guid>https://dev.to/lweiss01/the-agent-to-agent-continuity-gap-nobody-is-talking-about-2bgh</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR:&lt;/strong&gt; Most AI agent memory discussions still assume one agent talking to itself across sessions. But real coding workflows already involve Claude, Codex, Cursor, and Gemini touching the same repo in the same week. The hard problem is not "how does an agent remember." It is "how do multiple agents stay coordinated on the same project without stepping on each other." That problem does not live inside any one agent. It lives in the repo.&lt;/p&gt;




&lt;p&gt;I wrote a post last week arguing that AI coding agent memory belongs in the repository, not the chat window. Checkpoints, not transcripts.&lt;/p&gt;

&lt;p&gt;Sitting with that argument for a few days, I realized it is actually downstream of a bigger one I had not made explicitly yet. The checkpoint primitive only matters because of a problem the current agent stack does not have a name for.&lt;/p&gt;

&lt;p&gt;So here it is.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Industry Map Has A Blind Spot
&lt;/h2&gt;

&lt;p&gt;There is a really good 2026 agent stack map going around right now from Paolo Perrone. Six layers. Models, protocols, memory, frameworks, eval, guardrails. It is a useful map.&lt;/p&gt;

&lt;p&gt;But read the memory layer carefully and you notice something.&lt;/p&gt;

&lt;p&gt;Every memory tier on that map assumes one agent.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;In-context state lives inside one agent's context window&lt;/li&gt;
&lt;li&gt;Vector retrieval lives inside one agent's RAG pipeline&lt;/li&gt;
&lt;li&gt;Persistent memory services like Letta, Zep, and Mem0 are designed for one agent learning across sessions&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is a real problem and worth solving. But it is not the problem most coding workflows actually have.&lt;/p&gt;

&lt;h2&gt;
  
  
  Most Coding Workflows Already Use Multiple Agents
&lt;/h2&gt;

&lt;p&gt;Look at how anyone serious is shipping code right now.&lt;/p&gt;

&lt;p&gt;Claude for architecture and review.&lt;br&gt;
Codex for implementation.&lt;br&gt;
Cursor for inline iteration.&lt;br&gt;
Gemini for exploration.&lt;br&gt;
A human approving and editing all of it.&lt;/p&gt;

&lt;p&gt;That is not a future scenario. That is a Tuesday.&lt;/p&gt;

&lt;p&gt;And every single one of those agents has its own context window, its own session, its own memory, its own opinions about the codebase. None of them know what the other ones did an hour ago.&lt;/p&gt;

&lt;p&gt;The continuity problem is not "Claude forgot what we discussed yesterday."&lt;/p&gt;

&lt;p&gt;The continuity problem is "Claude does not know what Codex implemented this morning, Cursor reverted half of it at lunch, and the human merged something different from a different branch."&lt;/p&gt;

&lt;p&gt;That is a coordination problem dressed up as a memory problem.&lt;/p&gt;

&lt;h2&gt;
  
  
  Agent-to-Agent Memory Does Not Exist Yet
&lt;/h2&gt;

&lt;p&gt;Perrone's map notes this honestly. MCP standardized how agents call tools. It says nothing about how agents talk to each other. IBM has ACP. Google has A2A. Neither is a standard. Neither is widely adopted. Neither solves the coding workflow case.&lt;/p&gt;

&lt;p&gt;So in practice, every team running a multi-agent coding workflow is solving this themselves. Usually badly. Usually by re-explaining context to every new session by hand.&lt;/p&gt;

&lt;p&gt;The dedicated memory vendors do not solve this either, because they are designed to give one agent a longer memory. Plugging Cursor and Claude Code into the same Mem0 instance and hoping they coordinate is not a thing that works today.&lt;/p&gt;

&lt;p&gt;Memory infrastructure is single-agent infrastructure. The multi-agent coordination layer is missing.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Repo Is The Only Shared Surface
&lt;/h2&gt;

&lt;p&gt;Here is the thing that kept hitting me.&lt;/p&gt;

&lt;p&gt;When Claude, Codex, Cursor, and Gemini are all working on the same project, there is exactly one piece of infrastructure all of them already see.&lt;/p&gt;

&lt;p&gt;The repository.&lt;/p&gt;

&lt;p&gt;They all read it. They all write to it. They all already have file system access through MCP or equivalent. Git already tracks who changed what and when.&lt;/p&gt;

&lt;p&gt;The repo is the shared substrate. It is the only shared substrate. Everything else is per-agent.&lt;/p&gt;

&lt;p&gt;So if you want continuity across agents, the continuity artifacts have to live in the repo. Not in a vector database that one agent is plugged into. Not in a hosted memory service that another agent does not know about. In the repo. In files. Versioned. Auditable. Diffable. Visible to every agent that can read the file system.&lt;/p&gt;

&lt;p&gt;That is what makes checkpoints the right primitive. Not because vector search is bad. Vector search is great for what it does. But you cannot retrieve from a vector store that the next agent has never heard of.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Reframe
&lt;/h2&gt;

&lt;p&gt;When you stop framing the problem as "agent memory" and start framing it as "multi-agent coordination on a shared artifact," a lot of the tooling debates collapse.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Bigger context windows do not help. The next agent has a different context window.&lt;/li&gt;
&lt;li&gt;Better RAG does not help. The next agent has a different RAG pipeline, or no RAG at all.&lt;/li&gt;
&lt;li&gt;Hosted memory services do not help unless every agent in your workflow is plugged into the same one, which they are not.&lt;/li&gt;
&lt;li&gt;Transcripts do not help, because they are noise and the next agent does not have your transcript.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;What does help is a small, structured, versioned record of what was decided, what is in progress, what is at risk, and what the next agent should pick up. Sitting in the repo. Where everyone can see it.&lt;/p&gt;

&lt;p&gt;That is the continuity primitive the current stack map does not have a slot for.&lt;/p&gt;

&lt;h2&gt;
  
  
  What I Am Building
&lt;/h2&gt;

&lt;p&gt;Holistic is an open source CLI exploring this idea. Repo-native checkpoints. Agent-agnostic. No vendor account, no hosted service, no per-agent SDK. Just files in your repo that any agent can read and any agent can update.&lt;/p&gt;

&lt;p&gt;It is still early. The thesis is what I am most interested in pressure testing right now.&lt;/p&gt;

&lt;p&gt;If you are running a multi-agent coding workflow and you have your own answer to the coordination problem, I want to hear it. If you think I am wrong about the repo being the right substrate, I really want to hear that.&lt;/p&gt;

&lt;p&gt;Repo: &lt;a href="https://github.com/lweiss01/holistic" rel="noopener noreferrer"&gt;https://github.com/lweiss01/holistic&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The repo remembers, not the window. And no single agent remembers for the others.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>webdev</category>
      <category>programming</category>
      <category>productivity</category>
    </item>
    <item>
      <title>Checkpoints, Not Transcripts: Rethinking AI Coding Agent Memory</title>
      <dc:creator>lweiss01</dc:creator>
      <pubDate>Sun, 24 May 2026 18:40:21 +0000</pubDate>
      <link>https://dev.to/lweiss01/checkpoints-not-transcripts-rethinking-ai-coding-agent-memory-21pj</link>
      <guid>https://dev.to/lweiss01/checkpoints-not-transcripts-rethinking-ai-coding-agent-memory-21pj</guid>
      <description>&lt;p&gt;&lt;strong&gt;TL;DR:&lt;/strong&gt; AI coding agent memory should live in the repository, not the chat window. Bigger context windows and vector databases are solving the wrong problem. Here is the case for treating the repo itself as the durable cognitive surface.&lt;/p&gt;




&lt;p&gt;Everyone is trying to solve AI agent memory right now.&lt;/p&gt;

&lt;p&gt;Longer context windows.&lt;br&gt;
Vector databases.&lt;br&gt;
Conversation replay.&lt;br&gt;
Semantic retrieval.&lt;br&gt;
Infinite transcripts.&lt;/p&gt;

&lt;p&gt;But after spending months building workflows across Claude, Codex, Gemini, Cursor, and other coding agents, I've started to think we may be treating the wrong thing as the source of truth.&lt;/p&gt;

&lt;p&gt;The problem is not:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;"How do we make the model remember everything forever?"&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;The problem is:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;"How does a software project remain cognitively coherent across sessions, compaction, agent switches, and time?"&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Those are very different problems.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Context Window Is Not Durable Infrastructure
&lt;/h2&gt;

&lt;p&gt;Modern AI coding workflows are surprisingly fragile.&lt;/p&gt;

&lt;p&gt;An agent works for hours. The context window fills up. Compaction happens. Then suddenly:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;architectural reasoning disappears&lt;/li&gt;
&lt;li&gt;unresolved work gets forgotten&lt;/li&gt;
&lt;li&gt;regressions come back&lt;/li&gt;
&lt;li&gt;agents undo each other&lt;/li&gt;
&lt;li&gt;humans re-explain the same context repeatedly&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The industry response so far has mostly been: store more. Bigger context windows, vector databases, hosted memory services, semantic retrieval over giant transcripts.&lt;/p&gt;

&lt;p&gt;But transcripts are not understanding.&lt;/p&gt;

&lt;p&gt;And replaying giant chat histories is not the same thing as preserving operational continuity.&lt;/p&gt;

&lt;p&gt;In practice, most coding workflows do not fail because information disappeared entirely. They fail because the important state was never extracted from the conversation in the first place.&lt;/p&gt;

&lt;h2&gt;
  
  
  Checkpoints, Not Transcripts
&lt;/h2&gt;

&lt;p&gt;The idea I have been exploring is pretty simple:&lt;/p&gt;

&lt;p&gt;Instead of preserving entire conversations forever, preserve structured checkpoints at meaningful moments.&lt;/p&gt;

&lt;p&gt;Not:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;every token&lt;/li&gt;
&lt;li&gt;every thought&lt;/li&gt;
&lt;li&gt;every conversational detour&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But the things that actually matter:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;current state&lt;/li&gt;
&lt;li&gt;architectural decisions&lt;/li&gt;
&lt;li&gt;unresolved threads&lt;/li&gt;
&lt;li&gt;regression risks&lt;/li&gt;
&lt;li&gt;next recommended actions&lt;/li&gt;
&lt;li&gt;implementation reasoning&lt;/li&gt;
&lt;li&gt;handoff context&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The checkpoint becomes the durable source of truth.&lt;/p&gt;

&lt;p&gt;The live context window becomes disposable working memory.&lt;/p&gt;

&lt;p&gt;That distinction changes a lot.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Repo Should Remember
&lt;/h2&gt;

&lt;p&gt;One realization that kept hitting me while working across multiple coding agents:&lt;/p&gt;

&lt;p&gt;The repository itself is the only thing that actually persists.&lt;/p&gt;

&lt;p&gt;Agents change.&lt;br&gt;
Models change.&lt;br&gt;
Sessions end.&lt;br&gt;
Windows compact.&lt;/p&gt;

&lt;p&gt;But the repo stays.&lt;/p&gt;

&lt;p&gt;So instead of treating continuity as something trapped inside a chat session, I started treating continuity as a repo-native concern.&lt;/p&gt;

&lt;p&gt;That means:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;continuity artifacts live in the repo&lt;/li&gt;
&lt;li&gt;handoffs live in the repo&lt;/li&gt;
&lt;li&gt;operational state lives in the repo&lt;/li&gt;
&lt;li&gt;regression memory lives in the repo&lt;/li&gt;
&lt;li&gt;checkpoints live in the repo&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The repo remembers, not the window.&lt;/p&gt;

&lt;h2&gt;
  
  
  Multi-Agent Development Is Already Here
&lt;/h2&gt;

&lt;p&gt;A lot of tooling still assumes:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;one human, one agent, one session.&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;That is not how many people are actually working anymore.&lt;/p&gt;

&lt;p&gt;Real workflows increasingly look like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Claude for architecture&lt;/li&gt;
&lt;li&gt;Codex for implementation&lt;/li&gt;
&lt;li&gt;Cursor for iteration&lt;/li&gt;
&lt;li&gt;Gemini for exploration&lt;/li&gt;
&lt;li&gt;a human reviewing all of it&lt;/li&gt;
&lt;li&gt;another session tomorrow continuing the work&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Continuity is no longer just memory. It is coordination across interchangeable execution surfaces. And once you frame it that way, the chat window stops looking like the right place to store anything important.&lt;/p&gt;

&lt;h2&gt;
  
  
  AI Agents Are Temporary. Repositories Persist.
&lt;/h2&gt;

&lt;p&gt;I think we are entering a phase where software repositories themselves become cognitive systems:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;accumulating decisions&lt;/li&gt;
&lt;li&gt;preserving continuity&lt;/li&gt;
&lt;li&gt;coordinating work&lt;/li&gt;
&lt;li&gt;surviving agent turnover&lt;/li&gt;
&lt;li&gt;carrying operational memory forward over time&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Not because the models became infinitely smart.&lt;/p&gt;

&lt;p&gt;But because the continuity stopped depending entirely on the model session.&lt;/p&gt;

&lt;p&gt;That is the direction I have been exploring with Holistic, an open-source CLI for repo-native continuity across agents: &lt;a href="https://github.com/lweiss01/holistic" rel="noopener noreferrer"&gt;https://github.com/lweiss01/holistic&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Still early. Still evolving quickly. If you are working across multiple coding agents and running into the continuity problem, I would genuinely love feedback, critiques, or just a conversation about how you are solving it.&lt;/p&gt;

&lt;p&gt;The repo remembers, not the window.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>productivity</category>
      <category>tooling</category>
    </item>
  </channel>
</rss>
