<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Daniel Feldman</title>
    <description>The latest articles on DEV Community by Daniel Feldman (@daniel_loxia).</description>
    <link>https://dev.to/daniel_loxia</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3937531%2Fe22d69a7-5520-4f4e-8798-08c52693cb3f.png</url>
      <title>DEV Community: Daniel Feldman</title>
      <link>https://dev.to/daniel_loxia</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/daniel_loxia"/>
    <language>en</language>
    <item>
      <title>AI agents fail because they forget</title>
      <dc:creator>Daniel Feldman</dc:creator>
      <pubDate>Mon, 18 May 2026 08:43:03 +0000</pubDate>
      <link>https://dev.to/daniel_loxia/ai-agents-fail-because-they-forget-3ahm</link>
      <guid>https://dev.to/daniel_loxia/ai-agents-fail-because-they-forget-3ahm</guid>
      <description>&lt;p&gt;If you work with agents every day, you already know the pattern.&lt;/p&gt;

&lt;p&gt;The first 20 minutes feel magical.&lt;/p&gt;

&lt;p&gt;Then:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;context starts drifting&lt;/li&gt;
&lt;li&gt;requirements get forgotten&lt;/li&gt;
&lt;li&gt;architecture decisions disappear&lt;/li&gt;
&lt;li&gt;agents repeat mistakes&lt;/li&gt;
&lt;li&gt;long runs slowly lose coherence&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The problem is not model quality anymore.&lt;/p&gt;

&lt;p&gt;The problem is memory architecture.&lt;/p&gt;

&lt;p&gt;Most agent systems today are still built around temporary conversations instead of operational memory.&lt;/p&gt;

&lt;p&gt;At OnBuzz we've been exploring what happens when you treat agents more like long-running workers inside a system instead of isolated chats.&lt;/p&gt;

&lt;h1&gt;
  
  
  Memory needs to behave more like humans do
&lt;/h1&gt;

&lt;p&gt;One thing became obvious very quickly:&lt;/p&gt;

&lt;p&gt;Not all memory should behave the same.&lt;/p&gt;

&lt;p&gt;So we built layered memory systems:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;short-term memory for active execution&lt;/li&gt;
&lt;li&gt;long-term memory for persistent knowledge&lt;/li&gt;
&lt;li&gt;event-based memory connected to actions, tasks, conversations, and decisions&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That changes retrieval quality dramatically.&lt;/p&gt;

&lt;p&gt;Instead of dumping giant context windows into prompts, agents can retrieve relevant operational context when needed.&lt;/p&gt;

&lt;h1&gt;
  
  
  Recap loops became critical
&lt;/h1&gt;

&lt;p&gt;Long-running agents drift.&lt;/p&gt;

&lt;p&gt;Even the best models do.&lt;/p&gt;

&lt;p&gt;So we implemented internal recap mechanisms that continuously reconnect agents to:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;the original objective&lt;/li&gt;
&lt;li&gt;active constraints&lt;/li&gt;
&lt;li&gt;completed work&lt;/li&gt;
&lt;li&gt;unresolved blockers&lt;/li&gt;
&lt;li&gt;execution history&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This massively improves stability during long conversations and autonomous runs.&lt;/p&gt;

&lt;h1&gt;
  
  
  Full local interaction history
&lt;/h1&gt;

&lt;p&gt;Another thing we wanted badly as developers:&lt;/p&gt;

&lt;p&gt;Full access to interaction history locally.&lt;/p&gt;

&lt;p&gt;Not just temporary chats.&lt;/p&gt;

&lt;p&gt;Agents can retrieve context dynamically based on operational needs instead of relying purely on bloated prompts.&lt;/p&gt;

&lt;p&gt;That changes debugging and long-running workflows completely.&lt;/p&gt;

&lt;h1&gt;
  
  
  Real multi-agent orchestration
&lt;/h1&gt;

&lt;p&gt;Not "multiple tabs."&lt;/p&gt;

&lt;p&gt;Actual orchestration:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;teams&lt;/li&gt;
&lt;li&gt;roles&lt;/li&gt;
&lt;li&gt;task delegation&lt;/li&gt;
&lt;li&gt;context sharing&lt;/li&gt;
&lt;li&gt;work transfer between agents&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Different agents handling:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;architecture&lt;/li&gt;
&lt;li&gt;implementation&lt;/li&gt;
&lt;li&gt;validation&lt;/li&gt;
&lt;li&gt;reviews&lt;/li&gt;
&lt;li&gt;research&lt;/li&gt;
&lt;li&gt;execution&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In parallel.&lt;/p&gt;

&lt;h1&gt;
  
  
  Goal-driven autonomous execution
&lt;/h1&gt;

&lt;p&gt;Agents can operate autonomously toward objectives with continuity between runs.&lt;/p&gt;

&lt;p&gt;Not just one-shot prompts.&lt;/p&gt;

&lt;p&gt;Combined with scheduling, agents can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;create recurring workflows&lt;/li&gt;
&lt;li&gt;schedule tasks for themselves&lt;/li&gt;
&lt;li&gt;schedule tasks for other agents&lt;/li&gt;
&lt;li&gt;maintain operational continuity over time&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;(fully configurable, not enabled by default)&lt;/p&gt;

&lt;h1&gt;
  
  
  Manager agents and dynamic agent creation
&lt;/h1&gt;

&lt;p&gt;One of the more interesting things we're exploring right now:&lt;/p&gt;

&lt;p&gt;Manager agents that supervise other agents.&lt;/p&gt;

&lt;p&gt;They can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;coordinate execution&lt;/li&gt;
&lt;li&gt;monitor progress&lt;/li&gt;
&lt;li&gt;delegate work&lt;/li&gt;
&lt;li&gt;create specialized agents dynamically during runtime based on goals/tasks&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;We’re also experimenting with reusable skills that agents can generate and import into the system dynamically.&lt;/p&gt;

&lt;p&gt;The workflow starts feeling much closer to coordinating engineering systems than manually operating chats.&lt;/p&gt;

&lt;h1&gt;
  
  
  CLI-first + local-first
&lt;/h1&gt;

&lt;p&gt;We wanted the system to feel native to developers.&lt;/p&gt;

&lt;p&gt;CLI-first.&lt;br&gt;
Local-first.&lt;br&gt;
Full control.&lt;/p&gt;

&lt;p&gt;Your data stays with you.&lt;/p&gt;

&lt;h1&gt;
  
  
  Why this matters
&lt;/h1&gt;

&lt;p&gt;The biggest productivity gain is not:&lt;br&gt;
"AI writes code faster."&lt;/p&gt;

&lt;p&gt;It's reducing operational overhead.&lt;/p&gt;

&lt;p&gt;Less:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;rebuilding context&lt;/li&gt;
&lt;li&gt;repeating instructions&lt;/li&gt;
&lt;li&gt;manually coordinating workflows&lt;/li&gt;
&lt;li&gt;re-checking obvious things&lt;/li&gt;
&lt;li&gt;fighting context collapse&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;More:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;architectural thinking&lt;/li&gt;
&lt;li&gt;execution leverage&lt;/li&gt;
&lt;li&gt;parallel workflows&lt;/li&gt;
&lt;li&gt;shipping systems faster&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The highest leverage developers are usually not the fastest typers.&lt;/p&gt;

&lt;p&gt;They're the ones who can coordinate complexity efficiently.&lt;/p&gt;

&lt;p&gt;That's the workflow shift we're interested in exploring.&lt;/p&gt;

&lt;p&gt;If this space interests you, we'd genuinely love feedback from people building with agents daily.&lt;/p&gt;

&lt;p&gt;⭐ on the repo helps a lot:&lt;br&gt;
&lt;a href="https://github.com/Loxia-ai/onbuzz-community" rel="noopener noreferrer"&gt;https://github.com/Loxia-ai/onbuzz-community&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;We're also opening a contributor core with bounties and cloud credits for the full platform.&lt;/p&gt;

</description>
      <category>opensource</category>
      <category>ai</category>
      <category>programming</category>
      <category>discuss</category>
    </item>
  </channel>
</rss>
