Ahmed Rakan
Fixing AI Amnesia: MeridianDB’s Answer to Costly Training and Memory Loss

Picture this: You've spent months fine-tuning your AI agent to handle customer support brilliantly. Then you add new material to its RAG storage to cover new products. Suddenly, it starts giving terrible advice about your original product. Sound familiar?

This isn't a bug; it's a fundamental limitation called catastrophic forgetting. When AI systems learn new information, they can overwrite what they learned before. It's like having an employee who gets perfect amnesia every time they attend a training session.

For businesses deploying AI agents, this creates a nightmare scenario:

  • Retraining costs that can reach millions of dollars.
  • Agents that lose critical domain knowledge overnight, or even within a single response.
  • Inability to deploy truly adaptive AI systems.
  • Constant fear that updates will break existing functionalities.

The stakes are pretty high. As AI agents become more sophisticated and handle more critical business functions, we can't afford systems that forget everything they have learned (or everything we think they learned).

We first need a way to visualize the performance of our AI agents and their forgetting instances, governed by a human in the loop. We also need to enable memory sharing between organizations and their AI agents.

Why Traditional Solutions Fall Short

Most attempts to solve this problem focus on model architecture tweaks, fancy training techniques, or even innovations in the model parameters. But here is the thing: the problem isn't just in the model; it's also in how the AI agent stores knowledge, retrieves it, and acts upon it.

Current approaches include:

  • Vector Databases: Great for semantic similarity, but they're essentially sophisticated keyword search: objects are represented as high-dimensional points and ranked by calculated distances such as cosine similarity.
  • Knowledge Graphs: Excellent for structured relationships but terrible at handling the nuanced, embedding-rich information that modern AI agents need.

Diagram: a combined vector-graph system, designed around Cloudflare D1 and Vectorize.

Better, but they still treat memory as a static storage problem rather than a dynamic learning challenge. None of these addresses the core issue: AI agents need a memory system that learns continuously, and that enables collaboration and knowledge sharing between multiple agents, without forgetting what they already know.

Learning from Biology: The Brain's Solution

Here is where it gets interesting. The human brain solved this exact problem millions of years ago through something called complementary learning systems.

  1. Hippocampus: Quickly stores new experiences as episodic memories.
  2. Neocortex: Slowly integrates these experiences into long-term structured knowledge.
  3. Replay Mechanism: During sleep, the hippocampus replays experiences to the neocortex for consolidation.

This creates a perfect balance in the human brain for continuous learning. In machine learning, continual learning is still an open research topic; any solution must strike the same balance between plasticity and stability that the brain does:

  1. Plasticity: You can rapidly learn new information.
  2. Stability: New learning doesn't erase old memories.

What if we could build an AI memory system that at least tries to do this?

Introducing MeridianDB: A Memory System That Helps AI Agents Continuously Learn (Prototype Draft Design)

There are two ways to build such a federated database: use a high-performance language like Rust, C, or even Go and build everything from scratch, or lean on managed infrastructure. Since I am just a curious person who loves to build, I went with the Cloudflare stack: Node.js, Hono, Cloudflare Workers, D1, Vectorize, KV, other Cloudflare services, and React (for the dashboard). This lets me ship the prototype faster, get traction, and gather feedback to improve the design.

The stack has limitations (for example, D1 caps each database at 10 GB of storage), but it can still handle large-scale traffic: we can give each agent its own D1 database and share knowledge between agents algorithmically. Without further ado, let's dive into the draft architecture.

Dual Memory Architecture

Instead of just vector- or graph-based storage, we will create a vector-graph database by combining the two:

  1. Vectorize: Rapidly stores new experiences as high-dimensional embeddings.
  2. D1: Slowly builds a structured relational knowledge graph between every entity the AI agents embed.
  3. KV: Caches frequently accessed entities; also covers sessions, analytics, and other use cases.
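To make the division of labor concrete, here is a minimal in-memory sketch of the dual write path. The Cloudflare bindings (Vectorize, D1, KV) are mocked with plain data structures, and names like `MemoryRecord` and `DualMemory` are illustrative, not MeridianDB's actual API:

```typescript
interface MemoryRecord {
  id: string;
  embedding: number[]; // fast episodic store (Vectorize's role)
  entities: string[];  // nodes to link in the knowledge graph (D1's role)
}

class DualMemory {
  private vectors = new Map<string, number[]>();   // stands in for Vectorize
  private graph = new Map<string, Set<string>>();  // stands in for the D1 property graph
  private cache = new Map<string, MemoryRecord>(); // stands in for KV

  write(rec: MemoryRecord): void {
    // 1. Fast path: store the embedding immediately (hippocampus-like).
    this.vectors.set(rec.id, rec.embedding);
    // 2. Slow path: link every co-occurring entity pair (neocortex-like).
    for (const a of rec.entities) {
      if (!this.graph.has(a)) this.graph.set(a, new Set());
      for (const b of rec.entities) {
        if (a !== b) this.graph.get(a)!.add(b);
      }
    }
    // 3. Keep the full record hot for frequent access.
    this.cache.set(rec.id, rec);
  }

  neighbors(entity: string): string[] {
    return [...(this.graph.get(entity) ?? [])];
  }
}

const mem = new DualMemory();
mem.write({ id: "m1", embedding: [0.1, 0.9], entities: ["refund", "order"] });
```

In the real system each of these stores would be a separate Cloudflare service call rather than a map, but the write fan-out is the same idea.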

Smart Memory Consolidation

Instead of just storing memories, MeridianDB actively consolidates them:

  1. Identifies important experiences worth preserving.
  2. Clusters similar memories to avoid redundancy.
  3. Updates relationships based on new information.
  4. Prunes outdated or conflicting knowledge.
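Step 2 above (clustering similar memories to avoid redundancy) can be sketched as a greedy near-duplicate merge. The cosine threshold and the one-pass clustering below are illustrative choices for the sketch, not MeridianDB's actual consolidation algorithm:

```typescript
// Cosine similarity between two embeddings of equal length.
function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

// Keep a memory only if it is not a near-duplicate of one already kept.
function consolidate(memories: number[][], threshold = 0.95): number[][] {
  const kept: number[][] = [];
  for (const m of memories) {
    if (!kept.some(k => cosine(k, m) >= threshold)) kept.push(m);
  }
  return kept;
}

const consolidated = consolidate([
  [1, 0], [0.99, 0.01], // near-duplicates, merged into one representative
  [0, 1],               // distinct, kept
]);
```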

Stability-Plasticity Control

A built-in governor system that:

  • Determines when to prioritize new learning vs. preserving old knowledge.
  • Applies different learning rates based on importance and novelty.
  • Prevents catastrophic forgetting through intelligent regularization.
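One way to picture the governor: map novelty and importance to a write-time learning rate, so novel and important experiences get near-full plasticity while familiar ones are written gently and cannot overwrite stable knowledge. The blend below is a hedged heuristic sketch, with illustrative parameter names:

```typescript
// novelty and importance are assumed normalized to [0, 1]; higher means
// the experience deserves more plastic (aggressive) learning.
function learningRate(
  novelty: number,
  importance: number,
  base = 1.0,   // maximum plasticity for novel, important experiences
  floor = 0.05, // minimum rate, preserving stability for familiar inputs
): number {
  const plasticity = novelty * importance;
  return floor + (base - floor) * plasticity;
}

// A brand-new, business-critical fact learns fast...
const fast = learningRate(1.0, 1.0); // ≈ 1.0
// ...while a familiar fact barely perturbs existing knowledge.
const slow = learningRate(0.0, 1.0); // ≈ 0.05
```

In a fuller design this rate would feed a regularization term (in the spirit of elastic weight consolidation) rather than a raw write weight.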

Beyond Simple Similarity: Contextual Intelligence

Traditional databases find information based on similarity: distance calculations under some mathematical model. MeridianDB finds information based on relevance (see Diagram 5). When your AI agent queries for information, MeridianDB considers:

  • Semantic similarity: What the query means
  • Temporal context: When information was learned and how recent it is
  • Environmental context: What situation the agent is in
  • Behavioral utility: What information has been useful in similar situations
  • Graph relationships: How concepts connect to each other through agent experience and human-in-the-loop refining

This multi-dimensional approach means agents get better answers, not just similar ones.
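The signals above can be combined as a weighted relevance score. The weights and the 30-day exponential recency decay below are illustrative assumptions for the sketch; a real deployment would tune them:

```typescript
interface Candidate {
  similarity: number;   // semantic similarity to the query, in [0, 1]
  ageDays: number;      // how long ago the memory was learned
  contextMatch: number; // environmental-context overlap, in [0, 1]
  utility: number;      // historical usefulness in similar situations, in [0, 1]
}

function relevance(
  c: Candidate,
  w = { sim: 0.4, rec: 0.2, ctx: 0.2, util: 0.2 }, // assumed weights
): number {
  // Exponential decay: a memory 30 days old keeps ~37% of its recency signal.
  const recency = Math.exp(-c.ageDays / 30);
  return w.sim * c.similarity + w.rec * recency +
         w.ctx * c.contextMatch + w.util * c.utility;
}

// A moderately similar but fresh, contextual, proven-useful memory...
const fresh = relevance({ similarity: 0.9, ageDays: 0, contextMatch: 0.8, utility: 0.7 });
// ...outranks a slightly more similar but stale, context-free one.
const stale = relevance({ similarity: 0.95, ageDays: 365, contextMatch: 0, utility: 0 });
```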

Multi-Agent Collaboration Without Chaos

Here's where it gets really powerful. MeridianDB enables multiple AI agents to share knowledge without stepping on each other's toes (see Diagram 6).

Key capabilities:

  • Collaborative Learning: Agents can learn from each other's experiences.
  • Selective Sharing: Organizations control what knowledge gets shared between organizations and between agents through fine-grained access control.
  • Cross-Agent Transfer: Successful strategies spread automatically.
  • Conflict Resolution: The system handles contradictory information gracefully.
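Selective sharing can be pictured as a policy function evaluated per (owner, consumer, topic) before any memory crosses an agent or organization boundary. The policy shape and names below are illustrative, not MeridianDB's actual access-control model:

```typescript
interface SharedMemory {
  owner: string;   // the agent that learned this memory
  topic: string;   // coarse label used by the sharing policy
  content: string;
}

// A policy decides whether a consumer agent may read an owner's memory.
type Policy = (owner: string, consumer: string, topic: string) => boolean;

function visibleMemories(
  all: SharedMemory[],
  consumer: string,
  policy: Policy,
): SharedMemory[] {
  // An agent always sees its own memories; others pass through the policy.
  return all.filter(m => m.owner === consumer || policy(m.owner, consumer, m.topic));
}

// Example policy: only "pricing" knowledge crosses agent boundaries.
const policy: Policy = (_owner, _consumer, topic) => topic === "pricing";

const pool: SharedMemory[] = [
  { owner: "support", topic: "pricing", content: "Plan B discount rules" },
  { owner: "support", topic: "pii", content: "customer contact details" },
];
const salesView = visibleMemories(pool, "sales", policy);
```

This is the mechanism by which a sales agent could safely learn from support interactions without ever seeing sensitive material.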

Imagine having a sales agent learn from customer support interactions or a product recommendation engine that improves based on actual purchase patterns, all happening automatically and safely.

Built for Scale on Cloudflare's Edge

MeridianDB leverages Cloudflare's global infrastructure to deliver:

Technical Stack:

  • D1: Relational metadata & property graph storage
  • Vectorize: High-dimensional embeddings with sub-100 ms retrieval
  • KV: Session management and real-time analytics
  • R2: Artifact storage and backup systems
  • Workers: Edge compute for global low-latency access

Performance Optimizations:

  • Embedding quantization for faster retrieval
  • Caching of frequently accessed entities
  • Lazy graph loading to minimize memory overhead
  • Batch vector operations for efficient updates
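As one concrete example of the optimizations above, embedding quantization can be sketched as symmetric per-vector int8 scaling: floats are mapped to 8-bit integers around the vector's own maximum, trading a little precision for 4x smaller storage and faster scans. The symmetric scheme is an illustrative choice:

```typescript
// Quantize a float vector to int8 with a per-vector scale factor.
function quantize(v: number[]): { q: Int8Array; scale: number } {
  const maxAbs = Math.max(...v.map(Math.abs), 1e-12); // avoid divide-by-zero
  const scale = maxAbs / 127;                         // map [-maxAbs, maxAbs] to [-127, 127]
  const q = Int8Array.from(v, x => Math.round(x / scale));
  return { q, scale };
}

// Recover an approximation of the original floats.
function dequantize(q: Int8Array, scale: number): number[] {
  return Array.from(q, x => x * scale);
}

const { q, scale } = quantize([0.12, -0.5, 0.31]);
const restored = dequantize(q, scale); // each value within ~scale/2 of the original
```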

This architecture delivers enterprise-grade performance: <500 ms for simple queries, <2000 ms for complex multi-hop reasoning.

Measuring What Matters: Forgetting Analytics

MeridianDB includes built-in analytics to track what traditional systems ignore:

  • Backward Transfer (BWT): How much new learning impacts old knowledge
  • Forgetting Curves: Knowledge decay rates over time
  • Transfer Efficiency: How effectively agents share knowledge between each other
  • Computational Cost: Storage and query overhead tracking
  • Stability Ratios: Balance between learning and retention

These metrics let you optimize your AI agents' learning patterns and catch problems before they impact production.
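Backward Transfer, the first metric in the list, has a standard definition in the continual-learning literature: for each earlier task, compare accuracy after all training with accuracy measured right after that task was learned; a negative average means forgetting. A minimal sketch:

```typescript
// accAfterTask[i]: accuracy on task i measured immediately after learning it.
// accFinal[i]:     accuracy on task i after all subsequent learning.
// Both arrays cover tasks 1..T-1 (the most recent task cannot be forgotten yet).
function backwardTransfer(accAfterTask: number[], accFinal: number[]): number {
  const deltas = accFinal.map((a, i) => a - accAfterTask[i]);
  return deltas.reduce((sum, d) => sum + d, 0) / deltas.length;
}

// Task 1 dropped from 90% to 70% after later learning; task 2 held steady.
const bwt = backwardTransfer([0.9, 0.8], [0.7, 0.8]); // -0.1 → forgetting occurred
```

A BWT tracked per agent over time is exactly the kind of signal a human-in-the-loop dashboard would surface before forgetting reaches production.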

Real-World Applications

Customer Support Evolution

Instead of retraining your entire support bot when you launch new products, agents continuously learn from each interaction while maintaining expertise on existing products.

Multi-Domain AI Assistants

Deploy agents that can handle multiple business functions—HR, sales, and technical support—without knowledge silos or interference between domains.

Collaborative Research Teams

Multiple AI researchers can contribute to shared knowledge bases, with automatic conflict resolution and knowledge integration.

Adaptive Recommendation Systems

Recommendation engines that learn from user behavior in real time while maintaining an understanding of long-term preferences and seasonal patterns.

References

  1. Parisi, G. I., Kemker, R., Part, J. L., Kanan, C., & Wermter, S. (2019). Continual Lifelong Learning with Neural Networks: A Review. Neural Networks. arXiv preprint arXiv:1802.07569.
  2. Mikolov, T., Sutskever, I., Chen, K., Corrado, G., & Dean, J. (2013). Distributed Representations of Words and Phrases and their Compositionality. arXiv preprint arXiv:1310.4546.
  3. Angles, R. (2018). The Property Graph Database Model. CEUR Workshop Proceedings, 2100. https://ceur-ws.org/Vol-2100/paper26.pdf
