Introduction
"She survived T-virus mutants on screen. Now she's fixing AI amnesia in real life."
This is article No.44 in the "One Open Source Project a Day" series. Today's project is MemPalace (GitHub).
Let's start with the backstory, because it's genuinely surprising: one of this project's co-founders is Milla Jovovich — Alice from the Resident Evil franchise, Leeloo from The Fifth Element, the Hollywood actress who spent six films fighting zombie hordes. She has no engineering background. But she's a heavy AI tools user, grew deeply frustrated with how existing memory solutions worked, and teamed up with crypto industry CEO Ben Sigman to build this — primarily using Claude Code over several months.
That fact alone is worth pausing on: a non-engineer, using an AI coding tool, shipped an AI infrastructure project that hit 22k stars in 48 hours.
Now for the product itself. LLMs are fundamentally stateless: every new session, your AI assistant starts from scratch. It doesn't remember last week's architectural decision, doesn't know you prefer TypeScript, and has no sense of your project history. For solo developers, that context-rebuilding overhead costs an estimated 200-600 euros' worth of productive time every month.
MemPalace tackles this with a 2,000-year-old cognitive technique: the Method of Loci. Ancient Greek rhetoricians "placed" things to remember in imagined architectural spaces. MemPalace translates that spatial metaphor into an AI memory architecture: people and projects become "Wings," specific topics become "Rooms," raw content lives in "Drawers."
48k+ Stars, 6.3k+ Forks — one of the most-watched AI tools of the moment.
What You'll Learn
- How the memory palace metaphor maps to an AI memory architecture
- 4-layer progressive loading: booting a full memory system on just 170 tokens
- Zero-LLM write path: eliminating API costs at the storage layer entirely
- Temporal knowledge graph: preventing stale information from polluting memory
- MCP integration: how 29 tools connect to Claude Code and other AI assistants
- The controversy: which marketing claims have been disputed and why
Prerequisites
- Basic understanding of LLM context window limitations
- Python basics (optional)
- Experience with Claude Code, Cursor, or similar AI coding tools (helps contextualize the use case)
Project Background
What Is It?
MemPalace is a local-first, open-source AI memory system that gives AI assistants cross-session persistent memory via the MCP (Model Context Protocol) standard.
Core design philosophy:
```
Problem: LLMs are stateless; every session starts from zero
        ↓
Common solution: cloud memory services (Mem0, Zep)
  Problems: requires API keys, data leaves the local machine, expensive monthly fees
        ↓
MemPalace solution: local-first
  ✅ Fully offline (core features require zero API keys)
  ✅ Data stays local (privacy, no cloud upload)
  ✅ Zero-LLM write path (no AI called during storage = zero API cost)
  ✅ MCP protocol bridges mainstream AI assistants
```
About the Creators
The origin story is distinctive:
- Milla Jovovich: Hollywood actress, best known as Leeloo in The Fifth Element and Alice in the Resident Evil franchise. Non-technical background, but a heavy AI tools user who grew frustrated with existing memory solutions and kicked off the project
- Ben Sigman: Crypto/blockchain industry CEO with a technical development background
The two spent months building it primarily with Claude Code — the bulk of the codebase is AI-assisted. The project is itself a live demonstration of what AI-assisted development can produce.
Launched April 5, 2026, initially under Milla's personal GitHub account before moving to the MemPalace organization.
Project Stats
- ⭐ GitHub Stars: 48,500+ (22k in first 48 hours)
- 🍴 Forks: 6,300+
- 📝 Commits: 503+ (develop branch)
- 📄 License: MIT
- 🌐 Language breakdown: Python 88.9%, HTML 4.6%, CSS 3.1%, Vue 1.4%
- 📦 PyPI: `pip install mempalace`
Key Features
The Core: Memory Palace Spatial Structure
MemPalace's most distinctive design is using spatial metaphor to organize AI memory:
```
Palace
└── Wing                ← Top level: a person / a project / a major theme
    └── Room            ← Second level: specific topic (e.g., auth, billing, architecture)
        └── Hall        ← Third level: memory type classification
            │             (facts / events / decisions / preferences / emotional)
            ├── Drawer  ← Verbatim raw storage (800-char chunks, 100-char overlap)
            └── Closet  ← AAAK-compressed summary version

Tunnel                  ← Cross-wing connector (fast navigation between same-named Rooms)
```
This isn't just a metaphor — there's neuroscience behind it: the Method of Loci activates the hippocampus's place cell system, building stronger associative memory through spatial indexing. MemPalace's architecture attempts to give AI a similarly structured index.
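As a mental model, the hierarchy above can be sketched as plain Python dataclasses. These class and field names are illustrative only, chosen to mirror the diagram; they are not MemPalace's actual API:

```python
from dataclasses import dataclass, field

# Toy model of the Palace → Wing → Room → Hall → Drawer/Closet hierarchy.
# All names here are invented for illustration, not MemPalace's real code.

@dataclass
class Drawer:
    chunks: list = field(default_factory=list)   # verbatim 800-char chunks

@dataclass
class Hall:
    kind: str                                    # facts/events/decisions/preferences/emotional
    drawer: Drawer = field(default_factory=Drawer)
    closet: str = ""                             # AAAK-compressed summary lives here

@dataclass
class Room:
    name: str                                    # specific topic, e.g. "auth", "billing"
    halls: dict = field(default_factory=dict)

@dataclass
class Wing:
    name: str                                    # a person / a project / a major theme
    rooms: dict = field(default_factory=dict)

# Build a tiny palace: one project wing, one topic room, one decision memory.
palace = {"myapp": Wing("myapp")}
palace["myapp"].rooms["auth"] = Room("auth", {"decisions": Hall("decisions")})
palace["myapp"].rooms["auth"].halls["decisions"].drawer.chunks.append(
    "2026-03: switched from session cookies to JWT"
)
```

The point of the nesting is that every memory gets a spatial "address" (wing/room/hall) that can be used as a retrieval scope, which is exactly what the L2 layer below exploits.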
4-Layer Progressive Loading: 170 Tokens to Boot
The most notable engineering design in MemPalace — load on demand, conserve tokens aggressively:
```
L0: Identity file (~50-100 tokens)
    "Who I am, which project I'm working on"
        ↓ (need more context)
L1: Top 15 memories ranked by importance (~500-800 tokens)
    Recent decisions, key preferences, important facts
        ↓ (diving into a specific topic)
L2: Semantic recall scoped to current Wing/Room
    Project-relevant memories, vector retrieval
        ↓ (need global search)
L3: Full semantic search (no limit)
    Cross-Wing full-library retrieval
```
At conversation start, only L0 loads (50-100 tokens, comfortably inside the ~170-token boot budget). The AI assistant dives deeper on demand, so day-to-day memory overhead stays minimal.
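The load-on-demand idea can be sketched in a few lines of Python. This is a simplified illustration under the assumption that each layer is only materialized when the previous one falls short; the function names and the importance-score tuples are invented, not MemPalace's real interface:

```python
# Sketch of 4-layer progressive loading (illustrative, not real MemPalace code).
# L0 is always loaded; L1 only when the conversation needs more context.

def load_l0(identity: str) -> str:
    """L0: the identity file only (~50-100 tokens)."""
    return identity

def load_l1(memories, top_k: int = 15):
    """L1: top-k memories ranked by an importance score.
    `memories` is a list of (importance, text) tuples."""
    ranked = sorted(memories, key=lambda m: m[0], reverse=True)
    return [text for _, text in ranked[:top_k]]

def progressive_load(identity, memories, need_l1=False):
    """Assemble context lazily: start at L0, extend to L1 on demand."""
    context = [load_l0(identity)]
    if need_l1:
        context.extend(load_l1(memories))
    return "\n".join(context)

ctx = progressive_load(
    "I am the coding agent for myapp",
    [(0.9, "prefers TypeScript"), (0.4, "met deadline")],
    need_l1=True,
)
```

L2 and L3 would follow the same pattern, swapping the in-memory sort for a scoped and then an unscoped vector query.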
Zero-LLM Write Path: No API Cost at Storage Time
MemPalace's write path never calls an LLM:
```
# Traditional AI memory systems (with API cost):
User content → Call LLM to extract key info → Store summary
                        ↑
        Costs money + slow + requires network

# MemPalace write path:
User content → Regex rules + keyword scoring → Auto-classify to Wing/Room/Hall
             → Chunk into 800-char blocks → ChromaDB
                        ↑
        Fully local, zero API calls
```
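The chunking step of that write path (800-character blocks with 100-character overlap) is easy to show concretely. A toy version; the real project's boundary handling may differ:

```python
def chunk(text: str, size: int = 800, overlap: int = 100):
    """Split verbatim content into overlapping Drawer chunks:
    `size`-char blocks where each block repeats the last `overlap`
    chars of the previous one, so no sentence is cut without context.
    Sketch only; not MemPalace's actual chunker."""
    step = size - overlap
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]

# 2000 chars of distinguishable content → 3 chunks of at most 800 chars.
text = "".join(str(i % 10) for i in range(2000))
chunks = chunk(text)
```

The overlap means each chunk's tail is repeated at the head of the next chunk, which keeps semantic-search hits usable even when the relevant sentence straddles a boundary.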
Project mining (`mempalace mine`):

```shell
# Scan entire project directory, auto-classify memories by rules
mempalace mine ~/projects/myapp

# Four-priority classification rules:
# 1. Folder name     (e.g., auth/      → auth room)
# 2. File name       (e.g., billing.md → billing room)
# 3. Keyword scoring (content keyword matching)
# 4. Default:        general room
```
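The four-priority order might look like this in Python. The keyword lists, function name, and room names below are invented for illustration; they are not taken from the MemPalace codebase:

```python
import re
from pathlib import Path

# Hypothetical keyword table; the real project's rules will differ.
ROOM_KEYWORDS = {
    "auth": ["login", "jwt", "oauth", "password"],
    "billing": ["invoice", "stripe", "payment"],
}

def classify(path: str, content: str) -> str:
    """Assign a file to a room using the four-priority rules above."""
    p = Path(path)
    rooms = set(ROOM_KEYWORDS)
    # 1. Folder name (e.g. auth/ → auth room)
    for part in p.parts[:-1]:
        if part.lower() in rooms:
            return part.lower()
    # 2. File name (e.g. billing.md → billing room)
    if p.stem.lower() in rooms:
        return p.stem.lower()
    # 3. Keyword scoring: pick the room whose keywords match most often
    scores = {
        room: sum(len(re.findall(rf"\b{kw}\b", content, re.I)) for kw in kws)
        for room, kws in ROOM_KEYWORDS.items()
    }
    best = max(scores, key=scores.get)
    if scores[best] > 0:
        return best
    # 4. Default room
    return "general"

classify("docs/auth/tokens.md", "")             # folder name wins
classify("notes.md", "rotate the JWT secret")   # keyword scoring wins
```

Because the rules are pure string matching, the whole mining pass stays fast and offline, which is what makes the zero-LLM write path viable.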
Temporal Knowledge Graph: Preventing Stale Data Pollution
MemPalace uses SQLite to store entity triples with validity time windows:
```sql
-- Knowledge graph structure
CREATE TABLE knowledge_graph (
    subject    TEXT,
    predicate  TEXT,
    object     TEXT,
    valid_from TEXT,   -- ISO timestamp
    valid_to   TEXT,   -- NULL = currently valid
    wing       TEXT,
    confidence REAL
);

-- Example entries:
-- ("project",    "language", "TypeScript", "2026-01-01", NULL,         "myapp")
-- ("deployment", "platform", "Vercel",     "2026-01-01", "2026-03-15", "myapp")
--    ↑ expired: no longer valid after 2026-03-15
```
This prevents a common problem: "project deploys to Vercel" recorded three months ago continuing to pollute the AI's context after you migrated to AWS.
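The "close the window instead of deleting" pattern is easy to demonstrate with Python's built-in sqlite3. This is a sketch of the idea, not MemPalace's actual upsert code; the helper name `upsert_fact` is invented:

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.execute("""CREATE TABLE knowledge_graph (
    subject TEXT, predicate TEXT, object TEXT,
    valid_from TEXT, valid_to TEXT, wing TEXT, confidence REAL)""")

def upsert_fact(subject, predicate, obj, ts, wing, confidence=1.0):
    """Record a new fact: close any currently-valid fact for the same
    (subject, predicate, wing) rather than deleting it, then insert."""
    con.execute(
        "UPDATE knowledge_graph SET valid_to=? "
        "WHERE subject=? AND predicate=? AND wing=? AND valid_to IS NULL",
        (ts, subject, predicate, wing))
    con.execute("INSERT INTO knowledge_graph VALUES (?,?,?,?,NULL,?,?)",
                (subject, predicate, obj, ts, wing, confidence))

upsert_fact("deployment", "platform", "Vercel", "2026-01-01", "myapp")
upsert_fact("deployment", "platform", "AWS", "2026-03-15", "myapp")  # supersedes Vercel

current = con.execute(
    "SELECT object FROM knowledge_graph "
    "WHERE subject='deployment' AND predicate='platform' AND valid_to IS NULL"
).fetchone()[0]
# current == "AWS"; the Vercel row survives with valid_to = '2026-03-15'
```

Note that history is preserved: "we used to deploy on Vercel" remains queryable, but only the AWS fact surfaces when the AI asks for current state.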
29 MCP Tools: Seamless Claude Code Integration
```shell
# Add MCP server to Claude Code
claude mcp add mempalace -- python -m mempalace.mcp_server
```
The 29 MCP tools cover:
- Palace navigation: `navigate_to_wing`, `navigate_to_room`, tunnel traversal
- Memory I/O: `store_memory`, `retrieve_memory`, `search_across_palace`
- Knowledge graph: `add_entity`, `query_relationships`, `update_fact`
- Agent diary: independent diary per Agent for reasoning traces
- Auto-hooks: extract and archive key info on session end / pre-compaction
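For orientation, this is roughly what an MCP client sends when it invokes one of these tools. The JSON-RPC envelope (`tools/call` with `name` and `arguments`) follows the MCP specification, but the `store_memory` argument names below are assumptions for illustration, not MemPalace's documented schema:

```python
import json

# Hypothetical MCP tools/call request for store_memory.
# Envelope shape per the MCP spec; argument names are assumed.
request = {
    "jsonrpc": "2.0",
    "id": 1,
    "method": "tools/call",
    "params": {
        "name": "store_memory",
        "arguments": {
            "wing": "myapp",
            "room": "auth",
            "content": "Decided to move refresh tokens to Redis.",
        },
    },
}
payload = json.dumps(request)  # what goes over the stdio/HTTP transport
```

Because MCP standardizes this envelope, the same 29 tools work unchanged from Claude Code, Cursor, or any other MCP-capable client.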
Quick Start
```shell
# Install
pip install mempalace

# Initialize project memory
mempalace init ~/projects/myapp

# Scan and import project files
mempalace mine ~/projects/myapp

# Semantic search
mempalace search "authentication approach"

# Generate current context summary (paste into AI conversation)
mempalace wake-up > context.txt

# Integrate with Claude Code (MCP)
claude mcp add mempalace -- python -m mempalace.mcp_server
```
Development setup:
```shell
git clone https://github.com/MemPalace/mempalace.git
cd mempalace
pip install -e ".[dev]"
pytest tests/ -v
```
Deep Dive
System Architecture
```
┌────────────────────────────────────────────────────────┐
│                  MCP Layer (29 tools)                  │
│    Claude Code / ChatGPT / Cursor / any MCP client     │
└───────────────────────┬────────────────────────────────┘
                        │
┌───────────────────────▼────────────────────────────────┐
│          Memory Orchestration Layer (Python)           │
│   4-layer load control / spatial navigation / hooks    │
└────────┬──────────────────────────────┬────────────────┘
         │                              │
┌────────▼────────────────┐  ┌──────────▼────────────────┐
│  Vector Storage Layer   │  │   Knowledge Graph Layer   │
│  ChromaDB               │  │  SQLite                   │
│  Drawers (raw chunks)   │  │  Entity triples           │
│  Semantic vector index  │  │ Temporal validity windows │
└─────────────────────────┘  └───────────────────────────┘
```
AAAK Compression Format
MemPalace introduces a compression format called AAAK (Agent Accessible Abbreviated Knowledge) for Closet storage:
```
Original text:
"The authentication module uses JWT tokens with a 24-hour expiration.
 Refresh tokens are stored in Redis with a 30-day TTL."

After AAAK compression:
"auth: JWT, 24h exp; rfsh tkn: Redis, 30d TTL"
```
Design rationale: any LLM can natively read it (it's structured English abbreviation, not binary compression). The team claimed 30x compression, but independent testing found that enabling AAAK degrades retrieval quality by ~12.4%, because a 55-character hard truncation cuts off sentence semantics. The team has publicly acknowledged this.
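A toy reproduction of the failure mode makes the criticism concrete. This is not the real AAAK implementation; the abbreviation table and the 55-character cap are stand-ins to show why a hard truncation loses the tail of a sentence, and why `len(text)//3` is a crude token estimate:

```python
# Toy AAAK-style compressor (illustrative only, not MemPalace's code).
ABBREV = {"authentication": "auth", "refresh": "rfsh", "token": "tkn",
          "expiration": "exp", "hour": "h", "day": "d"}

def aaak_compress(text: str, cap: int = 55) -> str:
    """Abbreviate known words, then apply a hard character cap.
    The cap is the problem: everything past char 55 is simply dropped."""
    words = [ABBREV.get(w.lower().strip(".,"), w) for w in text.split()]
    return " ".join(words)[:cap]

src = ("The authentication module uses JWT tokens with a 24-hour expiration. "
       "Refresh tokens are stored in Redis with a 30-day TTL.")
out = aaak_compress(src)
# The entire second sentence (Redis, 30-day TTL) falls past the cap and is lost.

est_tokens = len(out) // 3  # the criticized estimator, not a real tokenizer
```

Any retrieval query about the Redis TTL now matches nothing in the compressed Closet copy, which is consistent with the measured ~12.4% quality drop.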
How It Compares
| Dimension | MemPalace | Mem0 | Zep | Hindsight |
|---|---|---|---|---|
| Deployment | Local only | Cloud only | Cloud only | Cloud + local |
| Pricing | Free | $249/mo (enterprise) | Paid | Free + paid |
| LLM write path | None (zero cost) | Yes | Yes | Yes |
| Memory organization | Spatial palace | Flat summaries | Knowledge graph | Multi-strategy |
| LongMemEval score | 96.6% (disputed) | 49.0% | 63.8% | 91.4% |
Understanding the Controversy
Independent technical analysis after launch uncovered several issues. The team has acknowledged most of them:
1. Benchmark gaming
The advertised 96.6% LongMemEval score only tested ChromaDB's raw embedding capability — the palace structure was completely bypassed in the test. When spatial features are enabled, performance actually drops:
- With Room-scoped retrieval: 89.4% (−7.2 pp)
- With AAAK compression: 84.2% (−12.4 pp)
2. "30x lossless compression" is inaccurate
AAAK's 55-character hard truncation loses sentence endings, and token counting uses `len(text)//3` instead of a real tokenizer. The team updated their documentation after being called out.
3. Unimplemented features
"Contradiction detection" mentioned in the README has zero corresponding code in `knowledge_graph.py`. A "fact-checking" module exists but isn't integrated into the main pipeline.
Why it's still worth watching
These criticisms are primarily about marketing claims. The core design ideas are genuinely valuable: spatial metaphor organization, 4-layer progressive loading, zero-LLM write path — these design patterns have merit independent of any benchmark number. And a team that publicly acknowledges mistakes is more trustworthy long-term than one that doesn't.
Resources
Official
- 🌟 GitHub: https://github.com/MemPalace/mempalace
- 🌐 Website: https://www.mempalace.tech
- 📦 PyPI: `pip install mempalace`
Technical References
- 📊 Independent analysis: lhl/agentic-memory ANALYSIS-mempalace.md
- 🧠 Method of Loci: Wikipedia — Memory palace
- 🔌 MCP Protocol: Model Context Protocol documentation
Alternatives Worth Knowing
- Mem0: https://mem0.ai — easiest-to-start cloud memory service
- Zep: https://www.getzep.com — enterprise-grade knowledge graph memory
- Hindsight: https://hindsight.so — SOTA multi-strategy retrieval
Summary
Key Takeaways
- The spatial metaphor has real value: The Wing/Room/Drawer hierarchy isn't just a cute analogy — it's a genuinely useful memory index structure that reduces the cognitive load of managing AI memory
- 4-layer progressive loading: Starting at 170 tokens and deepening on demand is an elegant design pattern worth borrowing for any agent memory system
- Zero-LLM write path: Eliminating storage costs entirely is the logical conclusion of "local-first" philosophy
- Temporal knowledge graph: Validity windows preventing stale data pollution is an underappreciated practical detail
- "AI helps build AI tools" experiment: A non-engineer building a functional AI infrastructure tool with Claude Code is a meaningful data point about where AI-assisted development stands today
- Honest team response: Publicly acknowledging benchmark issues rather than doubling down earns long-term credibility
Who Should Use This
- Heavy AI tool users: Solo developers and creative professionals who rebuild context every single session
- Privacy-first users: Individuals and teams unwilling to upload work memory to cloud services
- AI Agent developers: Engineers who need persistent memory infrastructure for their agents
- Researchers: Scholars and engineers studying AI memory architecture and persistent memory design patterns
A Question Worth Sitting With
MemPalace implicitly asks: what is the right unit of organization for AI memory?
Mem0 chose "summary entries." Zep chose "knowledge graph entities." MemPalace chose "spatial locations." No single answer is universally correct — but choosing the spatial metaphor carries a subtle assumption: that the way humans use AI is increasingly like long-term collaboration in a shared workspace. When an AI assistant starts having memories like "we discussed this in the auth room last month," something fundamental shifts about what human-AI collaboration actually is.
Visit my personal site for more useful knowledge and interesting products