Your agent's memory can't tell you when it's lying

#ai #llm #opensource #devtools

Your coding agent has "memory" now — but can it tell when that memory is wrong? 🧠

Kage is a memory layer for coding agents — Claude Code, Codex, Cursor, Windsurf, anything that speaks MCP. Your agent recalls what your team already figured out about the repo before it edits, and captures new learnings as it works. Less re-explaining your codebase, fewer repeated mistakes.

The catch with most agent memory: it'll cheerfully hand your agent a "fact" about a file that got deleted last week, or one that never existed. Acting on that is worse than having no memory at all. Kage is built so that kind of memory doesn't reach your agent.

See it in 30 seconds (no setup):

npx -y @kage-core/kage-graph-mcp demo

How you actually use it

Install once, point it at your agent, and it's mostly automatic from there:

npm i -g @kage-core/kage-graph-mcp
kage init --project .
kage setup claude-code --project . --write   # or: codex, cursor, windsurf…

Then, every task:

🔎 At task start, your agent recalls the grounded, still-true memory for the code it's about to touch.
✍️ As it learns a bug cause, a gotcha, a decision — it captures a durable note (validated on the way in).
🔁 You review memory in the same PR as the code; kage refresh re-grounds it on merge.

No vector DB, no API key, no separate service. The memory is just JSON in your repo.
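To make "just JSON in your repo" concrete, here is a rough sketch of how a note could be written so that it diffs cleanly in a PR. This is not Kage's actual on-disk format, which this post doesn't show: the `save_note` and `load_notes` helpers, the `agent-memory` directory, and the `note` / `cites` fields are all made up for illustration.

```python
import json
import tempfile
from pathlib import Path


def save_note(memory_dir: Path, note_id: str, note: dict) -> Path:
    """Write one note per file, with stable formatting so git diffs stay small."""
    memory_dir.mkdir(parents=True, exist_ok=True)
    path = memory_dir / f"{note_id}.json"
    # sort_keys + indent: the same note always serializes to the same bytes.
    text = json.dumps(note, indent=2, sort_keys=True, ensure_ascii=False)
    path.write_text(text + "\n", encoding="utf-8")
    return path


def load_notes(memory_dir: Path) -> dict[str, dict]:
    """Read every note back, keyed by file name without the extension."""
    return {
        p.stem: json.loads(p.read_text(encoding="utf-8"))
        for p in sorted(memory_dir.glob("*.json"))
    }


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        memory = Path(tmp) / "agent-memory"  # hypothetical directory name
        save_note(
            memory,
            "retry-gotcha",  # hypothetical note id
            {"note": "fetch_user() retries twice; do not wrap it again",
             "cites": ["src/users.py"]},
        )
        print(load_notes(memory))
```

One file per note with sorted keys means a reviewer sees a small, readable diff for each learning, which is the property that lets memory ride along in the same PR as the code.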

What makes the memory trustworthy 🛡️

Every learning becomes a packet, and each one has to earn its keep:

capture → validate citations → ground to code → store as git JSON → recall

🚫 Validated on write: a packet citing files that don't exist is rejected, so hallucinated file references never get in.
🔗 Grounded to your code: each packet is fingerprinted to the real files it's about.
⊘ Withheld on recall: if the cited code was deleted or refactored, the memory is hidden from the agent and flagged for you.
📝 Governed like code: plain JSON, reviewed in the PR, versioned with your repo.
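If you want a feel for how validate-on-write and withhold-on-recall could fit together, here is a minimal sketch under stated assumptions. It is not Kage's implementation: the `Packet` class, `capture`, `is_grounded`, and `recall` are hypothetical names, and the fingerprint is a plain SHA-256 of the whole file, which is stricter than a real tool would likely be (any edit to a cited file withholds the packet).

```python
import hashlib
import tempfile
from dataclasses import dataclass, field
from pathlib import Path


@dataclass
class Packet:
    """Hypothetical memory packet: a note plus the files it cites."""
    note: str
    cites: list[str]
    fingerprints: dict[str, str] = field(default_factory=dict)


def fingerprint(path: Path) -> str:
    """Content hash of a file; any edit to the file changes it."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def capture(repo: Path, note: str, cites: list[str]) -> Packet:
    """Validate on write: reject a packet that cites a missing file."""
    missing = [c for c in cites if not (repo / c).is_file()]
    if missing:
        raise ValueError(f"rejected, cited files not found: {missing}")
    return Packet(note, cites, {c: fingerprint(repo / c) for c in cites})


def is_grounded(repo: Path, packet: Packet) -> bool:
    """True only if every cited file still exists with unchanged content."""
    return all(
        (repo / c).is_file() and fingerprint(repo / c) == digest
        for c, digest in packet.fingerprints.items()
    )


def recall(repo: Path, packets: list[Packet]) -> tuple[list[Packet], list[Packet]]:
    """Split packets into (shown to the agent, withheld and flagged)."""
    shown = [p for p in packets if is_grounded(repo, p)]
    withheld = [p for p in packets if not is_grounded(repo, p)]
    return shown, withheld


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        repo = Path(tmp)
        (repo / "auth.py").write_text("def login(): ...\n")
        packet = capture(repo, "login() must run before refresh()", ["auth.py"])
        print("shown before delete:", len(recall(repo, [packet])[0]))
        (repo / "auth.py").unlink()  # simulate the file being deleted
        print("withheld after delete:", len(recall(repo, [packet])[1]))
```

The design point is that staleness is checked at recall time against the working tree, so a packet that was true when written stops being served the moment its cited code disappears or changes.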

Trust shouldn't be a vibe, so you can actually score your repo's memory with kage benchmark --trust (hallucinated-citation rejection, stale-memory exclusion, live grounding) — but that's the proof, not the product.

See what your agents remember 👀

Kage ships a viewer — a live dashboard of your repo's memory: a Memory Trust score, every packet wired to the code it's grounded in, and a feed of what agents actually recalled.

Play with the live one (nothing to install): https://kage-core.com/viewer

Try it 🚀

# 30-second demo
npx -y @kage-core/kage-graph-mcp demo

# then wire it into your repo + agent
npm i -g @kage-core/kage-graph-mcp
kage init --project .
kage setup claude-code --project . --write