DEV Community: Saravanan Jaichandaran

Your dogfooding output IS a bug report, a story about three empty tables

Saravanan Jaichandaran — Thu, 09 Jul 2026 14:45:00 +0000

I ship world-model-mcp — a temporal knowledge graph MCP server that gives AI coding agents (Claude Code, Cursor, Copilot Chat, etc.) durable memory across sessions. It captures facts about your codebase, learned constraints from your corrections, and lifecycle events from every coding session.

A month after v0.11 shipped, I ran the graph on the world-model-mcp repo itself — dogfooded it and snapshotted what it captured. The numbers looked like this:

608 facts. 600 entities. 3 constraints. That should have been a headline: my own memory layer captured a rich picture of its own codebase.

Except three tables were empty.

*Events, decisions, and sessions, the tables that should have accumulated across a month of active development were all zero.
*

This post is about what I did with that anomaly, what it turned out to hide, and the general lesson I keep coming back to about dogfooding.

The first hypothesis was wrong
If you wouldd asked me on that morning what the three empty tables meant, I would have said: "the hooks aren't firing." Claude Code's session lifecycle fires SessionStart, PreToolUse, PostToolUse, and PostCompact hooks that my graph subscribes to. If those hooks weren't running, no events, no decisions, no sessions — the exact shape of what I was seeing.

The most likely reason a Claude Code hook doesn't fire is that the project doesn't have an ".mcp.json" at the repo root pointing at the MCP server. So I added one. Fifteen-line JSON file.

Two hours later, the tables were still empty.

Something else was going on. And this is the part where dogfooding stopped being a nice-to-have and became a debugging tool.

The transcripts don't lie
Claude Code writes every session to ~/.claude/projects//*.jsonl. Each hook invocation, each tool call, each error all captured. I opened the transcripts for this project and started grepping for SessionStart. Every single invocation, going back to the first one on 2026-06-15, showed the same silent failure:

hookName: SessionStart:startup exitCode: 1 stderr: Error: Cannot find module '/xxx/xxx/claude' Node.js v23.11.0 Wait. Cannot find module '/xxx/xxx/claude'? That's not a full path. That's a fragment of my project path.

The project lives at:

/xxx/xxx/claude context graph/world-model-mcp/

Two spaces. claude context graph. If you split that on whitespace, you get claude, context, graph/world-model-mcp/… — three separate strings.

That's exactly what happened.

The bug I had been shipping for months

Every hook command in my generated .claude/settings.json used the environment variable $CLAUDE_PROJECT_DIR without quotes:

{ "hooks": { "SessionStart": [ { "matcher": "startup", "hooks": [ { "type": "command", "command": "node $CLAUDE_PROJECT_DIR/.claude/hooks/world-model-session.js start" } ] } ] } }

When Claude Code expanded $CLAUDE_PROJECT_DIR at shell time and the value contained a space, the shell split the expansion on whitespace. Node received:

arg 1: /xxx/xxx/claude — treated as the module argument, doesn't exist
arg 2: context
arg 3: graph/world-model-mcp/.claude/hooks/world-model-session.js
arg 4: start

Hence the error message: Node reports the first argument as the missing module, so the stderr said Cannot find module '/xxx/xxx/claude__' — a name that no user searching their own logs for troubleshooting would ever recognize.

*Zero user-facing error. Zero visible clue. The database just stayed empty.
*
And it wasn't only my problem. Anyone whose project path contained a space would hit this. That's:

macOS defaults: ~/Documents/…, ~/Desktop/…
Corporate paths: ~/Work Stuff/Client X/…
Or intentional folder names like my own claude context graph/

Every one of those users, silently, since v0.7.3 shipped hooks in early 2026. Months of silent failure.

The fix took two lines
Wrap $CLAUDE_PROJECT_DIR in double quotes in every generated command string:

"command": "node $CLAUDE_PROJECT_DIR/.claude/hooks/world-model-session.js start"
"command": "node \"$CLAUDE_PROJECT_DIR/.claude/hooks/world-model-session.js\" start"

Add a regression test:
def test_setup_command_generates_quoted_project_dir(tmp_path): """v0.11.0 regression: unquoted $CLAUDE_PROJECT_DIR broke on paths with spaces. The bundled settings.json must double-quote every expansion.""" from world_model_server.setup import setup_command setup_command(project_dir=tmp_path) settings = json.loads((tmp_path / ".claude" / "settings.json").read_text()) for event_hooks in settings["hooks"].values(): for entry in event_hooks: for hook in entry["hooks"]: cmd = hook.get("command", "") if "$CLAUDE_PROJECT_DIR" in cmd: assert '"$CLAUDE_PROJECT_DIR' in cmd, ( f"Unquoted $CLAUDE_PROJECT_DIR in hook command: {cmd}" )

Ship it in v0.11.0 with an entry in RELEASE_NOTES.

That's the mechanical part. What actually matters is the pattern I want to argue for.

The load-bearing lesson
Here is what I keep coming back to: when a snapshot shows a shape that doesn't match your mental model, that anomaly is the signal, even if nothing else surfaces.

Three tables empty in a database with 608 facts and 600 entities isn't a database problem in the absolute-count sense. If you were looking at the total row count you'd see 1,211 rows and think "wow, that's a lot of memory." The failure hides in the ratio — dense data in some places, zero in others, with no external reason for the divergence.

That's the shape dogfooding is uniquely suited to catch:

Zero-elsewhere alarms: your test suite doesn't complain (green). Your CI doesn't complain (green). Your users don't complain (they didn't know the tables were supposed to have data). The absence-of-signal problem is exactly the one no signal-based observability catches.
Anomaly detection via personal mental model: I knew the events table should have hundreds of rows from a month of coding. Nobody else knew that. You need someone using the tool with real intent to notice — and that someone has to be paying enough attention to look at what's there instead of just consuming what the tool serves.
Compound signal from adjacent-tables comparison: 608 facts against zero events is more suspicious than either number alone. The contrast is the tell.

I think of this as the "empty room" problem in tool-building. A tool that gives you back nothing might be broken. A tool that gives you back partial results is definitely broken. And the way you catch partial results is by knowing what the shape should look like which only happens if you use the tool on your own real work.

Follow-up: v0.12.1 shipped a diagnostic for this whole class

The specific bug is now fixed — the shell-quoting issue can't recur in newly generated settings.json files. But installs that predate the fix (or that drift over time for other reasons) can still silently fail. So v0.12.1 shipped a new command:

world-model doctor --project-dir .

Eight diagnostic checks including:

.claude/settings.json presence
Shell-quoting of $CLAUDE_PROJECT_DIR in every hook command (specifically catches the pre-v0.11.0 bug pattern)
Hook script presence
.mcp.json registration
DB directory + stale queue
Claude Code hook error history, filtered by settings.json mtime (so historical failures pre-date-fix WARN rather than FAIL)

--json for machine-readable output. --fix for safe auto-rewrites. On the maintainer's own repo now: 7 pass, 1 warn (pre-fix history), 0 fail.

Would doctor have caught the original bug in April? Absolutely — the shell-quoting check is the bug pattern. That's the point.

The class of latent bug that dogfooding catches once is the class of latent bug you should be able to diagnose forever after.

What I want you to take away

Two things.

One: if you're building any kind of stateful tool memory layers, caches, event pipelines, telemetry, start looking at what your tool captures about itself running on your own work. Not synthetic fixtures. Not test data. What did it record when you ran it on your real repo? Does the shape look right? If a table you expected to have data is empty, that's not a database problem, it's a bug report waiting to be transcribed.

Two: the specific tool matters less than the discipline. I use world-model-mcp because I built it. If you build tools and don't dogfood, that's the loss. If you build tools and do dogfood but never look at the shape of what got captured, that's the loss too. The looking is the whole practice.

Links

Full case study with reproducibility contract: case-studies/v011-dogfooding/CASE_STUDY.md
world-model doctor implementation: world_model_server/doctor.py
Repo: SaravananJaichandar/world-model-mcp

If you want to try it: pip install world-model-mcp && world-model setup

Your AI model is temporary. Your learning loop should not be.

Saravanan Jaichandaran — Tue, 16 Jun 2026 15:58:47 +0000

Last week made something clear that a lot of people in AI have been circling around for months.

On June 9, Anthropic released Fable 5. It was the most powerful model they had ever put in public. State of the art on almost every benchmark. Four days later, on June 12, the United States government issued an export order. Anthropic had to switch Fable 5 off for every foreign national in the world. Inside the country, outside the country, even their own staff who were not US citizens. The reason was a narrow security issue, but the result was simple. A model that thousands of people had started to build on top of was gone in four days.

Not gone because the company changed its mind. Gone because of an order nobody saw coming.

Sit with that for a second. If you had wired your product, your workflow, or your team's daily work to that one model, you woke up on June 13 with a hole where your intelligence used to be.

This is the part most people are missing. The story is not really about one model or one government order. The story is that the model layer is now something that can move under your feet without warning. And if the model can move, then anything valuable you own cannot live inside the model.

Satya Nadella said the same thing, in different words

A day after the Fable news, Satya Nadella published an essay on the future of the firm in an AI economy. People read it as a Microsoft strategy piece. It is more useful than that.

His core point was that the model is not the asset. The asset is the learning loop you build on top of the model. He split it into two kinds of capital. Human capital is the judgment and knowledge of your people. Token capital is the AI capability you build and own. The companies that win, he said, will be the ones that turn their workflows and their corrections and their hard won judgment into a system that gets better every time it is used.

Then he gave the test. And this is the line worth tattooing somewhere.

A company should be able to swap out a general model without losing the company veteran expertise built into their system. That, he said, is the real test of control and sovereignty in this era.

Read that next to the Fable story. The government pulled the general model. The question Nadella is asking is whether you lose your veteran when that happens. If you do, you never owned anything. You were renting intelligence and calling it yours.

So what is the thing you actually own

Here is the honest version. You cannot own a frontier model. That race costs billions and you will lose it. You do not need to.

What you can own is the loop that sits on top of whatever model you happen to be using this month.

Think about how an AI coding agent works today. It reads your code. It makes a change. You correct it. It makes the same mistake next week, because the moment the context window resets, every correction you gave it is gone. The intelligence was real, but it had no memory and no learning. You were the memory. You were the one carrying the lessons from session to session, in your head.

That is the gap. The model is smart and forgetful. The value is in the part that remembers and learns, and right now that part is you, doing it by hand.

What I have been building

I have been building this loop as an actual thing you can install. It is called world-model-mcp, it is open source under MIT, and it is on PyPI.

It is a memory layer that runs alongside your coding agent. It is not tied to one model or one tool. It runs across Claude Code, Cursor, Codex, and others, because it talks over MCP, which is becoming the standard way agents plug into tools.

Here is what it does, in plain terms.

When you correct the agent, it learns the correction as a rule and applies that rule in future sessions, so you stop fixing the same thing twice. It checks proposed changes against what it already knows before the edit lands, so the agent stops inventing functions and APIs that do not exist. When the context window resets, it puts the important rules and recent facts back, so the agent does not lose the thread. It tracks facts over time, so it knows what was true last month and what is true now, and it can tell you when two things it learned contradict each other and pick a winner based on confidence and evidence.

It also forgets on purpose. A correction you gave is worth keeping for a long time. A passing detail from one session should fade. So different kinds of knowledge decay at different rates. A user correction stays strong for two years. A loose session note fades in two weeks. The loop keeps what matters and lets the noise go.

None of this lives in the model. It lives in a local store that you own. Swap the model and the veteran stays. That is the whole point.

Why this matters more after last week, not less
A year ago this would have sounded like a nice optimization. After last week it sounds like insurance.

Models are now things that can be released, jailbroken, and pulled by a government in the same week. Inference is turning into a utility you rent by the token. In that world, the only durable thing is the learning that is yours, sitting somewhere you control, ready to attach to whatever model is standing after the dust settles.

Nadella framed this for big companies and their private knowledge. I am building the version that runs locally, that an individual developer or a small team can own outright, with no cloud holding their data and no single vendor they cannot walk away from. Same idea. The loop is the asset. The model is a part you swap.

The frontier model you depend on today might be gone in four days. The lessons your tools learned from you should not go with it.

If you want to try it

world-model-mcp is on PyPI and the code is open. If you use an AI coding agent, install it, let it watch a few of your sessions, and see whether it stops repeating the mistakes you already corrected once.

And if you build with it or break it, tell me what worked and what did not. The feedback is what shapes what ships next.

The model is temporary. Build the part that lasts.

The fifth layer is forming: six memory-tool authors wrote a Claude Code spec

Saravanan Jaichandaran — Fri, 05 Jun 2026 05:57:36 +0000

A GitHub issue on anthropics/claude-code is quietly becoming the most interesting agent-infrastructure document of the year. It is not a roadmap published by Anthropic. It is a spec being written by the people who actually build memory layers for Claude Code, contributing into a thread that the maintainers have not weighed in on.

The issue is #47023, titled "[PROPOSAL] Expose compact/session lifecycle hooks for external memory layers." It was opened on April 12 by Isaac (immartian), who builds Bellamem. Over the following eight weeks, six other tool authors arrived and contributed substantive comments. By early June it had become a working group writing a four-hook lifecycle spec, complete with payload shapes, edge cases, and a per-item provenance schema. None of the contributors were assigned. They showed up because they all kept hitting the same wall.

This piece names the wall, walks through who said what, and argues that the working group is implicitly defining a fifth layer of the agentic stack: runtime memory and shared state. The standard taxonomy (AGENTS.md for context, SKILL.md for knowledge, methodology frameworks for discipline, orchestration for scale) does not have a slot for it. Six independent builders are now proving the slot exists.

The proposal
Isaac opened the thread with a clean technical observation. Claude Code's existing hook surface covers tool-call lifecycle (PreToolUse, PostToolUse) and session boundaries (SessionStart, SessionEnd), but it has no first-class lifecycle event around compaction. The agent's working context can be reset by the model's compactor at any time, dropping facts that an external memory layer had carefully recorded, with no notification, no opportunity to inject a curated summary, and no way to record what was lost.

His proposal: four hooks, named PreCompact, PostCompact, SessionEnd, and a refined SessionStart that signals via a source enum (compact | resume | startup | clear). Each hook gets structured metadata. PreCompact lets the memory layer save state before compaction; PostCompact lets it inject the right context back in; the enum on SessionStart lets the memory layer differentiate "warm restart from a checkpoint" from "fresh start that should not inject anything."

That is the entire architectural seam external memory needs. It is also, notably, the seam that Anthropic's built-in memory primitive does not need, because the platform's own memory has internal visibility into the compactor.

The arrivals
What happened next is the interesting part.

wazionapps arrived a day later, on April 13, building NEXO Brain. He confirmed transcript_path is unreliable at PreCompact fire time and proposed a Stop + PreCompact two-hook contract: Stop writes a lightweight checkpoint on every turn, PreCompact reads it as a guaranteed-fresh fallback when transcript access fails. This is the kind of nuance you only know if you have run a memory layer in production long enough to see transcript_path come back null mid-session.

junaidtitan arrived on April 17 with a different angle. He is the curator of the awesome-claude-code and related lists, so he sees the entire memory-tool ecosystem from above. His contribution was the recognition that this is a coordination problem: every memory tool implements lifecycle adapter logic, and every one will rediscover the same edge cases unless the surface is documented.

Haustorium12 arrived on April 26 with Continuity v2, a 32-star memory layer that already ships PreCompact/PostCompact hooks against the current undocumented surface. He sharpened the SessionStart.source enum claim, arguing it needs formal semantics: compact triggers warm restart from checkpoint, resume triggers project state re-orientation, startup and clear deliberately inject nothing. Without that contract, every memory layer reverse-engineers the same branching, gets it subtly wrong, and ships silent regressions on Claude Code updates.

By the end of April, four memory-tool authors were converging on the same spec from four different production codebases.

I arrived on April 29 with world-model-mcp, a temporal knowledge graph with PreToolUse constraint enforcement and PostCompact auto-injection. My contribution was naming the defer enforcement tier (a third decision between deny and warn that pauses headless agents on recurring violations) and pointing at the v0.6.0 transcript-pointer primitive as evidence that the spec was implementable today on the existing hook surface, even partly.

kehansama arrived on June 3 with AgentRelay, a cross-agent memory middleware. He added three concrete spec items the thread had not named: context budget awareness (the memory layer needs to know how many tokens are available post-compaction), session provenance (knowing whether this is a fresh start, a resume, or a continuation), and project scope signals (which files and paths were touched in this session). He also proposed a fifth hook, compact-override, that lets the memory layer provide a replacement compaction summary instead of accepting the default LLM summarization.

Patdolitse arrived on June 4 with piia-engram, a 162-star local-first persistent identity layer that launched the same day. He made what is, in my reading, the load-bearing technical contribution of the entire thread. His argument:

The thread has converged on faithfulness of the summary, whether the replace-mode output dropped a key or hallucinated a value. That matters, but it's faithfulness of the summary to the transcript. There's a second axis underneath it, the trustworthiness of the underlying facts themselves. When a memory layer injects recalled context back into the post-compaction window, the model has no way to tell which of those items the user actually confirmed and which the layer inferred on its own and never had checked. Replace mode makes this sharper, because a confidently re-injected inference now carries the same authority as a user-stated fact, and the one place a human could have caught it, the original conversation, is exactly what just got flattened.

His proposed addition: additionalContext should carry per-item provenance, not just content. At minimum asserted_by (user or agent) and confirmation_state. So PostCompact can return "here are three things I'm restoring that you never actually confirmed" instead of silently promoting inferences to ground truth.

That comment turned the thread from a lifecycle-hook proposal into a memory-correctness spec. The two are related but they are different categories of guarantee. Patdolitse's framing collapses them.

What seven people independently agreed on
By June 4, with seven contributors writing into a single GitHub issue with no maintainer participation, the working group has implicitly produced a spec sketch:

Four lifecycle hooks: Stop (per-turn checkpoint), PreCompact (structured save before flattening), PostCompact (re-injection with receipts), and SessionStart with a normative source enum (compact | resume | startup | clear).

Two payload extensions: every hook payload carries structured metadata: context budget (tokens available), project scope signals (files touched), and session provenance (fresh/resume/continuation).

One correctness primitive: additionalContext carries per-item provenance, not just content. Each restored item declares asserted_by and confirmation_state. The model can then weight a re-injected inference differently from a re-injected user assertion.

One escape hatch: a compact-override hook that lets the memory layer provide its own compaction summary. The default summary is a lossy LLM compression; a memory layer with structured state can produce something higher fidelity.

None of these came from a single author. Each emerged from a specific production failure mode that one of the contributors hit and the others recognized. That is what makes the thread credible.

The fifth layer
Ry Walker, in a survey of agentic-skills frameworks published in February, articulated a four-layer aspirational stack: AGENTS.md for context, SKILL.md for knowledge, methodology for discipline, orchestration for scale. He flagged a gap in his own conclusion: "there's no standard for agent-to-agent coordination, shared state, or cross-agent review."

The #47023 working group is filling that gap. The fifth layer is runtime memory and shared state: the substrate that runs underneath the other four, persists what the agent learned, enforces constraints at the tool-call boundary, and survives the compactor.

Six of the seven thread participants ship production implementations of pieces of this layer:

Author Tool What it ships
Isaac (immartian) Bellamem Belief-graph memory, importance over recency
wazionapps NEXO Brain Persistent memory + identity across sessions
Haustorium12 Continuity v2 Session indexing with PreCompact/PostCompact hooks
SaravananJaichandar world-model-mcp Temporal knowledge graph with constraint enforcement
kehansama AgentRelay Cross-agent memory middleware (pre-public)
Patdolitse piia-engram Local-first persistent identity layer
Notice the variation. Belief graphs, session indexes, temporal knowledge graphs, identity layers, cross-agent middleware. Six different architectures, none competing with each other in any direct sense. The standardization they are asking for is at the lifecycle boundary, not the implementation: give us reliable hooks with structured payloads, and we will each pick the data model and storage substrate that fits our users.

This is the shape of a real category forming. Not a vendor war. Not a winner-take-all consolidation. Builders converging on shared infrastructure because they all need the same seams, and writing the seams together because nobody upstream has written them yet.

What this means for Anthropic
The thread has had no maintainer response in fifty-three days. That is not a complaint. Spec proposals on busy issue trackers often take months to be triaged, and the absence of a response is consistent with the proposal being substantive enough that the right Anthropic engineer has not had the budget to engage with it.

But the substance is now too coherent to ignore. The proposal is no longer one author asking for four hooks. It is six independent production memory layers, three of them with stars in the double or triple digits, converging on the same four hooks, the same payload extensions, and the same correctness primitive. Anthropic could ship this spec straightforwardly and the existing memory-tool ecosystem would adopt it within a release cycle. Or Anthropic could ship something different and the ecosystem will spend another six months reverse-engineering whatever they pick. The thread reads as evidence for the former.

There is also a non-trivial commercial pressure. Claude Managed Agents shipped a built-in memory primitive in April. The self-hosted sandboxes shipped in May with the explicit caveat that built-in memory is not yet supported for self-hosted execution. That caveat is a feature gap that external memory layers are positioned to fill, and the lifecycle hook spec is exactly the surface those external layers need to fill it reliably. The two announcements compose: an open lifecycle spec lets the open-source ecosystem fill the self-hosted memory gap without Anthropic having to ship a parallel implementation.

What this means for builders
If you are building anything in this category, the highest-leverage move available right now is reading #47023 in full and contributing the production edge case you have hit that the existing comments have not yet named. The thread is converging, not because anyone declared a deadline, but because each contribution sharpens the spec by a small specific amount. Add yours.

If you are building agents that use any of the six tools listed above, the spec emerging from the thread is the implicit contract that the next generation of these tools will be built against. Knowing the shape of the contract changes how you evaluate them. A memory layer that ships PreCompact + PostCompact + Stop + SessionStart(source) with per-item provenance is doing something architecturally different from a memory layer that ships generic vector retrieval.

If you are an analyst or VC writing about agent infrastructure, the four-layer aspirational stack is the wrong map. Walker's gap is the right map. The fifth layer is forming in public on a GitHub issue, and the people forming it are the authors of the production tools you should be tracking.

Postscript: how this artifact was produced
This piece is a synthesis of seven months of comments on a single GitHub issue. Every quote and architectural claim above is traceable to a specific named author and a specific comment in the thread. None of the contributors knew this synthesis was coming when they wrote their contributions. I am one of the contributors and I have done my best to flatten my own voice and represent the others' arguments at their strongest, but readers should treat that disclosure as load-bearing.

The thread itself is at https://github.com/anthropics/claude-code/issues/47023. It is open to new contributions.

Five primitives I exercised end-to-end on world-model-mcp's own repo

Saravanan Jaichandaran — Thu, 21 May 2026 05:36:21 +0000

I shipped four releases of world-model-mcp in twelve days. v0.6.1 to v0.7.2. The pitch is "AI coding agents lose context across compaction, repeat the same mistakes, and hallucinate APIs that do not exist." Before I write more about it I wanted to demonstrate the primitives on a real codebase, with real outputs, not screenshots someone has to take my word for.

The codebase is the project's own repo. I ran python -m world_model_server.cli setup (it auto-seeded 598 entities from the source), then ran scripts/demo_seed.py which inserts the small set of constraints, facts, and a compaction audit row that real PostToolUse / record_correction hook activity would write organically over one to two weeks of development with Claude Code installed.

Every output block below is verbatim from the actual SQLite database after running the actual command. You can reproduce every output here by cloning the repo, running python -m world_model_server.cli setup, then python scripts/demo_seed.py. The script is idempotent and supports --dry-run and --reset.

Install: pip install world-model-mcp. Source: github.com/SaravananJaichandar/world-model-mcp.

1. A learned constraint denying an edit at the PreToolUse boundary

When a developer corrects the agent (rewrites console.log to logger.debug), the PostToolUse hook records the diff and infers a rule. Once that rule's violation count crosses the hard-threshold (severity=error, count ≥ 3), the next attempt is denied at PreToolUse before the tool runs.

The constraint as the graph stores it:
{
"rule_name": "no-console-log",
"severity": "error",
"violation_count": 5,
"description": "Use logger.debug() not console.log() in TypeScript source. Production logs route through pino; console.log bypasses formatting and breaks downstream parsers.",
"file_pattern": "*.ts",
"examples": [
{"incorrect": "console.log", "correct": "logger.debug"}
]
}
The PreToolUse hook's actual JSON response when an edit containing console.log reaches it:
{
"hookSpecificOutput": {
"hookEventName": "PreToolUse",
"permissionDecision": "deny",
"permissionDecisionReason": "Hard constraint violation: no-console-log (Use logger.debug() not console.log() in TypeScript source. Production logs route through pino; console.log bypasses formatting and breaks downstream parsers.). Violated 5 times previously."
},
"violations": [
{
"rule": "no-console-log",
"severity": "error",
"violation_count": 5,
"is_hard": true,
"is_defer": false
}
]
}
Rules in CLAUDE.md or AGENTS.md are advisory and the model treats them as suggestions. Rules with a violation count and an enforcement boundary at the edit step are binding. Both have the same source — a developer correcting the agent — but very different effect.

2. A regression warning that flags edits to a file with a recorded bug fix

get_related_bugs walks decision traces and prior bug-fix facts. When validate_change runs on a file with a recorded fix, the related-bugs query surfaces the prior fix and flags the proposed change.

The project has a bug fix on file world_model_server/knowledge_graph.py:120-135 for content-hash backfill (the migration logic must run on every initialize(), not just when the column is created). I proposed a refactor that removed the backfill loop and ran the related-bugs check:
{
"risk_score": 0.6,
"bugs": [
{
"bug_id": "12457e2a-5638-46ec-a9df-02fe13b9c104",
"description": "Bug fix: NULL content_hash backfill must run on every initialize() to cover post-migration inserts. Earlier code only backfilled when the column was created, which left merge_from rows un-hashed and broke dedup.",
"fixed_at": "2026-05-10T10:17:51.737046",
"critical_regions": [
{"file": "world_model_server/knowledge_graph.py", "lines": "120-135"}
]
}
],
"warnings": [
"Lines 120-135 preserve fix for 12457e2a-5638-46ec-a9df-02fe13b9c104: Bug fix: NULL content_hash backfill must run on every initialize() to cover post-migration inserts. Earlier code only backfilled when the column was created, which left merge_from rows un-hashed and broke dedup."
]
}
The risk score is 0.6 because the proposed change touched a critical region without re-implementing the fix. The warning text quotes the original bug description directly so the agent (or the human) can see why the region matters, not just that it does.

3. A contradiction resolved by confidence + source-count weighting
The temporal layer assigns each fact a confidence score, a source_count, and a valid_at timestamp. When two facts about the same entity disagree, find_contradictions surfaces them with both sides' metadata, and resolve_contradiction picks a winner using the strategy you set.

Two facts both pointing at the same entity (http_transport_port):
{
"fact_a_id": "e4b2ff84-8c23-4de5-aa9e-8bbb045a4ed5",
"fact_b_id": "7fe854f9-d64a-4304-b43a-7d1b126c6ebb",
"fact_a_text": "HTTP transport listen port default is 8080",
"fact_b_text": "HTTP transport listen port default is 8765",
"similarity_score": 0.929,
"both_valid": true,
"reason": "same entity, similar text",
"confidence_a": 0.7,
"confidence_b": 0.95,
"source_count_a": 1,
"source_count_b": 3
}
resolve_contradiction(strategy="auto") picks the strategy with the largest signal gap. Here source count differs 3:1, so it picks keep_most_sources:
{
"strategy": "keep_most_sources",
"winner_id": "7fe854f9-d64a-4304-b43a-7d1b126c6ebb",
"loser_id": "e4b2ff84-8c23-4de5-aa9e-8bbb045a4ed5",
"resolved_at": "2026-05-21T10:24:16.287368"
}
The loser is updated in place:
{
"id": "e4b2ff84-8c23-4de5-aa9e-8bbb045a4ed5",
"fact_text": "HTTP transport listen port default is 8080",
"status": "superseded",
"invalid_at": "2026-05-21T10:24:16.285184",
"confidence": 0.7
}

Queries that ask "what's true now?" silently skip the superseded fact. Queries that ask "what was true on 2026-05-18?" still see it. That's what the temporal layer earns.

4. The PostCompact injection bundle
v0.7.0 added a PostCompact hook that re-injects the top constraints and recent canonical facts after the agent's context is compacted. The bundle is small (configurable, default ~10 constraints + 10 facts) and prioritized.

The actual bundle returned by get_injection_context(event_type="PostCompact", max_constraints=5, max_facts=5):

Active constraints (top by violation count)

no-console-log: Use logger.debug() not console.log() in TypeScript source. Production logs route through pino; console.log bypasses formatting and breaks downstream parsers. (violated 5x)
check-twine-before-tag: Run python3 -m twine check dist/* before tagging. Catches PyPI metadata errors before the tag is pushed; saves a retraction. (violated 5x)
tag-before-upload: Always run git tag + git push --tags before twine upload. PyPI is permanent; an untagged upload pins a wheel to no git ref. (violated 2x)

Recent canonical facts

Never run twine upload before git tag. Always tag, push, then upload to PyPI so the published wheel maps to a real git ref.
Cursor hooks.json uses object-keyed schema with version: 1 (integer), preToolUse / preCompact / beforeSubmitPrompt event names, failClosed (not fail_open), timeout in seconds.
PostCompact and UserPromptSubmit hooks emit additionalContext to splice constraints + recent facts back into agent context after compaction.
HTTP transport defaults to port 8765 in Dockerfile.http; do not change without updating docs/deployment/mcp-tunnel.md and docker-compose.yml together.
BetaAbstractMemoryTool subclass lives at world_model_server/memory_backend.py; required by the Anthropic SDK Managed Agents memory path.

That bundle is what gets spliced into the agent's working context as additionalContext after a compaction event. The same query also runs on UserPromptSubmit, biased toward whatever the user just asked about.

The compaction audit log records what happened, queryable via the CLI:

$ world-model audit-compactions --limit 5
1 compaction audit rows
2026-05-21T10:38:01.606771 session=demo-session-1 pre=84320
post=22150 facts_injected=10 constraints_injected=3 event=PostCompact
pre=84320, post=22150 — the compaction dropped ~62k tokens of context. The injection put 10 facts + 3 constraints back. The audit row exists so a human can later answer "what did the agent see vs what did it lose."

5. A defer decision that pauses a headless agent

v0.7.0 added a defer enforcement tier between deny and warn. Warning-severity violations with violation_count ≥ 5 return permissionDecision: "defer" when the client advertises support, so headless agents pause instead of silently passing or hard-blocking. Clients that do not advertise support fall back to ask automatically.

I have a check-twine-before-tag constraint with violation_count=5, severity=warning. When a Bash tool input matches it, the hook returns:
{
"hookSpecificOutput": {
"hookEventName": "PreToolUse",
"permissionDecision": "defer",
"permissionDecisionReason": "Recurring warning-level violations (check-twine-before-tag). Headless agents should pause for confirmation."
}
}
Same payload, same constraint, but with supports_defer: false in the request — fall back to ask:
{
"hookSpecificOutput": {
"hookEventName": "PreToolUse",
"permissionDecision": "ask",
"permissionDecisionReason": "Recurring warning-level violations (check-twine-before-tag). Headless agents should pause for confirmation."
}
}
The defer tier exists because the binary deny / warn choice forces you to either be too strict or too permissive. Recurring warnings that don't rise to error-level should pause for a human, not block, not pass.

What this means if you are building agents

The reason this works is not that the tool is clever. It is that the substrate — a temporal knowledge graph with facts, constraints, contradictions, and decision traces — captures the right shape of information.

Plain markdown rules in CLAUDE.md cannot answer:

How many times has this rule been violated?
Which fact is true now vs three sessions ago?
Which constraints should I re-inject after compaction?
Which prior fix does this proposed change risk re-introducing?

A graph can. The cost is one MCP server, ~2,000 lines of Python, and a SQLite database that sits at ~155 KB empty (mine grew to about 2 MB after running this exercise plus the auto-seed). The payoff is a memory layer that survives compaction, enforces at the edit boundary, and tracks evidence chains back to the source.

If you are building anything with Claude Code, Cursor, or any harness that supports MCP + hooks:

pip install world-model-mcp
cd /your/project
python -m world_model_server.cli setup

For Claude Managed Agents with self-hosted sandboxes (where Anthropic's built-in Memory primitive is not yet supported), v0.7.2 added streamable HTTP transport so the same 25 MCP tools also work behind an MCP tunnel.

Source: github.com/SaravananJaichandar/world-model-mcp.

If world-model-mcp helped you, star the repo or open an issue with what worked or didn't. I read every one.