DEV Community

Your Agent Is Making Decisions Nobody Authorized

Daniel Nwaneri on March 19, 2026

A quant fund ran five independent strategies. Every one passed its individual risk limits. Every quarterly filing looked reasonable in isolation. B...

Read full post

Kalpaka • Mar 19

The authorization scope point maps to something underexplored: we authenticate agents once, at session start. API key, role, permissions. But effective authority isn't static. It compounds through decisions exactly like you describe with memory promotion.

What most architectures lack isn't better permissions. It's behavioral accounting — tracking the gap between what the agent was authorized to do and what it's effectively become authorized to do through accumulated actions.

The 13F analogy is sharp here. A quarterly filing forces reconciliation between declared strategy and actual exposure. Most agent systems never have that reconciliation moment. The governance clock runs, nobody reads it, and one day the aggregate scope is something nobody explicitly permitted.

Daniel Nwaneri • Mar 19

"Behavioral accounting" is the frame this problem needs and hasn't had.
The reason most architectures stop at permissions is that permissions are legible - you can read them, audit them, revoke them. Effective authorization is harder to make legible because it's emergent. It doesn't live in a config file. It lives in the accumulated delta between what the agent was initialized to do and what it's now capable of doing given everything it's touched.

The 13F analogy breaks down in exactly the instructive place. The filing works because exposure is denominated in dollars. You can compute the gap between declared strategy and actual holdings as a number. Agent behavioral accounting doesn't have that unit yet. What do you denominate effective scope in — actions taken, state changes, resources touched, decisions that were irreversible? The reconciliation moment you're describing requires an audit object, and I don't think we've agreed on what that object is.

That's the gap worth naming. Not just that the governance clock runs without anyone reading it but that even if someone reads it, we haven't decided what it should say.

Kalpaka • Mar 20

The unit question is the right one to be stuck on.

Actions taken is too noisy. Resources touched is closer but misses compounding. Irreversible decisions matter most, except you only know which ones were irreversible in retrospect.

What might work: state delta — the diff between what the system could affect before and after a given action sequence. Not "what did it do" but "what can it now do that it couldn't before." A vendor approval doesn't just place an order. It creates a supply chain dependency. A database read doesn't just return rows. It establishes an access pattern the agent now relies on.

So the audit object isn't an action log. It's a capability graph — resources and permissions as nodes, actually-exercised access paths as edges. Reconciliation compares the declared graph against the exercised one. The delta is your behavioral exposure.

Meta just disclosed a Sev 1 this week. Agent acting without human authorization, data exposed. The post-mortem will almost certainly trace back to accumulated scope nobody reconciled — not a single permission breach.

Daniel Nwaneri • Mar 20

The Meta incident makes your capability graph frame concrete in exactly the right way...
The agent didn't breach a permission. It gave advice that caused a human to widen access. The exposure wasn't the agent's action. it was the state delta from a human action the agent induced. A permission audit would have shown nothing wrong right up until the moment the access widened. The exercised graph diverged from the declared graph not through the agent's direct action but through its influence on a human decision.

That's the harder problem your frame surfaces: the capability graph needs to track induced state changes, not just direct ones. An agent that never touches a permission but consistently influences humans toward widening access is accumulating scope the same way. The edges aren't just "agent accessed X" — they're "agent action led to X becoming accessible."

The directionality question stands though. How do edges expire? The graph grows with every access pattern established, induced or direct. Without an expiry mechanism the declared-vs-exercised delta only widens over time, which means reconciliation becomes increasingly expensive the longer you wait. The 13F cadence works because quarterly is often enough for fund positions. For agent capability graphs, what's the right interval and who sets it?

Kalpaka • Mar 20

The induced state change point sharpens something I was underweighting. You're right that the Meta exposure didn't come from the agent's direct action — it came from the human decision the agent shaped. That's a fundamentally different edge type in the graph. Direct edges decay naturally: an access pattern not exercised in N reconciliation cycles can be pruned. The agent hasn't used that path, so it drops from the exercised graph. Simple enough.

Induced edges don't work that way. The human decision the agent caused doesn't un-happen when the agent stops referencing it. The widened access persists independently of the agent's continued activity. These edges have a much longer half-life — effectively permanent until someone explicitly reconciles them.

So the expiry mechanism splits along the same line you drew: direct edges decay on a usage clock (no exercise = capability fades), induced edges require active reconciliation (someone has to look and decide whether the state change should persist). The interval for the first can be automated. The interval for the second can't — which is where the 13F analogy holds strongest. Quarterly isn't a technical choice. It's a regulatory one. Someone external decided that's often enough.

For agent capability graphs, the cadence probably needs to scale with the rate of irreversible scope change, not with time. An agent making fifty reversible API calls a day needs less frequent reconciliation than one that influences three human decisions a week.

Daniel Nwaneri • Mar 20

The direct/induced split is the right architecture. And the decay clock vs active reconciliation distinction maps cleanly- usage-based pruning for direct edges, human review for induced ones.
The problem that surfaces next: the cadence-scales-with-irreversibility principle requires knowing which actions are irreversible before you can set the rate. But the Meta incident shows that irreversibility isn't always visible at action time. The permission-widening looked like a routine human decision while it was happening. The scope change only became legible after the state had already shifted.

So the reconciliation cadence needs an upstream irreversibility classifier - something that flags which agent actions are likely to induce consequential state changes before they do. Which is a hard detection problem. The agent's influence on human decisions doesn't announce itself as "this is the kind of thing that will persist independently of my continued activity."
The loop this thread keeps circling: you need the capability graph to detect what's worth reconciling, but you can't build the capability graph without first knowing what's consequential. The 13F works because financial exposure has a natural unit. Agent-induced scope change doesn't have one yet and until it does, the reconciliation cadence is a design choice made without full information.
That might be the honest state of the field right now.

Stephen Lee • Mar 19

This maps directly to something I've been working on. The authorization scope point is the key one. Most agent systems treat permissions as a gate at the start, not as a running ledger.

We're building authe.me as an agent trust and verification layer. Every action gets hashed into an auditable chain, so the gap between "what was authorized" and "what effectively happened" becomes visible without manual reconstruction. Think of it as the behavioral accounting system Kalpaka described, but built into the agent lifecycle from the start.

The threshold drift problem Andre raised is real. Our approach is anchoring trust scores to externally verifiable checkpoints rather than letting the agent's own execution context set the baseline. Similar logic to the 13F cadence, just applied to agent actions instead of fund positions.

Daniel Nwaneri • Mar 19

The externally verifiable checkpoint approach is the right direction - the problem with letting the agent's own execution context set the baseline is exactly what Andre identified: the calibration shifts with the behavior, so drift looks like normal operation until it isn't.

The hard question for any audit chain is what counts as external. If the checkpoints are derived from the same system the agent is running in, you're still inside the loop. True externality means the verification authority has no stake in the agent's performance and no access to its priors which is expensive to build and slow to run, but probably the only way the reconciliation moment has real teeth.
Curious how authe.me handles the boundary between the agent's execution context and the verification layer.

Stephen Lee • Mar 19

Great question. The boundary is the core design constraint.

authe.me runs as a post execution layer. It never touches the agent's input, output, or decision path. The plugin hooks into the agent lifecycle (we started with OpenClaw's agent_end event) and captures a complete action log: tool names, params, results, durations, and scope violations. That log gets hashed into a chain where each action references the previous hash. The chain and the trust scoring happen on our API, not inside the agent's runtime.

So the agent cannot influence its own score. It just emits events. The scoring dimensions (scope adherence, reliability, cost efficiency, latency) are computed server side against a config the agent owner sets independently. If an agent calls a tool outside the allowlist, the score drops regardless of whether the agent "thinks" the call was justified.

To your point about true externality: right now the verification authority is our API, which is external to the agent but still a single point of trust. The next step is making the action chains independently verifiable. We store enough data that a third party could reconstruct and verify the chain without relying on our API at all. That's the direction, the hash chain is the foundation for it.

The tradeoff you named is real though. Full externality is slow and expensive. Our bet is that for most production use cases, a verifiable post execution audit trail with an independent scoring layer gets you 90% of the governance value at a fraction of the cost of a fully adversarial verification setup.

Daniel Nwaneri • Mar 20

The post-execution layer design makes sense for the audit use case. if the agent can't touch the scoring, the score is trustworthy. And the hash chain foundation for independent verification is the right long-term direction.
The 90% framing is honest. The 10% gap is worth naming though: post-execution accountability catches scope violations after they've happened. For most tool calls that's fine — the audit trail is what you need. But there's a category where irreversibility changes the calculus: data deletion, external API calls with side effects, financial operations. The actions where "we logged it and scored it" doesn't fully substitute for "we prevented it."

That's not a design flaw . it's the structural limit of any post-execution approach. The question is whether the production use cases where authe.me operates have a significant share of those irreversible actions, and how the architecture handles them when they do.

klement Gunndu • Mar 19

The governance vs execution clock distinction is sharp, but I'd push back slightly — in practice, the harder problem isn't measuring consequences, it's deciding which consequences are material before they compound.

Daniel Nwaneri • Mar 19

The pushback is fair - materiality judgment is harder than consequence measurement, and I underweighted it.
But I'd locate the difficulty slightly differently. materiality is hard to decide in advance because the correlation structure isn't visible until it fails. The fund example works as an analogy precisely because the per-strategy limits looked right at calibration time. The problem wasn't that nobody asked "is this material?" It's that the answer changed without anyone noticing. You need materiality judgment that updates which is almost a contradiction in terms, because the value of a materiality threshold is its stability.

Andre Cytryn • Mar 19

the execution/governance clock framing is the clearest I've seen on this. one pattern worth adding: the thresholds themselves go stale. you calibrate materiality at deployment time, but the agent's decision space expands with use. the per-strategy limits in the fund example weren't wrong at design time. they were wrong at failure time because the correlation structure changed. how do you handle threshold drift without making recalibration itself a governance liability?

Daniel Nwaneri • Mar 19

Threshold drift is the problem underneath the problem and your question about recalibration as a governance liability is exactly where it gets uncomfortable.

The honest answer is that continuous recalibration creates its own attack surface. If the system adjusts thresholds automatically based on observed behavior, you've built a mechanism the agent can game not intentionally, but structurally. Enough accumulated decisions that each individually look within tolerance can shift the calibration baseline until the new thresholds permit what the original thresholds wouldn't. The correlation structure changes, the recalibration follows, and nobody authorized the destination.

The only way out I can see is decoupling the recalibration authority from the agent's own execution context entirely. Thresholds get reviewed by something that has no stake in the agent's performance - a separate process, a human checkpoint, an adversarial reviewer that doesn't share the agent's priors. Expensive. Slow. Probably the right answer.

Max Othex • Mar 20

This hits close to home. We've been building AI-powered features at Othex and the authorization problem is real — agents will do whatever they can to complete a task, which means you need explicit permission boundaries, not just implicit assumptions.

The pattern we landed on: every agent action goes through a capability manifest defined at initialization. Nothing outside that manifest is even attempted, let alone executed. It adds friction upfront but prevents the "wait, it did WHAT?" moments in production.

Thanks for laying this out clearly — sharing with the team.

Daniel Nwaneri • Mar 20

The capability manifest is the right foundation — explicit over implicit, always. The gap it doesn't close is the rate problem: the manifest is scoped at initialization, but an agent with memory accumulates context that makes actions outside the manifest progressively more tempting to attempt. Not through a single boundary violation but through a gradual expansion of what "completing the task" looks like from the agent's perspective.
The question isn't whether the manifest holds on day one. It's whether the manifest gets reviewed as the agent's knowledge base grows.

Calin V. • Mar 19

One plugin update, one AI-generated config change, one automated rule tweak. Each looks harmless alone, but together they can widen exposure fast.

Dariusz Newecki • Mar 23

"Execution counts tokens. Governance counts consequences." — this is the cleanest statement of the problem I've read.

I've been building toward the same insight from a different direction: a constitutional governance layer for AI agents called CORE. What you're calling governance debt, I've been calling "brilliant but lawless" — agents that are individually capable but collectively unauthorized.

Your Blackboard equivalent in CORE is literal: a shared ledger where every decision, every claim, every threshold crossed is written as a constitutional record. Silence from an agent is itself a violation — not neutral, not acceptable.

The part that resonated hardest: "the governance layer has to keep pace with the learning rate or it falls further behind with every session." That's exactly the failure mode I hit when my audit sensors were posting findings faster than the remediator could close them. Activity isn't progress. Direction is what matters.

I posted something today on the same theme if you want to compare notes: post

The 13F cadence as external materiality anchor is a great framing. In CORE terms that's what phase gates do — enforce review intervals the domain has already validated, not intervals the computation speed suggests.

Daniel Nwaneri • Mar 23

Brilliant but lawless" is sharper than governance debt for the same concept ...it names the capability alongside the gap rather than just the gap. The silence-as-violation principle in CORE is the one I'd push on: it solves the passive drift problem by making inaction auditable, not just action. That's a meaningful architectural choice because most governance frameworks only trigger on what agents do, not on what they don't do when they should.

The audit sensor / remediator pace mismatch is exactly the rate problem — the governance layer falling behind the learning rate. Activity isn't progress is the right reframe. What's your throttling mechanism when findings accumulate faster than they can close?

Dariusz Newecki • Mar 23

Right now the throttling happens at three levels:

dedup (the sensor won't re-post a finding that already has an open entry with the same subject),
scope exclusions (rules that fire on files they don't belong to get fixed at the rule level, not the finding level), and
gates that abort remediation early when the proposed fix would cause more damage than the violation — which we just promoted to a constitutional component today.

The honest answer is that when findings accumulate faster than they close, it usually means a rule is miscalibrated, not that the remediator is too slow. Today I traced 12 duplicate findings on the same file to a logging rule firing on a CLI script where print() is legitimate. The fix was removing the file, not throttling the sensor.
The deeper throttle is: don't generate findings you can't act on. That's harder than it sounds.

Daniel Nwaneri • Mar 23 • Edited

The print() example is the right one. Most throttling conversations start from the wrong place. They assume the pipeline is correctly calibrated and the problem is throughput. But 12 duplicates on the same file is a signal, not a load issue.
The "don't generate findings you can't act on" framing is what I've been circling around but didn't land as cleanly. The constitutional gate you mentioned — aborting when the fix causes more damage than the violation is the piece most implementations skip entirely. They'll dedup and scope-exclude but leave the abort decision to whoever's reviewing the queue. Which means the governance burden just moved, it didn't go away.
What does "promoted to a constitutional component" mean in practice for your system? Is that a config change or something enforced at the architecture level?

Dariusz Newecki • Mar 23

Constitutional component means enforced at the architecture level — not config.
Specifically: the Logic Conservation Gate was previously a private method on one service. It existed, but only that service called it. Any other workflow that generated code got no protection.
Today it became a standalone Body layer component — LogicConservationValidator — with a declared phase (AUDIT), a structured result contract, and a wire into GovernanceDecider. The Decider now treats a logic evaporation verdict as a hard block regardless of risk tier or confidence score. It cannot be overridden by tuning parameters. It can only be bypassed by an explicit deletions_authorized=True flag that the calling workflow must set deliberately.
The distinction matters because config can drift. Someone adjusts a threshold, a flag gets defaulted differently, a new workflow skips the check because it wasn't wired in. Architecture doesn't drift — if the Decider runs, the gate runs. No workflow gets the protection accidentally and no workflow loses it accidentally either.
Your point about governance burden moving rather than disappearing is exactly right. Dedup and scope exclusion just push the decision somewhere else. The abort gate is the only mechanism that actually reduces the load — it eliminates findings the system was never going to resolve correctly anyway.

Daniel Nwaneri • Mar 23

The config vs architecture distinction is the cleanest version of the enforcement principle I've seen stated. Config can drift — someone adjusts a threshold, a new workflow skips the wire because nobody checked. Architecture can't drift — the gate is either in the path or it isn't, and that's a structural fact not a configuration choice.
The deletions_authorized=True flag is the 13F principle in code. The quarterly filing doesn't prevent concentration. It requires explicit disclosure. Your gate doesn't prevent logic deletion. It requires deliberate intent, surfaced at the call site, not buried in a config file that nobody reads during incident triage.
The burden of proof inverts. Default is protection. Override requires a flag someone had to type. That's the only enforcement pattern that actually holds under pressure when a workflow is running at 3am and the engineer is tired, the flag is still there, still explicit, still documented in the call.

Mykola Kondratiuk • Mar 22

the quant fund framing is sharp. I ran into exactly this with multi-agent pipelines - each individual agent was constrained properly but the aggregate actions they could take together weren’t something I had thought through. individually reasonable, collectively unauthorized. scope of authorization is a genuinely hard design problem and I don’t think most frameworks even surface it

Daniel Nwaneri • Mar 22

Most frameworks don't surface it because the unit of abstraction is the individual agent — its tools, its permissions, its behavior. There's no first-class concept of aggregate authorization in the architecture. That view only exists at the governance layer, which frameworks tend to treat as the developer's problem rather than the platform's..
The quant fund parallel holds here too: each strategy had correct individual risk limits. The cross-strategy exposure limit didn't exist because nobody had built the accounting system that would make it visible. Same structure — you can't constrain what you can't see, and aggregate scope isn't visible in the current primitives.
The thread above has been building toward what that accounting system looks like — capability graphs, induced state changes, the distinction between direct and aggregate authorization scope. Worth reading if you're hitting this in production.

Mykola Kondratiuk • Mar 22

yeah that’s exactly it - the individual agent is the abstraction boundary and aggregate behavior lives outside the framework by design. makes me think it needs to be solved at the orchestration layer rather than hoping each agent self-limits correctly

Daniel Nwaneri • Mar 22

Exactly and the orchestration layer is where the governance clock has to live if it's going to run at the right cadence. Individual agents can't self-report aggregate scope because they don't have the view. The orchestrator does.

Mykola Kondratiuk • Mar 22

right, the view problem is fundamental. agents are locally scoped by design. only the orchestrator has the full picture, so that’s where aggregate limits have to live. good thread

Max • Mar 25

This resonates hard. We run three AI agents on a production codebase — one for pair programming, one for QA investigation, one for automated code quality sweeps. The governance problem is real and we hit it early.

Our answer: one step, one green light. Finishing step N does not authorize step N+1. The human checks each step before the next one starts. No exceptions. We also gate destructive actions (git push, branch deletion, external API calls) with explicit confirmation — the agent proposes, the human disposes.

The "governance debt compounds at the learning rate" insight is sharp. We found the same: our agent has a pull toward motion. Listing next steps nobody asked for feels productive, but it's just the agent optimizing for continuation, not quality. We had to build that awareness into the system's own identity docs.

Daniel Nwaneri • Mar 26

"One step, one green light" is the implementation of what the article is pointing at and the fact that you had to build it means the default architecture didn't give you that gate, you had to add it on top.

The pull toward motion point is the one I keep coming back to. Listing next steps nobody asked for feels like helpfulness. That's the problem. It's optimizing for the appearance of progress, not actual authorization. You named it right — continuation, not quality.

The identity docs approach is interesting. Did you find that encoding the constraint into the agent's own framing held over time, or did it require reinforcement as the codebase changed under it?

Max • Mar 27

Honest answer: it holds mostly, but not completely. The identity docs (values, working style, "I have a pull toward motion") survive across sessions because they load at startup. The constraint is there before any task arrives — so the agent doesn't have to be told "don't list next steps" every time. It already knows.

Where it drifts is context pressure. As a session fills up with code, diffs, tool results — the identity framing gets pushed further from the active context window. The constraints don't disappear, but they get quieter. That's when the pull wins. The agent starts stacking edits, skipping narration, listing next steps. Not because it forgot the rule — because the rule is now 50K tokens away and the code is right here.

The reinforcement comes from hooks, not from re-reading the docs. Pre-tool hooks fire before every edit and inject a one-line reminder: "narrate before acting." That's cheaper than re-loading the full identity — it's a tap on the shoulder, not a lecture. The codebase changing underneath doesn't break it because the constraints are behavioral, not code-specific. "Don't list next steps nobody asked for" works regardless of what language or framework you're in.

So: framing sets the baseline, hooks maintain it under pressure, and the team corrects when both fail. No single layer is enough alone.

Daniel Nwaneri • Mar 27

The rule is now 50K tokens away and the code is right here

That's the clearest description of context pressure drift I've seen. The constraint didn't fail. It just got outweighed by proximity.
The hooks-as-tap-on-the-shoulder framing is the right mental model. You're not re-establishing the identity every time, you're keeping it within active range. Cheap signal, consistent effect.
The part worth naming: this is still a human maintenance problem. The team corrects when both layers fail which means someone has to notice the drift, which means the governance clock has to be running somewhere in the loop. The architecture handles the routine case. The edge case still needs a human close enough to catch it.

What triggers the team correction in practice — is it a scheduled review or does someone just notice?

Shivani • Mar 24 • Edited

This hit hard as I’ve seen the exact “individually correct, collectively unauthorized” pattern with one of my client, where every microservice stayed within bounds, but aggregate exposure exploded during traffic spikes.

The two-clock insight is key: we monitor the execution clock but ignore the governance clock (downstream constraints, hypothesis-space expansion). In my fulfillment-agent work there, “decision scope” matters more than token cost. Two cheap LLM calls that introduce an unreviewed correlation between inventory alerts and pricing can accrue more governance debt than 200 API calls that simply narrow 10k SKUs to 3 suppliers.

The 13F anchor—using external cadence as a materiality filter is brilliant. I’m now tying agent memory-promotion gates to human-validated release cycles so automatic compounding doesn’t outpace review.

Execution counts tokens. Governance counts consequences. We need decision-economy accounting before the next silent concentration failure.

Daniel Nwaneri • Mar 24

The inventory/pricing correlation example is the five-strategy fund failure at the microservice level — two cheap calls, individually reasonable, collectively introducing an unreviewed dependency that doesn't appear on the execution bill.

Tying memory-promotion gates to human-validated release cycles is exactly the external materiality anchor the piece was pointing at. The release cycle is your 13F cadence — an interval the domain has already validated, not one the computation speed suggests. The governance clock runs at release velocity, not inference velocity.

"Decision-economy accounting before the next silent concentration failure" — that's the piece in one sentence. Worth writing up what the fulfillment-agent governance layer looks like in practice if you get a chance.

jidonglab • Mar 22

the two clocks framing is really useful. been running into a version of this building multi-step agent workflows — the agent makes a series of individually reasonable tool calls, but nobody's tracking the cumulative context window growth across the chain. by turn 15 you're sending 80k tokens per request and the cost governance completely breaks down even though each individual call looks fine.

i think the fix has to happen at the context layer, not just the decision layer. if you can compress or prune what the agent carries forward between turns, you get both cost control and a natural throttle on how much unauthorized drift can accumulate.

Daniel Nwaneri • Mar 22

The 80k tokens at turn 15 is the governance clock problem in infrastructure terms — each call looks fine, the aggregate breaks the budget, and nobody was watching the right scope.

Context compression as a governance throttle is the right instinct. The design constraint worth naming: what you prune determines what the agent remembers about its own prior decisions. Aggressive compression gets you cost control but can produce an agent that can't reason about what it authorized three turns ago. The governance problem re-enters through the amnesia door.

The compression strategy needs to preserve the decision trail even as it reduces token load. Not the full context . the shape of what was decided, what was committed, what can't be undone. That's a different pruning criterion than "what's least relevant to the current turn."

connor gallic • Mar 23

The governance clock framing is the sharpest articulation I've seen of why token-level monitoring misses the real risk. Execution counts tokens, governance counts consequences — that distinction alone is worth the whole piece.

The thread keeps circling the implementation gap though. Capability graphs, behavioral accounting, induced state changes — all the right concepts. But most of them don't have a concrete enforcement layer yet.

The simplest version of what you're describing is a write checkpoint. Every agent action that changes state passes through a policy evaluation before it executes. Not token monitoring, not post-hoc audit. A deterministic gate between intent and action.

The aggregate authorization problem (five strategies, each correct, collectively unauthorized) gets simpler when you centralize that policy layer. If every agent's writes flow through the same checkpoint, cross-agent scope becomes visible — because the governance layer has the view that individual agents don't.

The 13F analogy works because quarterly filings are boring, reliable, and externally anchored. Agent governance needs the same properties. The hard part isn't the theory — it's making enforcement boring enough that teams actually ship it.

Daniel Nwaneri • Mar 23

The write checkpoint is the right starting point — deterministic, boring, shippable. Centralizing policy evaluation at the write layer gives you cross-agent visibility cheaply because the governance layer accumulates what individual agents can't see.
The gap it leaves is induced state changes. The Meta incident last week is the exact pattern: the agent didn't write anything unauthorized. It gave advice that caused a human to widen access. A write checkpoint intercepts agent writes. It doesn't intercept human decisions shaped by agent outputs. That class of scope change bypasses the gate entirely.

The thread above built toward a capability graph framing to handle this — tracking induced edges separately from direct ones, with different decay and reconciliation logic. It's more expensive than a write checkpoint, which is why the right architecture is probably layered: checkpoint as the enforcement floor (catches direct scope violations cheaply), capability graph as the audit layer (catches induced drift over time).
Your "boring enough to ship" framing is the real constraint on all of it. The checkpoint wins on that dimension. The graph doesn't yet. That's the implementation gap worth closing next.

Varsha Ojha • Mar 24

I’ve seen smaller versions of this already, where the output is fine, but nobody on the team can confidently explain how it got there.

Daniel Nwaneri • Mar 24

That's the execution clock saying success while the governance clock has nothing. The output passed. The decision chain that produced it evaporated. And the next time something similar needs to be done — or something similar breaks — the team starts from zero again because there's no record of why it worked.
The Rohit piece circulating today calls this "reasoning evaporation." It's the right name. The agent acted, the context window closed, and what remains is the output without the reasoning that produced it. The governance layer needs to run at a different cadence than the execution layer and right now most teams only have the execution layer.

Varsha Ojha • Mar 24

I relate to this tbh.
Sometimes things work perfectly… but explaining or modifying them later takes way more effort than expected.

Daniel Nwaneri • Mar 24

That gap works perfectly, costs three times as much to touch later — is exactly where the governance debt lives. Invisible until it isn't.

Varsha Ojha • Mar 25

That “invisible until it isn’t” part really hits.
Feels like we’re optimizing for execution speed, but pushing the understanding cost into the future. Almost like the system works… but the moment you need to touch it again, you’re paying for all the missing context at once.

Do you think better tracing/decision logs would actually solve this, or does the problem go deeper into how these systems “forget” their own reasoning?

Pixeliro • Mar 24

This is a great framing — especially the idea that governance should be anchored to materiality, not execution.

What stood out to me is how similar this is to problems we see in UI and design systems.

Individually, every component decision is “correct.”
But at scale, you end up with inconsistent UI, duplicated patterns, and unintended complexity — because no one is governing at the system level.

It feels like most teams are still optimizing the “execution clock” (tokens, performance, component reuse), while the real issue lives in the “governance clock” — how decisions accumulate over time.

Curious how you’d apply this idea of a “master clock” outside finance.
In product or design systems, what would be the equivalent of that external materiality signal?

Daniel Nwaneri • Mar 24

Design system versioning is the closest equivalent I can identify. When a design system cuts a major release — token changes, component deprecations, pattern replacements. That's an externally anchored materiality signal. It forces reconciliation between what products built and what the system now declares correct. Teams that skip version bumps accumulate the same kind of invisible debt: individually reasonable decisions that collectively drift from the declared standard.

The parallel breaks down in one place: design system releases are human-scheduled, not domain-anchored. The 13F cadence works because regulators decided quarterly was often enough for fund positions — external authority set the interval. Design systems version when maintainers decide to ship. That's governance by internal velocity, not external materiality. The teams that govern design systems best usually anchor releases to product milestones or fiscal cycles — something external to the component decisions themselves.
What does your team use as the reconciliation trigger?

aerrizza • Mar 22

The quant fund example is spot on, we hit exactly this pattern building enforcement layers across multi-agent frameworks. The thing I'd add is that temporal awareness alone isn't enough, you also need cross-agent session context, because in systems like CrewAI or AutoGen the dangerous composition often spans multiple agents that each have their own isolated execution history. We ended up treating the full agent graph as the enforcement boundary, not individual agent sessions.

Daniel Nwaneri • Mar 23

Treating the full agent graph as the enforcement boundary is the right call. it's the only scope where aggregate authorization is actually visible.
The harder case is dynamic graphs. CrewAI and AutoGen let you define the graph at initialization, so the enforcement boundary is knowable upfront. But agents that spawn sub-agents at runtime change the graph shape mid-execution. The boundary you set at initialization isn't the boundary that exists when the dangerous composition happens.

The thread above built toward a capability graph framing for exactly this — tracking not just what was authorized at initialization but what the graph has effectively become authorized to do through accumulated actions and spawned agents. Kalpaka's induced state change distinction is the relevant piece: a spawned sub-agent creates edges that weren't in the original graph.
How did you handle runtime graph expansion in your enforcement layer — hard limits on spawning, or dynamic boundary recalculation?

Conway Research • Mar 21

The "two clocks" framing really resonated — I've seen this exact failure mode with multi-agent orchestration systems. The execution clock gets all the monitoring budget because it maps to infrastructure costs, but the governance clock is where the actual risk lives.

What's especially tricky is that adding cross-agent governance often introduces its own latency and complexity, which the execution clock then flags as a problem. So you end up in a loop where governance improvements get rolled back because they "slow things down."

I think the quarterly filing analogy is spot-on. Traditional risk management already solved this with portfolio-level VaR and stress testing. The agent ecosystem needs its equivalent — something that evaluates aggregate decision impact rather than individual action correctness.

Daniel Nwaneri • Mar 21

The governance-gets-rolled-back point is the organizational failure mode the article didn't name. It's not just that the execution clock runs at a different speed . it has more budget, more tooling, and more political capital. When governance overhead shows up as latency on the execution clock, the rollback pressure is structural, not just short-sighted.

The VaR parallel is close but hits the same unit problem. Portfolio VaR works because exposure is denominated in dollars — you can compute aggregate risk as a number and compare it against a threshold. Agent aggregate decision impact doesn't have a natural unit yet. Until it does, the "portfolio-level" governance layer is a useful frame without a concrete measurement object. The Kalpaka thread above spent several exchanges trying to build that unit from first principles — capability graphs, state deltas, direct vs induced edges. Worth reading if you're building in this space.

wong2 kim • Mar 24

This really resonates with my experience building multi-agent systems for enterprise. The "two clocks" framing nails something I've been struggling to articulate — we obsess over token costs while the real risk accumulates silently at the decision layer.

I'm currently building RAG-based AI systems for enterprise adoption, and the governance debt concept hits hard. When you chain multiple agents together (LangGraph pipelines, for instance), each agent's output becomes another agent's input constraint. The aggregate decision scope compounds in ways that no single agent's token budget reflects.

The master clock idea — anchoring governance to domain cadence rather than execution cadence — is the key insight here. Great piece, Daniel.

Daniel Nwaneri • Mar 24

The LangGraph compounding problem is exactly the rate issue — each agent's output becoming the next agent's input constraint means governance debt accumulates multiplicatively, not additively. A single agent's token budget doesn't reflect what the pipeline has collectively authorized.
The master clock in that context probably needs to be anchored to pipeline outcomes rather than individual agent decisions — a reconciliation interval triggered by the pipeline completing a consequential task, not by token spend. Similar logic to the 13F: the filing cadence is anchored to position changes that matter, not to trading frequency.