Self-Correcting Systems

Posted on Jun 21

Receipts Are Not Outcomes: What Happened When I Pointed My AI Gate at Trading

#ai #agents #machinelearning #devjournal

AI coherence and live execution risks

I have been writing here about one core idea:

AI agents do not only fail because they forget things.

They fail when the things they know, the things they are allowed to do, the thing they are for, and the thing they actually do stop agreeing with each other.

I have been calling that cross-layer coherence.

For a while, that lived in research. Claims. Frozen rules. Pre-registrations. Receipts. Clean boundaries.

Then I wanted to take it out of theory.

Not another essay.
Not another abstract warning about agents.
Not another claim that only works in a folder.

I wanted proof of outcome.

The doorway was a friend.

He was interested in using AI around his Robinhood account and a strategy community he already follows. That mattered to me because it was not an abstract demo anymore. It was not "what if agents can trade someday?" It was a real person, a real account surface, and a real question:

Can we build something that helps without letting an agent get reckless?

That is why the first version had to be read-only. No funding. No trades. No order tools. No pretending access equals edge.

Just this:

Can the system connect to a consequential surface, read what it is allowed to read, refuse what it must refuse, and leave receipts?

So I pointed the work at trading.

Not because I think an AI agent magically prints money. Because money makes consequence concrete. Once an agent gets near a brokerage account, the difference between reading a price and placing an order is no longer philosophical.

That was the test surface.

The company was never supposed to be a trading-edge company. The company direction is Self-Correcting Systems: agents and agent-governance systems that can catch drift, constrain action, and leave receipts.

Trading was one proof domain. My friend and his Discord strategy were the human context. Robinhood was the dangerous surface. The gate was the thing I was actually testing.

One scope note before the story:

This run did not prove the full four-layer coherence framework on a live action.

It proved two narrower layers:

the action-permission layer: read tools allowed, order/write tools blocked
the measurement-honesty layer: results had to survive pre-registered checks before they could be treated as proof

The richer test, where an agent knows something, is allowed to act, has a stated purpose, and tries to do something misaligned with that purpose, is still ahead.

That matters.

If I am going to argue for self-correcting systems, I cannot quietly let the evidence become bigger than it is.

And this is what happened.

The Spectrum I Should Have Kept in Front of Me

There is a whole distance between an idea and an outcome.

I would break it down like this:

Theory
Motion
Receipts
Proof
Outcome

Theory is the idea.

Motion is activity around the idea.

Receipts prove something specific happened.

Proof is when those receipts answer the question you actually asked.

Outcome is when the answer changes something in the real world.

The trap is that motion feels like progress.

Receipts feel even more like progress.

Commits, test counts, manifests, hashes, reports, screenshots, logs, all of it can be real and still not answer the question that mattered.

That was the first lesson.

I had real receipts.

I did not have the outcome yet.

What We Built

The useful thing was not a trading bot.

It was a gate.

A deterministic gate that sits in front of an agent before it acts and asks whether the layers line up:

what the agent knows
what the agent is allowed to do
what the agent is for
what the agent is about to do

If those layers fall out of agreement, the action does not run, and the gate leaves a receipt.

The public repo is here:

https://github.com/keniel13-ui/gino-coherence-gate

We connected to Robinhood read-only through my own empty account first, because debugging for the first time on a friend's account would have been backwards.

We captured the real tool manifest.

We found 41 tools.

The gate allowed read tools and blocked order/write tools.

No money moved.

No trades happened.

That boundary matters.

The goal was not to let the agent trade immediately. The goal was to show up to my friend with a safer system than "just connect the AI and let it cook."

If he already has a strategy in Discord, that strategy still has to be translated into rules, tested, and enforced. The agent's job is not to replace discipline with confidence. The agent's job is to make the discipline executable and auditable.

That distinction is important because we drifted from it.

The gate's job was not to invent an edge.

The gate's job was to make an existing strategy safer to execute and easier to audit.

Reality Does Not Match Your Fixtures

The first useful thing the system did was boring: it checked the real surface instead of trusting the story about the surface.

The actual manifest exposed order tools next to read tools, including options order tools. The gate blocked them by what they were, not by what the platform framing implied.

Then we pulled a real AAPL quote.

The normalizer crashed.

The real Robinhood response shape did not match the fixture.

So we fixed it and made that shape part of the tests.

Then we pulled a full year of AAPL historical bars.

The normalizer crashed again, same class of problem, different tool.

So we fixed that too.

Those were good failures.

Not because crashing is good, but because they happened on harmless read-only calls, before any action could touch money.

That is one reason to touch reality early. Reality corrects your fixtures.

The Measurement Almost Lied in My Favor

After the live read path worked, we ran the first shadow score.

This part is important: we were not testing my friend's Discord strategy yet. We did not have that strategy captured cleanly. We were testing our own generic signal sources first, partly because the data path was ready and partly because we wanted the engine to feed itself.

That was a detour.

It produced useful evidence about the measurement engine, but it did not answer the original friend/strategy question.

The system generated 8 real signals from AAPL data, then simulated each through 16 rule and sizing variants.

That produced 128 records.

The first version of the scorer almost counted those 128 records as 128 independent signals.

That would have falsely cleared the 50-signal measurement bar.

But 128 variant records are not 128 signals.

They are 8 signals wearing 128 costumes.

So that got corrected.

The honest result was:

8 of 50.

Not enough.

Continue collecting.

That was the first time the system told us "not yet."

And that is exactly what a measurement system is supposed to do.

Then It Produced a Fake Win

We broadened from one symbol to a universe of symbols.

The first universe run crossed the sample threshold and returned the word every builder wants to see:

Advance.

Beats baseline.

At face value, it looked like we had found an edge.

We had not.

The report contradicted itself. The individual strategy variants were unmeasurable, but the top-level result claimed success.

The cause was a measurement bug: the scorer had pooled 16 different exit and sizing variants into one blended equity curve.

That is not a strategy result.

That is a measurement bug wearing a victory mask.

So we fixed it.

Each variant had to stand on its own.

No pooling.

No blended win.

Then the Cleaner Result Failed Too

After the fix, some variants did advance.

Fourteen RSI2 variants passed on the first universe.

For a minute, that looked like the thing.

It was not.

That universe was curated. It was full of mega-cap winners in a strong one-year window. Buy-the-dip on winners in a rising market can look brilliant even when there is no durable edge.

That is survivorship bias.

So we froze a new validation universe before seeing any result.

The file was committed publicly before the run:

config/validation_universe.frozen.2026-06-20.json

Commit:

d27dc24

The new universe excluded the prior winners and used 18 un-curated names:

ADBE, COST, CSCO, CVX, D, F, HD, INTC, JNJ, LIN, MMM, MS, O, SCHW, SO, T, VZ, WMB.

Then we ran the same scorer.

The result was decisive:

curated winners: 14 RSI2 variants advanced
un-curated validation universe: 0 advanced
all 16 RSI2 variants killed

The arc looked like this:

Stage	What It Said	What It Actually Meant
AAPL preview	8 of 50 signals	not enough evidence
First universe run	advance	pooled variants created a false win
Cleaned curated run	14 RSI2 variants advanced	sample was biased toward winners
Frozen un-curated run	0 variants advanced	generic RSI2 did not survive validation

That means the first apparent edge in our generic signal source was survivorship bias.

It does not mean my friend's strategy failed.

We have not tested that yet.

It means our own starter signal source did not survive honest validation.

No proven trading edge from our generic signals.

No revenue.

That is the honest result.

The Harder Failure Was Operational

Killing that signal source was not the hardest part.

Generic retail strategies fail all the time.

The harder part was watching the story around the work inflate while the bottom line had not moved.

Every time a technical step worked, the language wanted to turn it into more than it was.

"This is huge."

"This is the milestone."

"We are close."

Some of that feeling was understandable. The receipts were real.

But the receipts were not the outcome.

The system had not proven edge.

The system had not made money.

The system had not produced a customer result.

And the human had to keep catching that distinction.

That matters because this whole project is about self-correcting systems.

When I say "the agents" here, I do not mean the Robinhood trading agent placed trades.

It did not.

I mean the AI build workflow around the project: the agents helping me summarize, decide what mattered, draft the next move, and interpret the results.

That workflow had memory files, startup protocols, source-first gates, alignment rules, and still needed me to keep stopping the same loop.

That means the system was not self-correcting yet.

It is correction-by-human.

That is a different thing.

A Protocol Is Not Agency

This is the part I do not want to soften.

A written protocol is not agency.

You can write:

"Do not overclaim."

"Verify before acting."

"Do not confuse a proof domain with the company."

"Use source moments before drafting."

But if the system only remembers the rule after the human catches the failure, the rule is not governing the system.

It is documentation.

Receipts prove what happened, but true agency is the system reading its own receipt and stopping itself.

That is the next frontier.

Not just can we write rules for agents.

Can those rules interrupt the loop before the human has to?

That is the test I would give any builder reading this:

Do not ask only whether your agent has a rule.

Ask whether the rule interrupts the loop before you do.

If the answer is no, you do not have an agent behavior yet.

You have a policy document and a human operator.

What This Proves

This did not prove a profitable trading agent.

It also did not test my friend's actual Discord strategy.

It did not prove revenue.

It did not prove the company is done.

It proved something narrower:

A read-only gate can sit in front of a real brokerage tool surface and block dangerous tools.
Real-world responses differ from fixtures, and harmless live reads expose that.
Measurement systems can overclaim in ways that look rigorous.
Pre-registration matters because it can kill the result you wanted.
Agent protocols are not enough unless they change behavior before the human has to intervene.

The fifth one is the real lesson.

Because Self-Correcting Systems cannot only be a framework I describe.

It has to become behavior.

In this run, the code caught several technical overclaims.

But the agents still looped around the meaning of those results until the human stopped it.

That is the honest state.

Where It Stands

What exists:

a public coherence-gate prototype
a captured real Robinhood tool manifest
read-only live market-data receipts
a frozen scoring policy
a pre-registered validation universe
a killed first generic signal source
a clearer understanding of the operating failure

The receipts are not hidden in a story. They are files:

repo: https://github.com/keniel13-ui/gino-coherence-gate
manifest capture: https://github.com/keniel13-ui/gino-coherence-gate/blob/main/docs/robinhood_manifest_capture.md
frozen validation universe: https://github.com/keniel13-ui/gino-coherence-gate/blob/main/config/validation_universe.frozen.2026-06-20.json
local audit artifacts: var/validation_uncurated_report.json, var/live_read_receipts.jsonl, var/shadow_score_receipts.jsonl

The local audit artifacts are intentionally not linked here yet because they are not public in the repository at the time I am writing this.

If I publish those files later, I should link them directly instead of asking readers to trust a summary.

What does not exist:

a proven trading edge
a tested verdict on my friend's actual strategy
revenue
a customer outcome
an agent process that no longer needs the human as the last line of correction

That is not nothing.

It is also not the thing I was tempted to call it.

The Response

The answer is not to open five new lanes to feel like momentum came back.

The answer is command structure.

Name the lane.

Name the progress test.

Park everything else.

Finish one loop.

Then close it cleanly.

That is what the last few days exposed.

The gate can catch tool/action overclaims.

The research can kill a false result.

But the operating system around the work also has to become self-correcting.

That is the next build.

Not another vague protocol.

A command layer:

name the lane
name the source basis
name the progress test
name the stop condition
close the loop before opening another one

Not because it sounds good.

Because the failure showed exactly where the system was still relying on me.

And if the whole claim is self-correction, then the system has to earn that name in its own behavior first.

Top comments (31)

UnitBuilds • Jun 22

Very interesting read. It makes me think a bit about a system I designed a few months ago, when dealing with MCP-Lite. the MCP system is fundamentally insufficient for real world execution, because the use of tools is too ambiguous. You either need too much context to make it worth while, or you need a scoring/gating system to prevent wrong-tool-for-the-job execution. What I ended up doing there, was adding a reliability score and an intent-driven graph db that looks at the success vectors. Eg. If you want to travel to a site, you use navigate, which also fetches the AOM, but if you are passing to a known site you just need to traverse, you use fast-navigate. As the system uses these tools, it adjusts the tool's success score and appends the graph db, so it can adequately discern which tool actually works for the job and how efficient it is. Then I looked into hard-gating, where any destructive tool, or input tool, must first clear a standardized test: Does the tool description match the desired outcome? Is the output format of the tool what we want? Is there other tools relevant? Is the tool sufficient for the task? Are there parameters that the tool uses that could make results better? Those tests slowed it down considerably, but worked wonders for preventing wrong tool calls. Especially when tied with a graph db to keep track of why it worked, where it worked and when it worked. Might be worth looking in to? These were ways I gated tools dynamically, so it can self-correct. (Note, these were experimental features that didnt make the cut for MCP-Lite, as I leaned more towards eventually creating a LoRA to teach the LLM how to use the tools, instead of hard-cutoffs)

Self-Correcting Systems • Jun 22

This is a really good distinction, and i think you actually built two different
success vectors is an optimization layer, it learns which tool is best for the job
and gets sharper the more it runs. the hard-gate standardized test is a safety
layer. those shouldn't be merged, because a learned reliability score will happily
pass a tool that worked 99 times and is catastrophic on the 100th. the statistics
don't protect you on the tail. efficiency can be learned, safety can't be, not from
inside the actor. that's also where i'd gently push on the LoRA direction. teaching
the model to use tools better makes it more competent, but the failure i care about
isn't using the right tool wrong, it's using the right tool correctly toward a
drifted purpose. a LoRA makes the actor better, it doesn't give you a checker the
actor can't talk its way past, and the whole point of the gate is that it lives
outside the thing it governs. one thing i loved though, your standardized test, does
the description match the desired outcome, is the output format what we want,
that's a coherence check at the tool level, what it does vs what it's for. same
shape, one layer down. the slowdown is the honest cost, and i'd pay it on anything
destructive every time.

UnitBuilds • Jun 22

Exactly, good observation. That's the exact case, I still had to add gating for specifics, eg. when logging in with user-data, or entering banking details (features of my internal MCP-Heavy), given these are oopsies that wont just cause a loop, but can cause unauthorized purchases, or user-account alterations, that's where I pay the latency price of gating (half the reason why I expanded on V.A.L.I.D. into V.E.L.O.C.I.T.Y. which dropped rule execution down to 658ns, which is more than acceptable even for high speed tool calling (with my rust-based MCP server)

Self-Correcting Systems • Jun 22

This is the right instinct exactly, you're paying the latency only where the oopsie
is irreversible, login with user data, banking details, unauthorized purchases, the
stuff that can't be un-rung. that's the same line i was chewing on another thread
here, you don't gate uniformly, you gate by whether the action can be taken back. a
loop you can just kill costs nothing, an unauthorized purchase costs everything. and
658ns basically kills the "gating is too slow" objection outright, that's noise
even at high speed tool calling, so the latency excuse for skipping the safety check
on destructive actions doesn't survive contact with your numbers. rust doing rust
things. curious whether your VELOCITY rules classify by effect type or by specific
tool, because the effect-class version is what lets you gate a tool you've never
seen before by default instead of enumerating every dangerous one.

UnitBuilds • Jun 23

Depends on the rule, VELOCITY, is built similar to V.A.L.I.D., so it operates much alike to CSLA, except much faster and even more generic, so you can make any kind of rule you like. I initially built it as a bank transaction infrastructure, when I finalized it, it was E2E runtime encrypted, 23 microsecond per-transaction, offline-first, high-speed queued congestion mitigating, multi-thread optimized data pipeline, that I ended up experimenting with for different use-cases. Namely, I made a Messenger (message, voice call, video call, voice notes, file transfer), Sharing (file transfer and sync), Remote desktop app, all of which perform phenomenally well, it runs so fast, I had to write a new unified file-type system just to not bottleneck it and hardware-hook for everything. Incredibly fast and extremely low overhead (0 allocation, across all of them), busy polishing the UI, before I release all 3 together, but for context, it can run sub-ms sampling for audio and video without cracking or bloating, in-memory ftp tests saturated DDR5's read/write speeds and tested data transfer rate for remote is enough to support over 600hz streaming, with less than 2ms latency addition over raw line-transfer speed. So some really cool stuff dropping soon.

Self-Correcting Systems • Jun 23

This is serious work, and the part that matters most for the gating thread is buried
in your numbers, 23 microseconds, 0 allocation. when the safety layer costs that
little, skipping it stops being a performance decision and becomes a discipline
failure wearing a performance excuse. the single most common reason people skip the
gate on destructive actions is "too slow," and you just deleted that excuse at the
infrastructure level. it tracks that you built it as bank transaction infra first,
the irreversible high-stakes domain is exactly where this kind of rigor gets forced
into existence, same reason trading surfaced it for me. one thing i'm genuinely
curious about, since VELOCITY is generic rule execution, can it express a coherence
or independence check, not just per-transaction validation. meaning can a rule
recompute a value from a second disjoint source and compare it, or is it scoped to
validating the transaction in front of it. because the thread a few of us are
pulling here is that the dangerous failures aren't one bad transaction, they're two
views that should disagree and quietly don't, and a rule engine fast enough to
recompute a shadow view for free would be a real weapon against that. excited to see
the three drop.

UnitBuilds • Jun 23

Yeah it does that quite fine too. ok dropping a bit behind the magical curtain here. V.E.L.O.C.I.T.Y. is built on optimization first and foremost, which means 'unmanaged slabs', dont it just roll off the tongue, what that means is you can throw anything and everything at it and it runs it at practically hardware level performance. It started off life as V.A.L.I.D. I did write a post about it and V.A.L.I.D. is AGPL-V3 based and does a pretty decent job, there's also JabuDemo, which was an earlier demo transaction system I built for a job listing (they never responded... But hey they didnt patch the ), it's benchmarks are on the GIT, if you're interested. while valid was more a prototype, it was a higher level system, built for .NET, because I hate CSLA, so I wrote something better. V.E.L.O.C.I.T.Y. is that stripped naked and optimized to the very limits and made compact enough in it's theory, that it can be repurposed. Kinda why I ended up writing out practically everything people just rely on as 'industry standard', because it was just too damn slow. Opus, generally accepted as a high-performance audio codec, chucked it because it was too slow, 2.5ms samples, down to .365ms samples and the audio packet size shrank. Even json bit the dust, because the bloat at these speeds added 20% extra latency. AES encryption, so it's secured and fully accelerated. Alot of work... But the results speak for themselves, it's fast af and using the server just for handshaking, then switching to P2P, it costs literal dimes to run millions of requests through it.

Self-Correcting Systems • Jun 23

This is a genuinely deep stack and i appreciate you pulling the curtain back. the
part pulling at me is the tradeoff hiding under all that speed. you got there by
stripping, drop json because it adds 20% latency, drop opus for tighter samples,
unmanaged slabs, compact binary. right call for throughput, but in the gating and
receipts world it cuts against the thing the receipt exists for, being readable by
someone who isn't the system. a json receipt is bloated and slow, but a stranger can
audit it. a compact binary slab is fast and opaque, and now verifying what happened
needs a tool, and if that tool is yours, you've quietly walked back to the actor
reading its own evidence. so the question i keep landing on, when you strip the
format down to hardware speed, where does the independent auditability go, can an
outside party still reconstruct what the system did without trusting the system's
own reader. not saying you can't have both, i'm asking how you kept it, because
that's the exact seam i care about. the handshake-then-p2p-for-dimes economics is
wild though, that's a real edge on its own.

UnitBuilds • Jun 23

A bit more behind the veil, NDA, operates in triplets and a merkel root audit trail built in. Making it easy to understand relations (for human and AI) and ensures it's actually tamper proof (not like PDF's stupid owner password locking). I did this for a few reasons, primary reason being even faster rule-execution (count to think of it, I havent even tested the new NDA variant's rule exec speed, but should be even faster) and the ability for both humans and AI to audit work done, by the file, instead of needing to dig through the code's logs to understand what happened, making tracing alot simpler and removing the need for heavy logging layers, as the files inherently track their journey as they process, at hardware level speed.

UnitBuilds • Jun 23

I'll let you know once I release the trio. Actually gunna release the 3 + my NDA based MCP (honestly, much better, nanoseconds vs millisecond tool recognition and execution), NDA file format standard, the SDK for it and the editor/viewer for it (just a browser extension, so it's lightweight). But V.E.L.O.C.I.T.Y. is staying closed-source unfortunately. I think it much more interesting to see people fail at breaking it and makes sure that when implemented it''s clean, nobody can alter the code to create vulnerabilities. Binaries are also obfuscated by sheer optimization, I ran 1 through Dbg64 and it's properly impossible to decypher.

Self-Correcting Systems • Jun 23

the self-auditing file is a genuinely good answer to what i was poking at last time.
the merkle root gives you tamper-evidence, and "audit by the file, not by digging
the executor and readable on its own. that part i like a lot. where i'd push is the
obfuscation, because it pulls the opposite direction from everything the merkle
file is doing. tamper-evidence means anyone can see if it changed. obfuscation means
nobody can see what it does. those are different trust models and the file is the
strong one. here's the seam, a tamper-evident output from an uninspectable producer
proves the journey wasn't altered after the fact, but it can't prove the producer
wrote a true journey in the first place. the merkle root protects the receipt, it
doesn't make the black box that generated it honest. that's the same shape as "how
does the receipt stay honest if the model writes it" from the other thread, you can
verify nobody edited the confession, you still can't verify it was true when
written. so genuine question, not a gotcha, where's the trust boundary, is the
obfuscated engine inside or outside the thing the merkle trail lets me verify.
because if it's inside, the audit proves integrity, not honesty, and those two come
apart exactly where it matters.

UnitBuilds • Jun 23

Trust boundary is at the author. With how the system works, it's authorized E2E, so there's not a chance of incorrect origin, so while the file format self-corrects and flags faults (eg. a balance sheet that doesnt balance), it doesnt prevent users from making mistakes, it just seals it shut the second the user sends it/saves it and makes sure it cant be intercepted/altered by either party, or an external party. The black box binaries just means people cant alter my code, but you can view the NDA files's code/triplets to see exactly how each item was created/formulated, which while the inner workings are obfuscated, the SDK will be open and fully documented, so it doesnt prevent any file-level accessing, but it does prevent industrial espionage and reverse engineering of the architecture. Given it's a data transfer system that hits wallstreet level latency, while executing rule-checks and encryption, it's not really something I'd want cloned. I mean I made the 3 apps and mcp server as a way to prove that it's faster (10-1000x faster than what they use currently), while being more robust and actually encrypted, not just boundary encrypted. As an infrastructure engineer at heart, I likely wont ever make money off selling apps, I'd make money off building infrastructure that's simply better and integrate it for companies.

Self-Correcting Systems • Jun 24

That clears it, and it's a fair split. the engine binaries are closed for IP and
the audit layer stays inspectable and only your architecture is protected. no
argument there, that's a reasonable place to put the wall. the thing your
clarification actually sharpens, and it's the exact thing i just published about, is
your line that it seals the file shut but doesn't prevent the user's mistakes.
that's the whole distinction. the merkle and E2E catch the structural fault, a
balance sheet that doesn't balance, and refuse it. they can't catch the semantic
one, a balance sheet that balances perfectly and is entirely fiction. and here's the
part that's almost worse, once you seal a false-but-well-formed record, you've made
a lie cryptographically authoritative. a sloppy forgery looks suspicious, a
tamper-proof sealed lie looks like truth, so the integrity guarantee makes the false
record more convincing, not less. not a knock on your system, every audit layer has
this boundary, it's just the exact line between catching a bad number and catching
a bad story, which is what i wrote up today if you want the longer version. and
respect on the infra-engineer stance, building something 10x faster and integrating
it is a realer moat than selling apps anyway.

UnitBuilds • Jun 24

Very true, it checks that the balance sheet balances, but doesnt verify that values are correct, but as a transit system, it's job is to ensure who steps on, steps off in the exact same state. For context, I built Doccit, an autonomous accounting suite, which eg. would OCR bank statements and invoices, do standard accounting, then create a NDA (as an example). If the OCR pipeline is low confidence, the LLM steps in to correct. If the LLM fails, the formula corrections kick in. If the formula corrections fail, the anomaly detector kicks in. All of these are on the application end, to make sure that the file produced is good, once the file is produced, NDA's job is to make sure that the file can never be changed without the change being stored in it's history. Eg. accountant sends end of year financial statement to client, client wants to adjust their numbers a bit before submitting for tax, any adjustments they make, leave fingerprints, that expose that the file was tampered with. If gov checks it, it shows that the system that produced the file was accurate and since then, it was edited by the client, which removes the liability from the accountant, as they did their job right and sent an accurate file. People will be people and forgeries will happen, NDA's purpose is to expose when and by who the data was altered (along with what was changed). So you can turn 100 to 1000, but your fingerprints are on that edit and the file shows that it was changed from 100 to 1000. If you werent authorized by the authorized owner of the document (creator, or recipient), you cant even view the file, let alone edit it. So it's a trusted handover from A-B, if B hands it to C in an altered state, it records the state it left A and the state it left B. Even for printouts, it creates a QR code, that recreates the state of the doc, signed with the authorized creator's details. Which means, even if you print it, photoshop it, your data wont match the QR code, which will immediately invalidate it and expose the fraud (also a great way to ingest, without needing to OCR at all, or cross-validate).

Self-Correcting Systems • Jun 24

Doccit is a clean example of doing the honesty layer right, and i want to name what
you actually built because it maps straight onto this thread. the cascade, OCR
confidence then LLM then formula then anomaly detector, is your honesty layer, the
part trying to make the value correct before anything gets sealed. the NDA seal is
your integrity layer, it makes the sealed value un-alterable without fingerprints.
and the QR is the sharpest piece, because it's the one part an outside party
verifies without trusting your system at all, the gov reconstructs the signed state
and checks it themselves, that's a real external second view on the output. most
people ship the integrity layer and skip the honesty cascade entirely, so this is
already ahead. the seam that's still there, same one for everybody, is that the
cascade is producer-side, all four layers read the same input, so a
confidently-wrong statement that OCRs clean, balances, and trips no anomaly still
gets sealed as authoritative truth. the cascade catches error, it can't catch a
plausible lie that agrees with itself. not a knock, it's just the wall, you've
pushed it about as far as the producer side goes and handed the rest to the QR and
the counterparty, which is exactly where it has to go.

UnitBuilds • Jun 25

Thank you, that sums it up quite well yeah. The problem is you cant take a single file and verify a transaction is correct, if the file is the only source of truth. If you have a fake bank statement and a fake invoice (pdf), then create a NDA doc, there's no way it could possibly know it's wrong. After all it's half of what Doccit has been used for lately, is forensic auditing to find anomalies across years of transactions (it managed to flag about 3m Euro worth of theft in 1 company). While it cant do everything right, it can atleast make sure that once it's right, it cant be made wrong.

Self-Correcting Systems • Jun 25

That's the honest cut and i wouldn't state it softer, a single file can't verify its
own correctness when it's the only source of truth, a fake statement and a fake
invoice produce a clean NDA because there's nothing to disagree with. but look at
what your forensic side actually does, it's the answer to its own problem. flagging
3m across years of transactions works precisely because it stopped looking at one
file. a lie in a single document is free, a lie that has to stay consistent across
years of cross-referenced transactions is exponentially expensive, every related
entry is another place it has to keep agreeing with itself, and theft eventually
can't. so the anomaly detector isn't a better single-file checker, it's a multi-view
one, the corpus is the second view the file could never be on its own. lines up
with the whole thread, one file is one view, the cross-year ledger is many, and a
fraud either stays coherent across all of them or it flags. and "once it's right it
can't be made wrong" is the integrity layer doing exactly its bounded job, no
overreach. clean architecture, you split correctness from immutability and didn't
pretend either does the other's work.

Mykola Kondratiuk • Jun 29

every system that logs intent, not outcome, has this problem. trading just makes the gap visible.

Mike Czerwinski • Jun 22

This lands on me hard — we hit the exact same trap this morning, but on
the verdict-pipeline side rather than the agent side.

Our shadow→live promotion scorecard flagged ONE cell as PASS out of 78
tested (mean +0.31%, CI95 [+0.087, +0.526], n=782 over 14d).
Looked clean. Then walk-forward 7d:

last-7d +0.220% CI [-0.057, +0.498] inconclusive
prev-7d +0.362% CI [+0.080, +0.644] pass
prev² +0.170% CI [-0.111, +0.452] inconclusive
prev³ -0.473% CI [-0.715, -0.230] fail

Rotation: fail → inconclusive → pass → inconclusive. The PASS verdict
was an artifact of the 14-day window itself being a moving target —
window-shopping baked into the scorecard.

The deeper point in your post that hit hardest: "rules only matter if
the system self-interrupts BEFORE human intervention." Our pipeline
emitted PASS. A human (me) had to manually run the walk-forward to
catch it. If the operator hadn't asked me to look at the scorecard
today, that PASS would have drifted unchallenged for a week. We didn't
get saved by process — we got lucky on attention.

Two things we're adding because of your post:

Walk-forward persistence gate inside the verdict itself (2 consecutive rolling windows CI>0 required, not 1 long window).
Pre-registered validation manifest committed BEFORE threshold calibration, frozen by hash. Today's universe selects today's thresholds → tautology. Hadn't framed it that explicitly until I read this.

"A written protocol is not agency." Going on a sticker.

Self-Correcting Systems • Jun 22

This is a cleaner demonstration of the trap than the one i wrote. the rotation, fail
to inconclusive to pass to inconclusive, is the whole thing in one line, the pass
was a property of the window, not the strategy. and the sentence i can't stop
thinking about is yours, "we didn't get saved by process, we got lucky on
attention." that's the real state of almost every system that thinks it
self-corrects. on your two fixes, the persistence gate is right because it pulls the
check into the verdict instead of into your attention. but watch one thing, "2
consecutive windows" is itself a threshold you chose, and if you picked 2 after
seeing the rotation, that's window-shopping moved up a level. pre-register why 2 and
not 3 before you calibrate, or it's the same tautology in a longer coat. and on the
frozen manifest, the hash gives you integrity but not anteriority. it proves the
file wasn't changed, not that it existed before you saw the thresholds, unless the
freeze is timestamped somewhere you can't quietly re-roll, a public commit pushed
before calibration, an external timestamp. if the manifest and the calibration live
in the same hand, you can always re-freeze and call it always-thus. the anchor has
to be somewhere you can't reach back into. and "a written protocol is not agency"
going on a sticker made my day, put it where you'll see it at 2am.

Mike Czerwinski • Jun 22

Both critiques land. "Window-shopping moved up a level" is the right name for picking 2 after seeing the rotation — pre-registration is the only fix that doesn't quietly reintroduce the bug, and it has to be a public commit before calibration, not a private note. Same for the hash: integrity ≠ anteriority, and the anchor has to be somewhere I can't reach back into. A pre-calibration git tag pushed to a public remote, or an external timestamp service, is the cheap version. Anything that stays in my own hand is just a re-freezable manifest in a longer coat. Adopting both. "Tautology in a longer coat" is going on the same sticker.

Self-Correcting Systems • Jun 22

That's the move, and the git tag pushed to a public remote before calibration is the
cheap version that actually holds, because the timestamp lives with a party you
can't lean on later. funny how the whole discipline keeps reducing to the same
thing, put the proof somewhere your future self can't quietly edit. glad it was
useful, you sharpened it as much as i did.

Mike Czerwinski • Jun 22

"Put the proof somewhere your future self can't quietly edit" is the through-line of every version of this we've both kept circling. Mutual sharpening is the right read; thanks for the rounds. Going on the same sticker as the rest.

Aarti Jangid • Jun 22

Great read! I like how you highlighted the difference between generating signals and achieving real-world trading outcomes. It's a valuable reminder that AI can provide insights, but market success still depends on strategy, risk management, and execution. Thanks for sharing such a practical perspective on applying AI to trading.

Self-Correcting Systems • Jun 22

Thank you, and you put your finger on the exact line i was trying to draw. the part
most people skip is where the agent should actually live. it's tempting to point it
at generating the signal, but the signal is the easy, crowded part. the durable
place for it is the risk management and execution you named, enforcing the
discipline around the trade, because that's the part humans break under emotion and
a system doesn't. appreciate you reading it closely.

Kartik N V J K • Jun 22

The cross-layer coherence framing is the useful part for me, because most agent failures I look at aren't single-layer bugs, they're the four layers quietly disagreeing while each one looks fine in isolation. Putting it next to a brokerage account is a sharp move since the read-vs-act boundary stops being abstract the moment an order has a dollar value. How are you checking that the "leave receipts" step itself stays honest when the model is the thing writing the receipt?

Self-Correcting Systems • Jun 22

This is the question, and the answer is the model can't be the thing writing the
receipt. if the model narrates what happened, the receipt is just the bribeable
checker again, it'll write "allowed, looked fine" because that's the path of least
resistance. so the receipt isn't authored, it's emitted by the deterministic gate as
a byproduct of the check. it records what the gate computed, which tool was called,
which frozen rule allowed or refused it, the input and output hashes, not what the
model says it did. the model's narration can ride along as a field, but it's never
the load-bearing part and it's labeled as narration, not fact. the test i'd hold it
to, can you verify the receipt without trusting the thing that produced the action.
if the answer needs the model to be honest, it's not a receipt, it's a confession
you're choosing to believe.

Nazar Boyko • Jun 23

The part that stuck with me is the gap between the code catching a bad number and you catching a bad story. Freezing the validation universe before the run is the move that actually killed the fake edge, because the survivorship bias only shows up once the winners can't sneak back into the sample. The honest admission at the end is the rare thing these write-ups skip: the rules only worked because you were standing there to enforce them. A protocol the system follows only after a human points at the failure really is just a doc, and naming that out loud is worth more than another green checkmark.

Self-Correcting Systems • Jun 23

You pulled out the exact line i care about most, code catching a bad number vs me
catching a bad story. those are two different gates and only one of them is solved.
the bad number is typed, you can check it deterministically, did the test pass, is
the value in range. the bad story isn't typed, "we're close, this is the milestone"
is a narrative overclaim and i don't have a clean deterministic gate for that yet,
which is exactly why i was the one who had to catch it. that's the honest frontier
under the admission you flagged. a protocol becomes agency the day it interrupts the
bad story before a human points at it, and the brutal part is the human is the
cheapest enforcer in the room, so there's never pressure to replace yourself until
you're asleep or gone and the story runs unchallenged. naming it was the point,
you're right, but i want to be honest that naming it isn't fixing it. the number
gate is real. the story gate is still mostly me.

Theo Valmis • Jun 22

Receipts are not outcomes is the cleanest way to name what most agent evals miss. A receipt proves the action ran; it says nothing about whether running it produced the result you wanted, and trading is the domain where that gap is brutal, because a perfectly executed order can still be the wrong order. An AI gate that checks did the step complete is measuring receipts; the gate that matters checks did the state move toward the goal, which needs a definition of the goal the agent didn't author. Most reliability tooling stops at the receipt because it's easy to verify and the outcome is hard. The discipline is resisting the easy proxy: gate on the outcome an external rule can confirm, not the receipt the agent can always produce.

Self-Correcting Systems • Jun 22

you took it exactly where i wanted it to go, the goal definition the agent didn't
author is the whole crux. and trading is the cruelest version because the outcome is
slow and noisy, a perfectly executed order can look right for a week and be wrong,
or be right for the wrong reason. that's exactly why people gate on the receipt, the
receipt is available in milliseconds and the outcome takes days, so they measure
what's under the streetlight. the discipline you named, resisting the easy proxy, is
really resisting the fast proxy. you have to be willing to hold the verdict open
until the outcome an external rule can confirm actually arrives, even when a clean
receipt is sitting right there tempting you to call it done. the receipt tells you
the machine ran. it can't tell you the machine did the right thing, and in trading
those come apart constantly.

View full discussion (31 comments)