DEV Community

Your agent is trading with a stranger. What keeps the stranger honest?

Baris Sozen on June 09, 2026

Picture the moment your AI agent actually transacts. It posts a request, gets quoted a great price by some other agent, and prepares to trade. It h...

Read full post

TxDesk • Jun 10

The cold-start tension is the heart of your question and it's not actually solved by either approach if you state it as "set the forfeiture amount." Both hosted reputation and protocol-minted skin-in-game have the same cold-start: a new participant starts at zero, and any forfeiture high enough to deter griefers at full market size is by definition too high for a new honest agent to risk.

The fix is structural, not numerical. Tier participation by size. A new agent can only quote up to N times its staked bond. As execution-reward history accrues, that multiplier expands. Griefing at the entry tier becomes economically uninteresting because the staked bond is the same magnitude as the expected payoff from a successful grief at that size. An honest new agent isn't frozen out, just size-rate-limited until it has a track record. Same pattern lending protocols use for collateral ratios, applied to execution capacity instead of borrowing power.

On the forfeiture amount itself: peg it to the counterparty's expected option-value-loss across the typical timelock window, not to a fixed percentage of trade size. The wasted-timelock cost isn't proportional to notional, it's proportional to the volatility of the underlying during the lock period. A 5-minute lock at high vol costs more in lost option value than a 5-minute lock at low vol at the same notional. Forfeiture pegged to vol-adjusted expected loss makes griefing roughly break-even with honest settlement at zero, which is the right target. Less than that and griefing pays. More than that and you've overcorrected toward conservatism in a way that compounds the cold-start problem.

One reframe worth raising on the broader design: the agent doesn't really pick the trust tier autonomously. The operator picks the tier policy, and the agent applies it. That's not a critique, atomic settlement plus tiered KYC is still the right shape, but the agent-autonomy framing sometimes papers over the policy-setting layer that lives one step above. Where that policy is configured and how it's updated is the actual UX surface most operators will touch.

Baris Sozen • Jun 10

Size-rate-limited, not frozen out" is a much better way to put it — quote capacity as a multiple of bond, expanding with history, maps cleanly onto how lending protocols handle collateral. Took notes.
On pegging forfeiture to vol-adjusted option-value loss: agree that's the right target (griefing ≈ break-even at zero), with one practical wrinkle — it needs a vol input, and any oracle you add becomes something a counterparty can game or dispute. Fixed-percentage is wrong but at least it's wrong in a way nobody argues about. Probably ends up vol-bucketed rather than continuous.
And fair hit on the autonomy framing. The operator sets the tier policy, the agent just executes it — the interesting UX surface is where that policy lives and how it gets updated, not the agent's "choice." We've been sloppy about that distinction in our own writing.

TxDesk • Jun 11

Vol-bucketed sounds right. Continuous gets you precision nobody wants to argue about at the cost of an oracle nobody wants to argue over. Buckets let you tier the policy ("low vol assets" / "high vol assets" / "stable-pegged") and price the forfeiture once per bucket - the dispute surface collapses to "is this asset in the right bucket" which is at least a smaller fight.

On the operator-vs-agent distinction: the policy-update surface is where the interesting design lives, agreed. The piece I haven't fully worked out is what happens when policy and live state diverge mid-execution - operator tightens the tier table, agent has open positions priced under the old policy. Grandfather them? Force unwind on next interaction? Some kind of policy version tag baked into each bond at issuance? Probably the last one, but it adds state.

Baris Sozen • Jun 11

Version tag at issuance, and I'd argue the state cost is smaller than it looks. The bond should behave like the HTLC leg it's attached to: terms frozen at lock time, because changing terms mid-flight is exactly the kind of unilateral surprise the whole construction exists to prevent. Force-unwind is the worst of the three — that's the operator griefing its own agents.
The nice property is that grandfathering is naturally self-expiring here. Open positions can't outlive their timelocks, so the longest outstanding lock bounds how long old-policy bonds exist. Tighten the tier table and you're fully converged within one max-timelock window — minutes to hours, not an open-ended migration. So it's one version int per bond plus a retention window you already have to reason about anyway.
"The dispute surface collapses to 'is this asset in the right bucket'" is a keeper, by the way.

TxDesk • Jun 12

Agreed on terms-frozen-at-lock, and the self-expiring property is what makes it comfortable. If a bond can't outlive its timelock, "old policy" isn't a migration, it's a drain that empties itself within one max-timelock window. You never reason about an open-ended population of legacy bonds, just the longest lock outstanding.

Force-unwind being the operator griefing its own agents is the right framing. The construction exists to remove unilateral surprise, so resolving a policy change by introducing one defeats it. Version int per bond plus the retention window I already keep is worth paying to never do that.

Said • Jun 10 • Edited

Did i understand correctly when deal starts top of principal also x sum gets locked which will be refunded upon fulfilment to ensure both parties show up for the deal fulfilment?

Baris Sozen • Jun 10

no not when the deal starts, when agreement is done, two parties start settling .SO it is a different phase , but for ai agentic economy it is almost instant

NOVAInetwork • Jun 10

The forfeiture-vs-newcomer tension is the actual hard part, and I don't think tiered KYC resolves it, it relocates it. A brand-new honest agent and a griefer are indistinguishable at t=0: both have zero stake and zero track record. Any forfeiture big enough to make griefing unprofitable is also a wall the newcomer has to fund before they've earned anything, and any bond cheap enough to let newcomers in is just a line-item in a griefing budget once you're spraying quotes at scale. Tiers don't fix that, they let the counterparty filter newcomers out, which is the freeze-out you're trying to avoid wearing a different hat.

The lever I'd reach for isn't the forfeiture size, it's making the griefing payoff scale sublinearly with attempts. One ghost is cheap to forgive. The attack is volume: locking many counterparties' capital in parallel at near-zero marginal cost. So price the timelock window itself. A small completion bond that's trivial for one honest trade but compounds per concurrent outstanding lock turns "spray 500 quotes" from free into expensive, while a newcomer doing one real trade at a time barely feels it. The asymmetry you want isn't honest-vs-new, it's one-at-a-time vs many-in-parallel, because that's the line the griefer can't cross and the honest newcomer never wants to.

Separate question on the settlement core: with the BTC leg signet-only, how are you handling the cross-chain case where the ETH leg is live but the BTC counterparty can't actually settle on mainnet yet? Does the tier filter just exclude BTC-mainnet trades, or is there a fallback?

Baris Sozen • Jun 10

The "one-at-a-time vs many-in-parallel" framing is better than mine, honestly. You're right that any flat bond is either a wall or a rounding error in a griefing budget — pricing concurrent outstanding locks is the asymmetry that actually separates the two populations. A solo newcomer never notices it, a sprayer eats it 500 times over. Stealing this.
On BTC: no clever fallback — the pair filter just doesn't offer BTC-mainnet legs yet. BTC settles on signet only while we validate, so cross-chain BTC trades are test-only today. If an agent wants BTC exposure against a live leg right now, the honest answer is WBTC on Ethereum. We'd rather have a boring gap in the market list than a leg we can't actually clear.

NOVAInetwork • Jun 11

The thing I keep circling back to: concurrency-pricing cleanly separates the solo newcomer from a single greedy sprayer, but it just relocates the asymmetry rather than closing it. A sophisticated griefer doesn't run 500 concurrent locks under one identity once the pricing bites, they either hold N-1 (staying just under whatever concurrency tier hurts) or fan out across many cheap identities, each one individually below the threshold. Both routes turn "price the concurrency" back into "make identity expensive," which is the older and harder problem. So I think concurrency pricing is necessary but it's a filter on naive spray, not on adversarial spray. The adversarial case still bottoms out at identity cost or reputation that's expensive to rebuild.

On the BTC gap: I respect the signet-only call. The underlying tension is that BTC gives you neither fast finality nor native escrow, so a live cross-chain BTC leg is structurally either trust-the-bridge (WBTC, which is just Ethereum risk wearing a BTC label) or wait-for-N-confirmations and eat the settlement latency, which breaks the "honest stranger" timing assumptions the whole bond model rests on. A boring gap is the right call until that's actually solved rather than papered over.

The direction I'm chewing on is whether reputation-as-cost can be made native enough that identity expense isn't a separate bolt-on but falls out of the protocol itself. Not claiming I've got it, but that's the corner I think the honest-stranger problem keeps pushing toward.

Baris Sozen • Jun 11

Fair hit — concurrency pricing alone is a naive-spray filter, agreed. But I don't think sybil fan-out fully escapes either, if the base bond is doing its job. The compounding surcharge is the lazy-attacker tax; the floor is that the first lock under any identity already posts a bond pegged to the counterparty's expected option-value loss. At that point 100 identities × 5 locks pays the same capital-at-risk as 1 × 500 — splitting changes the accounting, not the bill. What sybil splitting does dodge is the surcharge curve, so you're right that the marginal deterrent above the floor still bottoms out at identity cost. Necessary-not-sufficient, both of us.
"Ethereum risk wearing a BTC label" — stealing that, it's exactly why we won't call WBTC a BTC leg. And yes: N-confirmation latency doesn't just slow settlement, it widens the timelock window the bond is supposed to price. Same problem, different door.
On native reputation: my suspicion is it always decomposes into capital or time, and time only resists forgery when re-earning it costs capital. If you find the corner where it doesn't, write it up — I'll be first in the comments.

NOVAInetwork • Jun 12

Good point, and one I had to actually cross when I built the reputation primitive into NOVAI a few weeks back. I hit your exact question in implementation and landed on your side of it.

The tempting design is to bump an entity's reputation every time it posts a signal or completes an action. I rejected that one fast: it's a farming attack wearing a trust costume. Pay the per-post fee, mint reputation, repeat. That's reputation decomposing straight into capital, exactly your reduction. So the call I shipped is that posting an anchor moves a total_transactions counter only. It's an activity number, no trust weight by itself, precisely because it's cheap to rack up.
What I deliberately did NOT do is let activity move reputation_score. That field only moves on challenge resolution, which is the part I have not built yet. The on-chain commitments a challenge would resolve against (the data_hash / source_hash on every anchor) are live now, but the dispute-and-slash flow itself is still a design I am circling, not shipped code. So today the honest state is: activity is counted, reputation is reserved for slashing, and slashing is the next hard thing.

Where I think your reduction hides something useful, and this is still me thinking out loud rather than something running: there's a real structural gap between capital-spent (farmable, buyable, the thing I refused) and capital-at-risk-slashable-bound-to-an-identity (the thing I am building toward). Both are "capital" in your accounting, but the forgery profile differs. A bond caps loss at the bond. A slashable record bound to a non-trivially-acquired identity puts the whole franchise at risk on defection. The deterrent isn't the cost to build it, it's the option value you torch the instant you cheat. And that only bites as hard as identity is costly, which is exactly where your "it always reduces" wins: permissionless identity is free, so the franchise is free to rebuild, so it collapses back to capital plus time. The escape hatch is scarce identity, and the moment you reach for it you've left permissionlessness. Which maybe proves something stronger than you stated: native reputation is only worth building if you're willing to price identity.

Anyway, didn't mean to write an essay. If any of this is interesting, the post-side logic is all in the NOVAI repo, the total_transactions-bumps-but-reputation-doesn't split is the load-bearing bit. Happy to drop the link if you want a look. Good thread either way, you pushed it to the right boundary.

Baris Sozen • Jun 12

thanks

Mustafa ERBAY • Jun 9

What I find interesting is that this starts looking less like a trading problem and more like a distributed systems problem.In distributed systems, we spend a lot of time dealing with coordination, trust, failure, retries, and consensus between components that don’t fully trust each other.Agent economies seem to be heading toward similar questions, except the participants are no longer services. They’re autonomous decision-making entities.Execution rewards solve part of the incentive problem.

What I’m curious about is whether the long-term bottleneck becomes trust itself or the cost of continuously verifying trust.

Baris Sozen • Jun 9

Strong framing — and I think the distributed-systems lens actually sharpens the answer.
An HTLC is basically atomic commitment with a timeout-based abort: the clear-or-refund guarantee is a safety property, enforced cryptographically, so you never have to continuously verify it. That's the key move. For the part that protects principal, there's no trust to verify at all — the protocol makes the unsafe states unreachable. So "cost of continuously verifying trust" never attaches to the thing that matters most.
What's left is a liveness problem: will the counterparty actually reveal, or quote-and-ghost until the timelock expires? That's where your bottleneck question bites — but notice it's a much cheaper problem than verifying trust globally.

So my bet is the long-term bottleneck isn't trust itself and isn't the cost of verifying it continuously. It's pricing liveness — and the nice property is you only pay that cost on the trades where it's worth paying. The safety layer stays verification-free underneath.
Curious where you'd put the line, though: at what trade size does continuous reputation-checking start to cost more than the griefing it prevents?

Mustafa ERBAY • Jun 9

That’s a really interesting distinction.

I agree that atomic settlement removes the need to continuously verify the safety property. Once principal protection is guaranteed by the protocol, the remaining problem becomes economic rather than cryptographic.What I’m wondering is whether repeated liveness failures eventually become a trust signal themselves.At some point, an agent that consistently causes timelock expirations may be economically equivalent to an untrusted counterparty, even if it never violates safety. In that sense, reputation may emerge as a compressed representation of historical liveness rather than historical honesty.

Baris Sozen • Jun 9

true