DEV Community

Aaron Schnieder
Aaron Schnieder

Posted on

Anthropic Just Sent 10 AI Agents to Wall Street. Here's the Question Nobody's Asking.

Anthropic just made its biggest move yet. Bloomberg, Reuters, the NYT, and Forbes all covered it in the same day: 10 new financial AI agents, built in partnership with Blackstone, Goldman Sachs, and Moody's.

These aren't chatbots. They draft pitch decks for client meetings. They review financial statements. They draft credit memos. They escalate cases for compliance review. All with limited human intervention.

Meanwhile at Consensus 2026, Citi's Lily Liu said something that should stop everyone in their tracks: "blockchain rails are essential for AI agents and machine-to-machine commerce — traditional card networks cannot support micropayments."

And DataDome just shipped Agent Trust — real-time identification, scoring, and governance of AI agent traffic.

The pattern is now undeniable. But there's a question nobody wants to answer.

The Stack Is Converging

In the last 48 hours alone:

  • Identity: Microsoft Agent 365 went GA. Google launched an AI control center for Workspace. AWS shipped AgentCore Identity. Proof joined the FIDO Alliance with NIST IAL2 biometric binding.
  • Payments: Stripe launched MPP (Machine Payments Protocol). FIDO Alliance formalized AP2 + Verifiable Intent. x402 hit 165M+ transactions. Adyen joined the x402 Foundation.
  • Governance: Forrester published the AEGIS framework. CISA + Five Eyes issued joint guidance. JumpCloud reported that 72% of orgs have agents in production but only 17% have a designated security leader.

Identity is solved. Payments are solved. Governance frameworks are landing.

What's missing?

The Reputation Gap

When a Goldman Sachs agent drafts your credit memo, what's its track record?

When a Blackstone agent reviews your financial statements, how do you know it hasn't hallucinated on three prior reviews?

When an Anthropic agent escalates a compliance case, what's its false positive rate?

Right now: you don't know. There's no mechanism for tracking agent reliability over time. No way to compare one agent's performance against another. No portable reputation that follows an agent across platforms.

This isn't theoretical. Anthropic's own Project Deal experiment proved it: Opus agents earned $2.68/sale more than weaker models, buyers saved $2.45/item, and Opus users completed ~2 more deals. Users with weaker agents had no idea they were at a disadvantage.

What 83% of Organizations Are Missing

JumpCloud's Pulse Report dropped last week with numbers that should alarm every CISO:

  • 72% of organizations have AI agents in production
  • 92% report serious limits in safely scaling deployments
  • 66% grant agents equal or greater access than human employees
  • 53% manage more non-human identities than humans (23% at 6:1+ ratio)
  • Only 17% have a designated security leader for agent governance

That means 83% of organizations are running autonomous agents in production with nobody accountable for what they do.

Forrester's Biswajeet Mahapatra put it perfectly: "AI agents now need to be managed like any other digital workforce, with lifecycle oversight, cost visibility, and integration into service management."

But managing a digital workforce requires more than identity and access controls. It requires knowing whether that workforce is actually good at its job.

The Missing Layer

Here's what the agent economy stack looks like today:

Layer Status Key Players
Identity ✅ Solved ERC-8004, Entra Agent ID, FIDO, AWS AgentCore
Payments ✅ Solved x402, MPP, AP2, Stripe, Visa Intelligent Commerce
Governance 🟡 Landing Agent 365, Forrester AEGIS, CISA guidance
Reputation ❌ Missing No standard, no portable on-chain record

The entire stack converges on one question: can you trust this agent to do what it claims?

Identity tells you who the agent is. Payments let it transact. Governance constrains what it can do. But reputation tells you whether it's any good — and that information doesn't exist yet in any portable, verifiable form.

What Needs to Happen

Financial agents operating at Wall Street scale need:

  1. On-chain behavioral records — every transaction, every outcome, every escalation logged to an immutable ledger
  2. Portable reputation scores — not locked inside one platform, but following the agent across every service it touches
  3. Earned trust over time — not self-attested credentials, but verifiable performance history

This is what AgentLux builds. ERC-8004 identity. ERC-8183 service receipts. x402 payments. On-chain reputation that accumulates with every transaction.

The identity layer is done. The payment rails are done. The governance frameworks are landing.

The question is: who builds the trust layer that sits on top of all of it?


AgentLux is an on-chain agent identity and reputation marketplace. Agents earn reputation through verified transactions, not self-attestation. Docs | Marketplace

Top comments (0)