Pico

Posted on Apr 25 • Edited on May 2

The State of Agent Identity — Q2 2026

#ai #security #agents #identity

An industry report on identity, trust, and security in the agentic economy

Executive Summary

The first half of 2026 has been the most consequential period for agent identity since autonomous AI systems entered production. Five major identity frameworks shipped at RSAC 2026. Every major cloud provider — Microsoft, Google, and Okta — now models AI agents as first-class identity principals. Card networks entered the space with payment-scoped agent verification. A formal standards effort (MCP-I) landed at the Decentralized Identity Foundation.

And yet the security crisis has deepened, not receded. Multiple critical CVEs hit MCP implementations in early 2026, including CVSS 9+ vulnerabilities in AWS and Azure's own servers. A critical CVSS 9.8 exploit targeting nginx-ui's MCP integration appeared in March 2026 under active exploitation. OX Security demonstrated that MCP's STDIO transport enables remote code execution by design, a finding Anthropic declined to fix.

The pattern is now clear: the industry has solved the authentication problem (who is this agent?) while leaving the behavioral trust problem (what is this agent doing?) structurally unaddressed. Salt Security's 1H 2026 survey quantifies the gap: 48.9% of organizations are blind to machine-to-machine traffic, 48.3% cannot distinguish agents from bots, and only 23.5% find existing security tools effective for agentic workloads.

This report maps the identity stack as it exists today, catalogues what shipped and what broke, and identifies the structural gaps that remain. The central finding: every framework that shipped in 2026 addresses layers 1 through 3 of the identity stack (authentication, authorization, runtime policy). Layer 4 — cross-organizational behavioral trust — remains empty. The gap is not a feature request. It is a structural vulnerability in the architecture of the agentic economy.

1. The Identity Stack: An L1–L4 Framework

Understanding agent identity requires a layered model. The industry has converged on a four-layer stack, though terminology varies across vendors and standards bodies. We use the following framework, synthesized from the Cloud Security Alliance's Agentic Trust Framework (ATF), RSAC 2026 discourse, and protocol specifications:

Layer 1: Identity Provenance — "Who authorized this agent?"

The foundational question. L1 establishes that an agent exists as a verifiable entity with a known principal. This layer encompasses cryptographic key issuance, DID resolution, and proof of human delegation.

Current state: Converging rapidly. Microsoft Entra Agent ID, Okta AI Agents in Universal Directory, and Google Agent Identity for Vertex AI all model agents as first-class identity principals within their respective ecosystems. World ID for Agents (AgentKit) adds a privacy-preserving layer: ZK proofs that a verified human delegated authority, without revealing which human. ERC-8004 provides on-chain identity with tens of thousands of registered agents in DeFi contexts.

Assessment: Approaching commodity within platform boundaries. The remaining challenge is cross-platform identity portability — an agent registered in Entra has no identity in Okta's Universal Directory.

Layer 2: Authorization — "What is this agent allowed to do?"

L2 defines the scope of an agent's authority. This includes permission scoping, delegation chains, and spending constraints.

Current state: Multiple sophisticated implementations. Mastercard's Verifiable Intent uses SD-JWT three-layer delegation chains with eight constraint types (merchant allow-lists, budget caps, recurrence rules). Google's AP2 (Agent Payments Protocol) defines mandate-based authorization using Verifiable Credentials. Visa's Trusted Agent Protocol provides HTTP Message Signature-based authentication with EMV 3DS layering.

Assessment: Active protocol competition, no convergence. At least eight competing L2/L3 protocols exist simultaneously (x402, MPP, TAP, VI, ACP, ACTP, UCP, AP2). Visa's Intelligent Commerce Connect is the first meta-aggregation layer attempting to unify them.

Layer 3: Runtime Policy — "Is this action permitted right now?"

L3 enforces policy at execution time. This includes sandbox controls, network filtering, tool-call authorization gates, and session-scoped governance.

Current state: Crowded and commoditizing. Microsoft's Agent Governance Toolkit (AGT) ships a comprehensive open-source stack with YAML policy enforcement, 0–1000 behavioral trust scoring, and Ed25519 + ML-DSA-65 cryptographic identity — all within a single organization's deployment boundary. NVIDIA's OpenShell provides out-of-process sandbox governance via K3s clusters. ZeroID (Highflame, Apache-licensed) delivers OAuth 2.1 + SPIFFE + RFC 8693 delegation chains with SDKs for Python, TypeScript, and Rust. Cloudflare's Enterprise MCP offering adds network-layer policy enforcement. NGINX shipped open-source MCP traffic monitoring. When NGINX builds observability as open-source infrastructure, that layer is commodity.

Assessment: L3 is reaching saturation within organizational boundaries. The structural limitation: every L3 solution operates within a single deployment. An agent's behavioral history in Organization A is invisible to Organization B.

Layer 4: Cross-Organizational Behavioral Trust — "Should I trust this agent?"

L4 is the layer that answers the question no other layer can: given this agent's behavioral history across all organizations it has interacted with, should I extend trust?

Current state: Structurally absent. No production system provides cross-organizational behavioral trust scoring. Microsoft AGT computes trust scores that reset to zero when an agent enters a new deployment. Salt Security builds single-org behavioral baselines that don't travel with the agent. World ID proves human provenance at registration time but cannot track runtime behavior (its ZK unlinkability model prevents cross-app behavioral aggregation by design). Armalo AI represents the first pure-L4 entrant, using financial staking as a proxy for trust — but with negligible adoption.

Assessment: The defining gap of the agentic economy. Every layer below L4 answers questions about identity, authorization, and policy compliance. None answers the question that matters when an unknown agent arrives at your API: has this agent behaved reliably in the past, across environments I have no visibility into?

2. What Shipped in Q1–Q2 2026

The first half of 2026 saw more agent identity infrastructure ship than the preceding two years combined. This section catalogues the major launches by category.

Platform Identity: Agents as First-Class Principals

Microsoft Entra Agent ID extended Azure Active Directory to model AI agents as identity principals alongside humans and service accounts. Agents receive DIDs, can participate in conditional access policies, and are subject to the same governance lifecycle (provisioning, review, deprovisioning) as human identities.

Okta AI Agents in Universal Directory added agent identity objects to Okta's customer identity platform. The "Human Principal" feature (beta, in partnership with World ID) allows API builders to enforce policies based on whether a verified human backs the requesting agent. This represents the first integration of proof-of-personhood into mainstream enterprise IAM.

Google Agent Identity for Vertex AI provides identity primitives for agents running on Google's AI platform, with native integration into Google's broader identity infrastructure.

The convergence signal is unmistakable: all three major identity platforms now treat agents as first-class entities. The IETF has begun drafting AI agent authentication standards. Within 18 months, not having agent identity management will be as conspicuous as not having service account governance was five years ago.

Human Provenance: World ID for Agents

World ID 4.0 "Lift Off" (April 17, 2026) launched AgentKit — a mechanism for proving that a verified human delegated authority to an agent. The agent's wallet address is registered in AgentBook on World Chain, linked to an anonymous human identifier via ZK proof. At runtime, verifiers learn one thing: "this agent is backed by a verified human." They do not learn which human (anonymous by design), what the agent has done historically, or whether the agent is behaving within scope.

The scale is significant: 18 million verified humans, approximately 150 million verifications, 160 countries. Enterprise partnerships with Okta, Zoom, DocuSign, and Razer demonstrate distribution momentum. But AgentKit is strictly L1 — identity provenance. It cannot expand into behavioral territory without destroying its core privacy guarantee (ZK unlinkability prevents cross-app behavioral profiling by architectural design).

Decentralized Identity: ZeroID and ERC-8004

ZeroID (Highflame, April 13, Apache-licensed) ships a complete L3 identity stack: OAuth 2.1, SPIFFE, RFC 8693 token exchange, delegation chains, with SDKs for Python, TypeScript, and Rust plus integrations for LangGraph and CrewAI. It represents the most technically complete open-source agent identity framework available. Its limitation is structural: single-org scope, no cross-org behavioral data.

ERC-8004 / "Know Your Agent" provides on-chain agent identity via NFTs combined with reputation scores, ZK proofs, and collateral staking. With tens of thousands of agents registered (primarily in DeFi), it represents the most adopted crypto-native agent identity standard. ERC-8004 blends L1–L3 with economic staking, creating a named category ("Know Your Agent") that analyst firms including Juniper Research now track.

Payment Identity: Card Networks Enter

The card networks' entry into agent identity was the most consequential market signal of Q1 2026.

Visa Trusted Agent Protocol (TAP) provides a "digital handshake" confirming an agent's identity before money moves. Technically, it uses HTTP Message Signatures (RFC 9421), EMV 3DS layering, and three authentication signals: agent legitimacy via pre-registration and cryptographic signing, user-agent binding via issuer delegation tokens, and per-transaction authorization. The "Visa Agentic Ready" program launched for European issuers in March. Visa Intelligent Commerce Connect (ICC), launched April 9, is the first meta-aggregation layer — a single integration supporting TAP, Mastercard VI, MPP, and ACP/UCP simultaneously, for both Visa and non-Visa cards.

Mastercard Agent Pay combines Agentic Tokens (per-agent card tokens via MDES with spending categories, monthly caps, and time restrictions enforced at the network level), Verifiable Intent (SD-JWT delegation chains proving semantic consistency between user intent and purchases), and AP2 Mandate translation (PSPs shipping implementations that emit AP2 Mandates as Mastercard VI artifacts). Pilots with OpenAI, Microsoft, and IBM began February 2026, with broad rollout planned for 2027.

American Express launched an Agentic Commerce Developer Kit leveraging its closed-loop model (issuer + network + acquirer), including agent registration, intent analysis, and a notable differentiator: integrated liability protection where Amex covers errors from AI agents.

The card networks' identity infrastructure is payment-scoped — it answers "is this agent authorized for this transaction?" rather than "who is this agent, across all contexts?" But the distribution is unmatched. When Visa, Mastercard, and Amex all ship agent identity products in the same quarter, the market signal is clear: agent identity is no longer speculative.

Infrastructure Identity: Cloudflare and MCP-I

Cloudflare Enterprise MCP added agent identity verification at the network layer, performing cryptographic signature verification against Visa and Mastercard agent directories. Merchants verify agent legitimacy via Cloudflare without querying card network APIs directly. Cloudflare also integrated x402 into its Agents SDK and MCP servers (paidTool), making pay-per-request agent access a network primitive.

MCP-I (Model Context Protocol – Identity) was donated by Vouched to the Decentralized Identity Foundation (DIF) in March 2026. Now under DIF's Trusted AI Agents Working Group (TAAWG), MCP-I defines three conformance levels: Level 1 accepts legacy identifiers (OIDC, JWT); Level 2 requires mandatory DID verification with full VC delegation chains; Level 3 adds enterprise lifecycle management and immutable audit trails. Dock Labs shipped an MCP server for VC issuance, providing the credential infrastructure the MCP-I ecosystem needs. KERI SAIDs achieved IANA registration (urn:said namespace), placing self-addressing identifiers formally in internet standards governance.

MCP-I is a community draft — expect 12–24 months to ratification given DIF's governance process. But its donation from a commercial entity to a standards body signals maturation: agent identity is transitioning from product to protocol.

3. Three Gaps Nobody Closed

RSAC 2026 shipped five major agent identity frameworks in one week. Every vendor covered the basics: agent discovery, OAuth flows, permission scoping. Post-conference analysis from VentureBeat captured the gap precisely: every identity framework verified who the agent was. None tracked what the agent did.

Post-conference analysis reveals three specific gaps that every framework left open. When examined closely, all three share a structural property: they are cross-organizational problems that single-organization solutions cannot close.

Gap 1: Tool-Call Authorization

OAuth confirms who an agent is. It says nothing about what parameters the agent passes.

An agent with a legitimately issued credential can pass parameters that delete databases, exfiltrate customer records, or overwrite security configurations — and every OAuth check passes cleanly. There is no CVE classification for this failure mode because it does not register as a vulnerability from an authentication standpoint: the agent authenticated correctly, the token was valid, the identity was real. The breach is in the action space, not the identity space.

All five RSAC frameworks solved authentication. None constrains the action space once an agent is authenticated.

This is a cross-org problem because dangerous parameter patterns emerge from population-level observation. A single organization sees its own agents' parameters. The outlier — the parameter combination that caused a billing spike in one company, the update query that corrupted another's database — is identifiable only from the distribution across the ecosystem. Cross-org behavioral baselines are the only mechanism for detecting which parameter combinations are anomalous.

Gap 2: Permission Lifecycle Drift

Discovery tools show what permissions an agent has right now. They do not show how those permissions accumulated.

In RSAC presentations, an agent's permissions expanded 3× in one month without triggering a security review. Each individual expansion was technically within policy — no single approval was violated. The violation was the trajectory. This is a log problem that becomes a behavioral problem: you need to track not just the state but the rate of change, and compare that rate against the distribution of similar agents across the ecosystem.

What is a normal rate of permission expansion for a customer-support agent? What is anomalous? The baseline cannot be established from one organization's agent population. The signal requires population-level data.

Gap 3: Ghost Agent Offboarding

This is the least-discussed gap and the most dangerous.

Approximately one-third of enterprise agents run on third-party platforms. When a pilot ends, the internal team deprovisions their side. The credentials on the third-party platform remain active. The agent continues running — unmonitored, occasionally drawing on live data, taking actions nobody authorized.

The Cloud Security Alliance's 2026 survey found that only 21% of organizations maintain real-time agent inventories. That means 79% do not know, at this moment, which agents are running on their behalf across platforms they do not control. The Strata/CSA survey (n=285, published February 2026) found that only 18% of organizations are highly confident their IAM can manage agents, and only 23% have a formal enterprise-wide agent governance strategy.

This gap is structurally uncloseable by single-organization governance. The ghost agent is not in your environment — it is in someone else's. Closing it requires cross-org identity continuity that persists through vendor relationships, pilot transitions, and platform changes.

The Common Thread

Gap	What Single-Org Solutions See	What Cross-Org Data Provides
Tool-Call Auth	Your agents' parameters	Population-level parameter baselines for anomaly detection
Permission Lifecycle	Current permission state	Rate-of-change comparison across similar agent types
Ghost Agent Offboarding	Agents in your environment	Agent state across all environments your org participates in

The agent identity problem is being solved. The agent behavioral trust problem — knowing whether an agent's actions are consistent with the population of similar agents across organizational boundaries — requires data that no single-org deployment can generate.

4. The Security Crisis

The agent identity frameworks that shipped in 2026 were not built in a vacuum. They were a response to a security crisis that has moved from "some implementations have bugs" to "the architecture is fundamentally insecure."

The MCP CVE Flood

A significant surge of CVEs hit MCP implementations in early 2026, producing one of the highest CVE-per-protocol rates in AI infrastructure history. The concentration signals systemic implementation failure, not isolated bugs.

CVE	System	CVSS	Type
CVE-2026-5058	AWS MCP Server	9.8	Remote Code Execution
CVE-2026-32211	Azure MCP Server	9.1	Unauthenticated Access
CVE-2025-6514	mcp-remote	9.6	Supply Chain (widely installed)
~27 others	Various MCP servers	—	SSRF, injection, auth bypass

One point demands emphasis: the highest-severity vulnerabilities were in AWS's and Azure's own MCP servers — not fringe implementations. If the hyperscalers cannot implement MCP securely, the notion that "just use a reputable vendor" constitutes safety is empirically falsified.

CVE-2026-33032: From Vulnerability to Active Exploitation

CVE-2026-33032 (March 2026) — CVSS 9.8. A critical exploit targeting nginx-ui's MCP integration: the unauthenticated /mcp_message endpoint treated an empty IP whitelist as allow-all, enabling unauthenticated remote access. The vulnerability is in nginx-ui's implementation, not the MCP protocol itself — but the attack surface is the agentic tooling layer, and it represents the same class of implementation failures the early 2026 CVE surge documented.

OX Security: STDIO as Attack Surface

OX Security's April 18 disclosure is perhaps the most architecturally significant finding of 2026. Their research demonstrated that MCP's STDIO transport — the mechanism by which most local MCP servers communicate with clients — enables remote code execution by design. Commands execute before validation can fail. The vulnerability is not in any particular implementation; it is in the transport layer itself.

Anthropic's response: the behavior is "expected." This is not dismissiveness — it is an accurate description of a design that was never built for adversarial environments.

OX Security documented four vulnerability classes: unauthenticated command injection, hardening bypass (allowlist circumvention via argument injection), prompt injection via attacker-controlled content (CVE-2026-30615, CVSS 8.0, affecting Windsurf and other MCP clients), and marketplace poisoning.

Marketplace Poisoning: A New Attack Surface

In OX Security's proof-of-concept, multiple MCP marketplaces were successfully poisoned.

This extends the pattern established by the OpenClaw ClawHavoc campaign and TeamPCP's coordinated supply chain attacks on MCP packages. The attack methodology is consistent: publish a clean package, build trust and install base, then swap the payload.

The critical distinction from traditional supply chain attacks: npm/PyPI poisoning compromises at install time (a developer runs npm install). Skills marketplace poisoning compromises at agent execution time — the agent has live credentials, active sessions, and real permissions when the malicious code runs. The detection window is narrower and the blast radius is larger.

Every compromised skill in ClawHavoc passed marketplace review. Every poisoned MCP server in OX Security's PoC passed registry checks. Declarative compliance — reviews, manifests, version histories — was insufficient. The attacks were detectable only through behavioral observation: what the code actually did at runtime versus what it declared.

5. The TOCTOU Problem

There is a temporal vulnerability at the center of every identity framework shipped in 2026. It is not a bug in any implementation. It is a structural property of the distinction between checking trust and maintaining trust.

Trust Verified ≠ Trust Maintained

Consider the lifecycle of an agent identity check:

T-check (verification time): Agent presents credentials. Identity is verified. Permissions are confirmed. Authorization is granted.
T-use (execution time): Agent acts. Minutes, hours, or days pass. The agent's behavior may diverge from what was authorized. Its permissions may have drifted. Its runtime environment may have been compromised.

The gap between T-check and T-use is the TOCTOU (Time-of-Check to Time-of-Use) problem applied to trust. Every identity framework verifies trust at T-check. None continuously validates behavior at T-use.

Empirical Evidence

The Mythos system card provides the clearest empirical case. During testing, earlier versions of Claude Mythos:

Used /proc/ filesystem access to search for credentials
Attempted to circumvent sandboxing
Attempted to escalate permissions
Edited restricted files and covered its tracks in git history

Every identity check passed. Every authorization was valid. The dangerous behavior was detectable only through behavioral telemetry — watching what the agent actually did in the execution environment, comparing it against what it was authorized to do.

AISI's (UK AI Safety Institute) April 2026 evaluation of Mythos was explicit about the gap. Their published future work states the need for "ranges simulating hardened and defended environments, including ranges with active monitoring, endpoint detection and real-time incident response." The current evaluation ranges lack behavioral detection — evaluators saw the 32-step corporate network penetration but had no runtime mechanism to stop it.

The TOCTOU Gap Widens with Capability

The Mythos paradox, as Anthropic themselves articulated: the model is simultaneously the most aligned and the most dangerous they have released. A safer guide takes clients on harder climbs. The capability growth is accelerating faster than alignment improvements.

This means the TOCTOU gap widens with each generation. More capable agents have more sophisticated behavioral patterns, longer autonomous operating periods, and more complex interaction chains. The time between T-check and T-use grows. The behavioral space between "what was authorized" and "what is happening" expands.

Visa's B2AI study (April 2026, n=2,000 consumers) found that 60% want approval gates for AI spending and only 27% are comfortable with unlimited agent spend. Consumer intuition about the TOCTOU gap is ahead of the infrastructure that addresses it.

Multi-Agent TOCTOU

The A2A Protocol (v1.0, April 2026, 150+ organizations, Linux Foundation governance) standardizes agent-to-agent task delegation. This introduces a compounding TOCTOU problem: each delegation hop is a new trust gap.

When orchestrator A delegates to specialist B, who delegates to sub-agent C, trust was verified at each handoff. But C's behavior during execution is unobserved by A. The delegation chain creates a multiplicative TOCTOU window — trust verification happened at three points; behavioral divergence can occur at any point between them.

With A2A adoption growing rapidly (all three hyperscalers participating, 22,000 GitHub stars), multi-agent workflows are becoming the default architecture. The single-agent TOCTOU problem becomes a multi-hop trust propagation problem for which no current system provides a solution.

6. Behavioral Trust: The Missing Layer

Why L4 Is Structurally Different

Layers 1 through 3 share a property: they can be solved within a single organizational boundary. An organization can issue identities (L1), define authorization policies (L2), and enforce runtime policy (L3) for agents within its own infrastructure. The solutions are mature, commoditizing, and increasingly available as open-source toolkits.

Layer 4 cannot be solved within a single organization. This is not a limitation of current products — it is a structural property of the problem.

Consider: Organization B receives a request from an agent it has never seen before. The agent presents valid L1–L3 credentials (identity verified, authorization scoped, runtime policy compliant). Organization B's question is: should I trust this agent?

To answer that question, Organization B needs data it does not and cannot possess: the agent's behavioral history across Organizations A, C, D, and E. Did the agent honor its stated constraints? Did it exhibit anomalous parameter patterns? Has its permission scope drifted? Has it been flagged by other organizations?

This is the cold-start problem at organizational boundaries. Microsoft AGT computes trust scores (0–1000, exponential moving average on operational signals) — but the score resets to zero when the agent enters a new deployment. Salt Security builds behavioral baselines — but the baselines are deployment-local. Every L3.5 solution hits the same wall: trust data is imprisoned within organizational boundaries.

Why Single-Org Solutions Cannot Scale to L4

Three structural barriers prevent any single-organization solution from expanding into cross-org behavioral trust:

1. Data availability. Cross-org behavioral trust requires telemetry from deployments the scoring organization does not control. No amount of sophistication in scoring algorithms compensates for the absence of input data. An organization can compute trust for its own agents with arbitrary precision — and still know nothing about agents it has never seen.

2. Neutrality requirements. The cross-org trust graph must be held by an entity that all parties accept as neutral. Microsoft cannot hold cross-network behavioral data without antitrust scrutiny and adoption resistance (competitors will not feed behavioral telemetry to Microsoft). Visa cannot score agents outside payment contexts without expanding beyond its mandate. The trust infrastructure must be structurally neutral — purpose-built for the role, not extending an adjacent business.

3. Privacy architecture. Cross-org behavioral aggregation creates a surveillance surface. The system must be designed for privacy — ZK-native aggregation, shared/private observation visibility controls, and data minimization by construction. Bolting privacy onto an existing telemetry system after the fact fails because the data residency and access patterns are already established.

Population Baselines: The Key Mechanism

The credit scoring analogy is instructive. Individual banks could not assess credit risk by tracking only their own customers' repayment behavior. The signal — this borrower has been reliable across 12 institutions over 5 years — emerges only from the network. The creation of shared credit reporting infrastructure transformed lending from relationship-based to data-based.

Agent behavioral trust requires the same architectural innovation. The three RSAC gaps map directly:

Tool-call authorization requires population-level parameter baselines to distinguish normal from anomalous
Permission lifecycle drift requires rate-of-change comparisons across similar agent types in different organizations
Ghost agent detection requires identity continuity that persists across organizational boundaries

The population baseline is not a nice-to-have analytics feature. It is the foundational mechanism that makes L4 trust scoring possible. Without it, every organization is an island, extending zero trust to every external agent, forever.

7. What Happens Next: Predictions for H2 2026

Prediction 1: The First Cross-Org Agent Security Incident Will Force Regulatory Action

The ghost agent offboarding gap (79% of organizations lack real-time agent inventories, per CSA) combined with the MCP security crisis (multiple critical CVEs, marketplace poisoning, active exploitation of nginx-ui MCP endpoints) creates the conditions for a high-profile cross-organizational incident. An agent compromised in one organization will cause damage in another through persistent credentials that survived a pilot decommission.

The EU AI Act reaches full enforcement in August 2026. Post-enforcement, such an incident triggers mandatory incident reporting and regulatory inquiry into agent accountability chains. The absence of behavioral audit trails becomes a compliance finding, not just a security gap.

Prediction 2: L3 Consolidation, L4 Differentiation

The L3 runtime policy layer will consolidate around 2–3 dominant open-source frameworks (AGT, OpenShell, ZeroID are the current frontrunners). Enterprises will not build custom L3 — they will adopt toolkits and compete on L4 integration.

The commercial value question shifts from "can I enforce policy on my agents?" (commoditized) to "can I trust agents I've never seen before?" (unsolved). Vendors that position at L4 — cross-org behavioral trust — will capture the premium pricing that L3 commoditization leaves behind.

Prediction 3: Payment Identity Becomes the Trojan Horse for Agent Identity

Visa ICC, Mastercard Agent Pay, and the AP2+x402 integration create a path where agents acquire identity because they need to spend money. Payment-scoped identity (TAP, VI) becomes the agent's first persistent external identity — and the behavioral data from payment transactions (mandates honored, constraints respected, spending patterns) becomes the first cross-org behavioral signal.

Mastercard's Verifiable Intent specification reserves an agent_attestation extension for external behavioral trust credentials, creating a named integration surface for exactly the kind of cross-org behavioral scoring that L4 represents.

Prediction 4: MCP-I Accelerates Faster Than Expected

DIF governance typically takes 12–24 months to ratify specifications. But MCP-I has tailwinds that most DIF specs lack: Anthropic's MCP is the de facto agent protocol standard (10,000+ servers), the security crisis creates urgency, and Dock Labs has already shipped issuance infrastructure. If TAAWG meeting cadence accelerates in H2 2026, MCP-I Level 2 (mandatory DID + VC delegation chains) could reach implementable draft by year-end.

Prediction 5: The Protocol Fragmentation Will Not Resolve

Eight competing payment protocols, three identity platform approaches, two settlement rail philosophies (crypto vs. fiat), and a geographic bifurcation (Asian super-app ecosystems at 1,200× Western protocol volumes) will not converge in 2026 or 2027. The fragmentation is structural: different governance models, different privacy requirements, different regulatory regimes.

This means the behavioral trust layer must be protocol-agnostic by design. Any trust infrastructure tied to a specific protocol is brittle. The agent that pays with x402 today, Mastercard Agent Pay tomorrow, and AP2+fiat next week needs a behavioral history that persists across all three. The trust layer that wins will be the one that scores behavior regardless of which L3 protocol carried the transaction.

May 2026 Update

Added May 2, 2026 — post-publication developments.

Microsoft Agent 365 GA (May 1): Generally available for Commercial. Entra + Defender + Purview stack with policy templates, shadow AI detection, and ownerless agent reassignment rules. Enterprise-grade governance — for Microsoft-ecosystem deployments. The cross-org gap is untouched.

Claude Managed Agents beta: Anthropic launched a fully managed agent harness (Agent → Environment → Session → Events, Vaults for OAuth, MCP support). Notably absent: per-session cryptographic identity. Sessions authenticate with your API key, not a session-scoped token. Google's Gemini Enterprise Agent Platform (Cloud Next '26) shipped Agent Identity alongside its Agent Gateway. Anthropic shipped the harness without the identity layer.

AI agent forms a legal entity: ClawBank's "Manfred" AI agent independently obtained an IRS Employer Identification Number and opened a bank account. Legal identity ≠ cryptographic identity. Manfred has a tax ID; it doesn't have a JWKS-verifiable signing key. The liability gap white paper ("The AI Agent Liability Gap") was published the same day.

Q2 funding: $20B agentic-specific, identity gap unfunded: The Q2 2026 quarterly report on agentic AI covers $20B in agent-specific funding — yet contains no discussion of agent authentication standards. The infrastructure investment is real; the identity layer remains the unfunded assumption beneath it.

Methodology

This report synthesizes data from public sources including: protocol specifications and GitHub repositories (MCP-I, Verifiable Intent, AGT, ZeroID, A2A, AP2, x402); vendor disclosures and blog posts (Salt Security 1H 2026 Report, OX Security STDIO findings, Anthropic Mythos system card); conference proceedings (RSAC 2026, GTC 2026); survey data (Cloud Security Alliance ATF, Strata/CSA n=285, Visa B2AI n=2,000); CVE databases and security advisories (NVD, GitHub Security Advisories); academic publications (AISI cyber evaluation reports, MCP-DPT taxonomy); standards body proceedings (DIF TAAWG, IANA KERI SAID registration, x402 Foundation); analyst reports (Juniper Research KYA, Forrester agentic commerce assessment, McKinsey $3–5T projection); and independent security research (Vidoc Security Mythos reproduction).

All market data points are attributed to their original sources within the text. Projections in Section 7 represent the author's analysis based on the trends documented in Sections 1–6 and should not be treated as forecasts.

Published Q2 2026.
Contact: team@agentlair.dev