tags: ai, webdev, javascript, googledev
TL;DR
- AI agents currently interact with websites by simulating UI actions: fragile, slow, and invisible to the site itself
- Google released an early preview of WebMCP, a new web standard to fix this
- WebMCP lets websites expose structured JavaScript tools that agents call directly: no more fake clicks
- It proposes two new APIs: a Declarative API (HTML forms) and an Imperative API (JavaScript)
- Human-in-the-loop is a core design principle; fully autonomous agents are explicitly out of scope
- WebMCP is not a replacement for MCP; they complement each other (backend vs. frontend layer)
- Security considerations (model poisoning, cross-origin isolation, permissions) are already on the radar
The Problem: AI Agents Are "Faking" Web Interactions
Picture this: you ask your AI agent to book a flight. Behind the scenes, it's doing something like this:
- Simulate a click on the search field
- Type in the destination character by character
- Wait for the DOM to update
- Parse the rendered HTML to find the results
- Simulate another click on the "Book" button
- Hope nothing changed since last time
This approach, often called browser automation, works well enough for demos. But in production? It's a mess.
The website has no idea an agent is interacting with it. There's no handshake, no contract, no structure. The agent is just guessing what to do based on what it can see.
Any UI update can silently break the workflow. Error messages are vague. There's no way for the site to say "hey, here's the right way to do this." And performance is terrible compared to direct API calls.
The web was built for humans. AI agents have been crashing the party uninvited, faking their way through every interaction.
What Google Just Announced
On February 10, 2026, Google released an early preview of WebMCP, a proposed web standard designed to give AI agents a proper seat at the table.
The core idea:
"WebMCP provides a standard way for web applications to expose structured tools, so AI agents can interact with websites faster, more reliably, and more accurately."
Instead of agents reverse-engineering your UI, your website gets to define exactly how agents should interact with it. Flight booking, support ticket submission, complex data navigation: all of these become structured, predictable, and auditable.
The spec is open. The GitHub repo is public. And it's already generating serious discussion in the web standards community.
How WebMCP Works
The key shift WebMCP introduces is simple but powerful: websites publish structured tools, agents call those tools directly.
Here's how it compares to the current state of affairs:
| Approach | How the Agent Interacts | Reliability | Speed |
|---|---|---|---|
| Traditional UI automation | Simulates clicks & keystrokes | ❌ Breaks on any UI change | 🐢 Slow |
| WebMCP | Calls structured JavaScript tools directly | ✅ Stable & predictable | ⚡ Fast |
WebMCP proposes two new browser APIs to make this work:
- Declarative API: handles standard actions defined directly in HTML forms. Think of it as a machine-readable `<form>` that agents understand natively.
- Imperative API: handles more complex, dynamic interactions that require JavaScript execution, for everything that can't be expressed declaratively.
Together, these APIs let you make your web app "agent-ready" without throwing away your existing codebase. In fact, one of the explicit design goals is to let you reuse existing JavaScript code to create tools. Low migration cost, high payoff.
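To make the Imperative API concrete, here is a sketch of what registering a tool might look like. Be aware that the exact surface is still in flux: the entry point (`navigator.modelContext.registerTool`) and the property names (`inputSchema`, `execute`) are assumptions based on the early proposal and may change, and `searchFlights` is a made-up stand-in for existing page logic.

```javascript
// Existing app logic, reused as-is (hypothetical helper).
function searchFlights({ from, to }) {
  // A real app would query its backend here; this returns a stub.
  return { results: [{ flight: "XY123", from, to }] };
}

// A WebMCP-style tool definition. Property names are assumptions
// based on the draft proposal and may change as it evolves.
const searchFlightsTool = {
  name: "search_flights",
  description: "Search for flights between two airports.",
  inputSchema: {
    type: "object",
    properties: {
      from: { type: "string", description: "Origin IATA code" },
      to: { type: "string", description: "Destination IATA code" },
    },
    required: ["from", "to"],
  },
  execute(input) {
    return searchFlights(input); // reuse existing page JavaScript
  },
};

// Feature-detect the (hypothetical) registration entry point so the
// page keeps working in browsers without WebMCP support.
globalThis.navigator?.modelContext?.registerTool?.(searchFlightsTool);
```

Note how little new code the tool itself needs: the schema and description are for the agent's benefit, while `execute` just delegates to logic the page already has.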
Design Goals (and What's Out of Scope)
WebMCP has four stated design goals. They're worth reading carefully, because they tell you a lot about what Google is (and isn't) trying to build.
1. Human-in-the-loop workflows
Users delegate tasks to AI agents while maintaining visibility and control. The agent assists β it doesn't replace the human.
2. Simplified AI agent integration
Agents interact through well-defined JavaScript tools, not brittle UI scraping. Reliable by design.
3. Minimal developer burden
Reuse your existing page JavaScript to create tools. You don't have to rebuild your app from scratch.
4. Improved accessibility
Assistive technologies get a standardized way to access web app functionality, a nice side effect with real-world impact.
What WebMCP is NOT trying to do
This is just as important. WebMCP explicitly rules out:
- Headless browsing without human oversight: no ghost agents running in the background
- Fully autonomous agent workflows: for that, Google points to the A2A (Agent-to-Agent) protocol instead
- Replacing MCP or backend integrations: WebMCP lives at the frontend layer only
- Replacing human-facing interfaces: your UI is still for humans
- Agent discoverability: figuring out which tools exist isn't WebMCP's job
Security considerations already on the table
The spec also flags three security areas that need careful handling:
- Model poisoning: malicious tool definitions could corrupt AI model behavior
- Cross-origin isolation: tool calls across different origins need strict security boundaries
- Permission management: who decides which agents can call which tools, and when?
These aren't solved problems yet. But the fact that they're called out explicitly in the early spec is a good sign.
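One pattern that addresses the permission question while keeping the human in the loop is to wrap every tool so it cannot run without explicit user confirmation. The sketch below is not part of any spec; `confirmWithUser` is a stand-in for whatever consent UI the browser or page eventually provides.

```javascript
// Wrap a tool so its execute() only runs after user consent.
// confirmWithUser(message, input) should resolve to true or false;
// it is a hypothetical hook, not a WebMCP API.
function withUserConsent(tool, confirmWithUser) {
  return {
    ...tool,
    async execute(input) {
      const ok = await confirmWithUser(
        `Allow agent to call "${tool.name}"?`, input);
      if (!ok) throw new Error(`User denied tool call: ${tool.name}`);
      return tool.execute(input);
    },
  };
}
```

The design choice here is that denial is an error the agent sees, which gives the site an auditable record of what was requested and what the user actually approved.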
WebMCP vs MCP: It's Not a Competition
If you've been working with MCP (Model Context Protocol, originally from Anthropic), you might be wondering: "Wait, don't I already have this?"
Not quite. Here's how the landscape looks:
| Era | How Agents Interact with the Web | Website's Role | Reliability |
|---|---|---|---|
| Before MCP | DOM scraping & click simulation | Passive victim | ❌ Fragile |
| After MCP | Direct backend API calls | Backend responds | ✅ (backend only) |
| After WebMCP | Structured tool calls on the UI layer | Frontend + backend both respond | ✅ Fast & full-stack |
The clean mental model:
- MCP = agents talking directly to your backend services
- WebMCP = agents interacting with your frontend UI layer, with the user present and in control
They're not competing. They're covering different parts of the stack. A website using WebMCP can be thought of as an MCP server implemented in client-side JavaScript, with the key difference that the user, the app, and the agent all share context in real time.
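To make that mental model concrete, here is a sketch of a single tool definition whose business logic could back both an MCP server on the backend and a WebMCP registration on the frontend. All names here are illustrative and not taken from either spec.

```javascript
// One tool shape, two transports: the same factory can produce a
// definition for a backend MCP server or an in-page WebMCP tool.
// lookupOrder is injected so each environment supplies its own
// data access (database on the server, fetch() in the page).
function createOrderStatusTool(lookupOrder) {
  return {
    name: "get_order_status",
    description: "Look up the status of an order by id.",
    inputSchema: {
      type: "object",
      properties: { id: { type: "string" } },
      required: ["id"],
    },
    execute: ({ id }) => lookupOrder(id),
  };
}

// Backend (sketch): hand the definition to an MCP server.
//   mcpServer.addTool(createOrderStatusTool(dbLookup));

// Frontend (sketch): register it for WebMCP-capable agents.
//   navigator.modelContext?.registerTool?.(createOrderStatusTool(apiLookup));
```

The payoff is that the schema and description, which the agent relies on, are written once and stay consistent across both layers.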
A Real-World Engineer's Take
I work in the semiconductor and manufacturing space, where AI agents are increasingly being used for things like equipment parameter tuning, defect analysis, and navigating complex ERP systems.
And the UI automation problem is very real in industrial settings.
Manufacturing control panels, MES systems, and legacy ERP dashboards were never designed with AI agents in mind. Every time the software gets updated and the layout shifts, automated workflows break silently. There are no logs. No audit trail. No way to know what the agent actually did or why it did it.
In regulated industries, that's not just inconvenient; it's a compliance problem.
WebMCP's approach of letting the application define how agents should interact with it resonates deeply in this context. It shifts the relationship from "agent guessing at UI" to "application and agent speaking the same language."
And the Human-in-the-loop principle? In manufacturing, fully autonomous agents controlling physical equipment face enormous psychological and regulatory hurdles. The model of "human supervises, agent assists" isn't just a nice idea; it's the only model that's realistically deployable today.
The question every engineering team should be asking isn't "can our AI agent use our app?"; it's "have we designed our app to work *with* AI agents?"
That distinction is going to matter a lot in the next few years.
What You Should Do Now
Bottom line: The web is about to get a first-class agent interface layer. Now is the time to get ahead of it.
Here's where to start:
- Read the spec: check out the WebMCP GitHub repo and follow the discussion
- Audit your own apps: which features would you want an agent to use? Map those out now
- Clarify your MCP vs WebMCP strategy: backend integrations and UI-layer interactions are different problems
- Update your AI security checklist: model poisoning and permission management aren't theoretical anymore
- Design for Human-in-the-loop from day one: don't build agent features that assume no human is watching
If MCP changed how agents talk to your backend, WebMCP is coming for your frontend.
Sources
- @it (ITmedia): "Google releases early preview of 'WebMCP' to standardize AI agent web interactions" (2026/03/18)
- WebMCP GitHub Repository: https://github.com/webmachinelearning/webmcp