Google I/O '26 dropped today, and for the first time in a while, the enterprise announcements are the ones worth paying attention to. Not because of model benchmarks — though those are interesting — but because Google just shipped an integrated agentic stack that reaches from the model layer all the way down to the individual worker's inbox.
The Problem It's Solving
Enterprise AI has had a deployment problem. Most organizations have access to capable models, but the path from "we have Gemini" to "our teams are actually running less manual work" has involved a lot of custom integration, fragile automations, and agents that can't see across tools. What Google is trying to do with this I/O release is close that gap — ship the plumbing, not just the model.
The announcement covers five distinct products: Gemini 3.5, Gemini Omni, Google Antigravity, Gemini Spark, and a Managed Agents API on Agent Platform. Each one sits at a different layer of the stack.
How It Actually Works
Start with the model. Gemini 3.5 Flash is the new baseline — Google's claim is that it rivals larger flagship models while staying within Flash's speed and cost profile. The numbers they're citing: 76.2% on Terminal-Bench 2.1, 83.6% on MCP Atlas, and 84.2% on CharXiv for multimodal understanding. Gemini 3.5 Pro is in testing and coming next month.
That MCP Atlas benchmark is worth noting specifically. Google scored Gemini 3.5 Flash against a benchmark designed around Model Context Protocol task completion — the same protocol that's become the de facto standard for tool-using agents across the industry. Getting 83.6% there isn't just a number; it's a signal about where Google thinks the evaluation bar for agentic models should be.
Gemini Omni is the video-first model — takes text, audio, image, and video inputs and produces dynamic video output. Think post-production automation, e-commerce virtual try-ons, content localization. It's rolling out in the coming weeks via the Gemini API.
Antigravity 2.0 is where things get more interesting for developers. It's a standalone desktop app and now integrates with Agent Platform, meaning it inherits Google Cloud's data privacy protections by default. There's also an Antigravity CLI for teams that want a lighter-weight interface. The pitch from AirAsia Next's CTO: over half of their production-ready code now comes through Antigravity agentic workflows. That's a real number from a shipping company, not a demo.
Gemini Spark is the personal agent layer. It runs 24/7 in the background, connects to Workspace plus external connectors like Salesforce, Zendesk, ServiceNow, and SharePoint, and can take multi-step actions autonomously — with approval gates for anything high-risk. Every task runs in an ephemeral VM, credentials never touch the agent directly, and all traffic routes through an Agent Gateway that enforces DLP policies. The isolation story is more specific than most personal agent announcements tend to be.
The Managed Agents API lets developers spin up custom agents via a single API call, running in Google-hosted environments. No infrastructure to manage; governance and security inherit from Agent Platform automatically.
And there's CodeMender — an AI security agent from Google DeepMind, now integrated into Agent Platform. It finds vulnerabilities, proposes patches, tests them, and can apply fixes across dependent systems with developer approval.
What Developers and Enterprises Are Actually Using This For
The use cases Google is demonstrating are specific enough to be useful as a map.
For IT operations: Spark monitors ServiceNow, detects recurring incidents, creates escalated Jira tickets, drafts incident reports, and pings the right manager for approval before sending anything externally.
For sales: Spark pulls account history from Salesforce, cross-references support tickets from Zendesk, identifies churn signals, and drafts a retention strategy — sitting in draft until the salesperson approves it.
For product launches: Antigravity 2.0 handles simultaneous agent-driven execution across code generation, asset creation, and customer email drafts, all orchestrated from a single workspace.
For security: CodeMender audits codebases, recommends patches, and deploys them with human sign-off. This is particularly relevant for teams carrying compliance obligations where every change needs an audit trail.
Why This Is a Bigger Deal Than It Looks
The piece that matters most here isn't any single product — it's that Google is shipping an end-to-end agentic stack with enterprise data controls built in from the start, not bolted on.
Most enterprise AI deployments today involve stitching together a model API, a separate orchestration layer, custom connector work, and some homegrown governance layer. Google is trying to collapse that into a single platform surface where the governance, security, and agent behavior are codesigned. The Managed Agents API making Agent Platform's data protections automatic is a specific example of what that looks like in practice.
The MCP Atlas benchmark score is also a tell. Scoring Gemini 3.5 Flash against an MCP-specific benchmark is an implicit endorsement of MCP as the standard evaluation surface for agentic capability — significant given how much momentum MCP has built across the industry since Google Cloud Next '26.
Availability and Access
Gemini 3.5 Flash is live today in Gemini Enterprise, Google AI Studio, and Antigravity. Gemini Omni Flash comes in the next few weeks. Gemini Spark in the Gemini Enterprise app is rolling out soon; Workspace preview for business customers follows. Antigravity in Gemini Enterprise arrives in the coming months. Managed Agents API documentation is live at docs.cloud.google.com.
Gemini 3.5 Pro remains in testing, expected next month.
The shift from AI-assisted work to AI-executed work — with humans approving rather than doing — is the actual direction this points. Google's bet at I/O '26 is that enterprises will adopt that model faster if the security and governance story is tight from day one, not something they have to build themselves.
Follow for more coverage on MCP, agentic AI, and AI infrastructure.
Top comments (0)