Kopano Labs

Posted on May 24

Google I/O 2026 for builders: Managed Agents are the product, not Gemini 3.5 Flash

#devchallenge #googleiochallenge #ai #gemini

Google I/O Writing Challenge Submission

This is a submission for the

Transparency: I used AI tools to help research announcements and tighten prose.
The opinions and framing below are mine.

If you skimmed the

Google I/O 2026 keynote, you probably walked away thinking about Gemini 3.5 Flash and Antigravity 2.0. Fair — Google led with speed and a slick agent-first IDE.

After reading the

developer highlights post and the Interactions API docs, I think the more durable story for application developers is different:

Google is shipping an agent runtime you can call over HTTP — not just a faster chat model.

That shift matters whether you use Antigravity, Cursor, or your own stack.

Here is the design principle I keep coming back to:

Realism accommodates aesthetics. The visual layer, the developer experience, and the agent story should not preach polish while hiding operational truth. Good aesthetics do teamwork: they make the system easier to inspect, easier to trust, and harder to fak e.

The headline model is table stakes

Gemini 3.5 Flash is impressive on paper: Google claims it beats Gemini 3.1 Pro on most benchmarks while running much faster — the kind of spec sheet that matters when agents loop tools dozens of times per task.

But model launches are now quarterly events across every major lab. Speed and benchmark wins are necessary; they are not a moat.

What is harder to replicate is the harness: orchestration, sandboxed execution, persistent environments, observable steps, and first-party integrations (Workspace, Firebase, Cloud Run, Play Console test track, etc.).

That harness showed up in three places at I/O:

Layer What Google shipped Who it's for
Model Gemini 3.5 Flash Everyone calling LLMs
Harness Antigravity agent runtime + Managed Agents Teams building agentic features
Surfaces Antigravity 2.0 desktop, CLI, SDK, AI Studio (incl. Android/Kotlin) Individual builders & fast prototypes
The interesting engineering bet is row two.

{
"io_2026_thesis": "managed_agent_runtime",
"model_layer": "Gemini 3.5 Flash",
"runtime_layer": [
"planner",
"tool_router",
"sandboxed_execution",
"server_side_state",
"observable_steps"
],
"developer_question": "Who owns the loop after the prompt leaves my app?",
"realism_rule": "Aesthetics must expose system truth, not cover it."
}
Managed Agents: one API call, full agent loop
Managed Agents are the clearest "this is a product" moment in the developer narrative.

Instead of wiring your own:

planner / reasoner
tool router
code execution sandbox
multi-turn state
…you can spin up an agent that reasons, calls tools, and runs code in an isolated Linux environment through the same Antigravity harness Google uses internally — exposed via the Interactions API and AI Studio.

Google's own framing is blunt: "With a single API call." That is the pitch.

Builder / App
Interactions API
Managed Agent
Planner + tool routing
Sandboxed Linux execution
Server-side state
steps timeline
Debuggable UI + audit trail
Why the Interactions API is the real companion piece
Managed Agents sit on the Interactions API, which Google now recommends for new projects (while generateContent remains supported for stable/production paths).

Three design choices stood out to me as someone who has shipped brittle agent demos:

Server-side conversation state

You can chain turns with previous_interaction_id instead of re-uploading full chat history every request. Google stores interactions by default (store=true), with retention depending on tier (docs cite ~1 day free / ~55 days paid).

That is not philosophically new (other vendors store threads too), but it changes cost and latency math for long agent runs — especially combined with implicit caching on continued conversations.

Observable execution steps

Interactions return a structured timeline (steps) — thoughts, tool calls, function results, final output — not just a blob of text. For debugging and for building UIs that show what the agent did, that is a meaningful DX upgrade over parsing ad-hoc JSON.

Honest beta posture

The docs label Interactions as beta with documented breaking changes (e.g. outputs → steps). Google explicitly says: for production workloads that need stability features (Batch API, explicit caching, some video metadata), stay on generateContent for now.

That transparency is refreshing. It also tells you Managed Agents are for explorers and greenfield services, not drop-in replacements for every existing Gemini integration on day one.

Antigravity is distribution, not duplication
It is easy to treat Antigravity 2.0 (desktop app), Antigravity CLI, and the Antigravity SDK as three unrelated SKUs. I read them as one runtime, three go-to-market paths:

Desktop / CLI — capture builders already living in IDEs and terminals (and nudge Gemini CLI users to migrate).
SDK — let you host customized agents on your infra while still using Google's harness.
AI Studio — fastest path from prompt → prototype, now with Kotlin/Android vibe coding, Workspace hooks, Cloud Run deploy, and export full project context to Antigravity.
The loop is deliberate: prototype in AI Studio, harden locally in Antigravity, ship via Cloud/Firebase/Play test track without losing context.

Whether you like that vertical integration depends on how allergic you are to platform bundling — but as a workflow design, it is coherent.

What I think is underrated: "Modern Web Guidance" + Chrome DevTools for agents
The developer keynote recap also tucked in Modern Web Guidance (agent skills for performance, a11y, security aligned with Baseline) and Chrome DevTools for agents (automated audits, emulation, session handoff).

That pair targets a boring, expensive problem: agents that write plausible but sloppy frontend code.

Most agent hype focuses on greenfield apps. Production pain is often "the agent shipped a layout that fails Lighthouse." Shipping vetted skills + DevTools access to the same agent loop might matter more day-to-day than another frontier model for web teams.

This is where realism and aesthetics stop fighting. A beautiful UI that fails performance, accessibility, or mobile proof is not beautiful yet. It is a sketch. The harder bar is creative work that can survive inspection.

Critique: what I would watch before betting production on this
Vendor-shaped agents. Managed Agents are powerful because Google owns the sandbox, tools, and model routing. That is also lock-in. The Antigravity SDK mitigates hosting concerns, but the gravity well is real.

Beta API surface. Interactions schemas and step formats are still moving. Budget refactor time if you build customer-facing features directly on Managed Agents this quarter.

Pricing & quotas. Antigravity's new AI Ultra tier and bonus credits are a launch promotion, not a long-term cost model. Agent loops burn tokens and tool minutes; watch spend on multi-step runs.

"Single API call" marketing vs. operational reality. You still own observability, PII boundaries, prompt injection in tool outputs, and human escalation. Google gives you the runtime — not accountability architecture.

None of that kills the excitement. It just means the mature play is: experiment on Interactions + Managed Agents now; keep stable user flows on generateContent until the beta stabilizes.

Yes
No, exploration orgreenfield
Yes
No
Yes
No
New agentic feature?
Needs production stabilitytoday?
Use generateContent /stable Gemini API path
Spike Interactions API
Need hosted planner +sandbox?
Try Managed Agents
Call Gemini directly withyour own tools
Inspect steps, cost, state,and failure modes
Truth survives telemetry?
Harden into productworkflow
Refactor before shipping
A minimal mental model for your next spike
If you have an afternoon before the challenge deadline (or for a hackathon):

Read the Interactions quickstart.
Try a Managed Agent template in AI Studio (Playground).
Log interaction.steps once — see if the timeline is enough for your UI/debug needs.
Decide whether you need store=false (stateless) or previous_interaction_id (server-side memory).
Only then open Antigravity if you need local iteration.
That order keeps you honest about whether you need Google's harness or just Gemini 3.5 Flash speed.

Bottom line
Google I/O 2026 did not introduce "a faster chatbot." It doubled down on agent infrastructure as a cloud primitive — Managed Agents on a stateful Interactions API, with Antigravity and AI Studio as onboarding ramps.

For developers, the question going into summer 2026 is no longer "which model?" It is "whose runtime owns my tool loop, sandbox, and session history?"

Google just raised their hand loudly on all three.

That is why the strongest agent platforms will not be the ones with the loudest demo reel. They will be the ones where realism accommodates aesthetics: the interface looks good because the underlying system is observable, bounded, and honest about what happened.

Links
I/O 2026 developer highlights
Managed Agents announcement
Interactions API docs
100 announcements roundup
What I/O announcement are you actually building against — Managed Agents, Android vibe coding in AI Studio, or something else? Drop it in the comments; I am curious what survives the hype cycle past June.

Top comments (1)

Some comments may only be visible to logged-in visitors. Sign in to view all comments.