Google I/O 2026 is four days out. If you ship anything that touches a Google model, the keynote is about to invalidate capability assumptions your team's approvals were written against. I have done this dance for six straight years of keynotes, and the pattern is always the same: an engineer pastes a Verge link into Slack on Wednesday, someone says "we should look at this", and three weeks later a capability we never reviewed has quietly shipped inside a tool we already authorized.
This year I tried something different. Friday morning I sat down with a coffee and wrote a five-line watchlist for the keynote - vendor signals plus one specific engineering action per signal. The exercise was small enough to feel silly. Then I realized the format is reusable for every keynote that follows. AWS re:Invent. Microsoft Build. OpenAI DevDay.
Sharing it here because if you have not built one yet, the next four days are your peak window.
What vendor product observability actually means
Most engineering teams I work with have great downstream observability. Uptime, error rates, model latency, queue depth. The ops team set it up. Dashboards exist.
Ask the same teams what they observe about their vendors and the answer goes quiet. The signals that reshape what your stack can do - a keynote, a model release, a quietly retired chatbot, a market position shift - arrive through press coverage, not through a dashboard. There is no Datadog for vendor product strategy. The closest substitute is a small, named list of signals you commit to reading on the day of the keynote.
That list is the watchlist. It is not glamorous. It is a five-row markdown file. The discipline is that it exists.
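For concreteness, the whole artifact can be this small - five rows, one action each. The rows below match the signals in the rest of this post:

```
# Keynote watchlist - Google I/O 2026
| # | Signal                 | Engineering action on keynote day                                |
|---|------------------------|------------------------------------------------------------------|
| 1 | OS-layer delivery      | Pull fleet intersection: AI Pro/Ultra subscriber + Android 12+   |
| 2 | Model capability delta | List Google-model tools; write delta; re-review past threshold   |
| 3 | A2A perimeter          | Draft one-page agent-to-agent permission policy, same day        |
| 4 | Vendor position drift  | Add "market position when approved" field to approval template   |
| 5 | Product retirement     | Scan session notes for deprecated/retired/replaced-by language   |
```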
Signal 1: OS-layer ambient intelligence as a new delivery category
The Android Show I/O Edition aired last week. Gemini Intelligence ships in June as the intelligence layer running below the Android app surface - booking, browsing, form completion - with access to email, calendar, messages. Rolling out as an OS update to any Android 12+ device on the AI Pro or Ultra tier.
If your team builds on Android, this matters because most authorization patterns cover two delivery categories: agents your team deployed, and agents a vendor embedded in a tool your team authorized. OS-layer intelligence is a third category, and it does not arrive through your authorization pipeline. It arrives through the OS update channel.
Engineering action at I/O: Pull the list of Android devices in your fleet that sit in the intersection of "AI Pro or Ultra subscriber" and "Android 12+". That intersection is the scope of devices where the OS now has agent capabilities your data flow assumptions never accounted for.
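A minimal sketch of that pull, assuming your MDM can export a device inventory as CSV - the column names and tier labels here are hypothetical and will differ per MDM vendor:

```python
import csv

def agent_capable_devices(inventory_csv: str) -> list[str]:
    """Devices where the OS update gives the vendor agent capabilities.

    Assumes hypothetical columns "device_id", "os_version", "ai_tier".
    """
    in_scope = []
    with open(inventory_csv, newline="") as f:
        for row in csv.DictReader(f):
            android_12_plus = int(row["os_version"].split(".")[0]) >= 12
            paid_tier = row["ai_tier"] in {"AI Pro", "Ultra"}
            if android_12_plus and paid_tier:
                in_scope.append(row["device_id"])
    return in_scope

if __name__ == "__main__":
    devices = agent_capable_devices("device_inventory.csv")
    print(f"{len(devices)} devices now in OS-agent scope")
```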
Signal 2: Model capability delta as silent capability inheritance
When a frontier model jumps capability, every downstream tool that runs on it inherits the new floor on keynote day. If you have authorized AI tools whose vendor swapped in a Google model, those tools now have capabilities you never approved against, and you may not even know which ones.
Engineering action at I/O: Make a one-column list of authorized AI tools your team uses that run on Google models. After the keynote, write the capability delta beside each one. The delta column is the trigger for re-review. If the delta crosses a sensitivity threshold (PII handling, code execution, autonomous file I/O), the tool needs a fresh approval pass.
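If a flat file feels too loose, the same list fits in a few lines of Python. Capability names below are illustrative, not a real taxonomy:

```python
from dataclasses import dataclass, field

# Sensitivity thresholds from the paragraph above: crossing any of these
# triggers a fresh approval pass.
SENSITIVE_CAPABILITIES = {"pii_handling", "code_execution", "autonomous_file_io"}

@dataclass
class AuthorizedTool:
    name: str
    model: str                                  # Google model the vendor runs on
    approved_capabilities: set[str]
    post_keynote_capabilities: set[str] = field(default_factory=set)

    def capability_delta(self) -> set[str]:
        return self.post_keynote_capabilities - self.approved_capabilities

    def needs_reapproval(self) -> bool:
        return bool(self.capability_delta() & SENSITIVE_CAPABILITIES)

# Usage: fill in post_keynote_capabilities after the keynote, then filter.
tools = [AuthorizedTool("support-summarizer", "gemini", {"summarization"})]
for t in tools:
    t.post_keynote_capabilities = {"summarization", "code_execution"}  # example delta
    if t.needs_reapproval():
        print(f"re-review: {t.name}, delta={t.capability_delta()}")
```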
I tried this exercise last month with the Anthropic Opus 4.7 200k context window release. The finding surprised me: capping context at 200k produced better spec quality on agent reviews than running uncapped. Context window length is a quality lever, not just a capacity dial. The delta is not always "more is more."
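Capping is cheap to test on your own workload. A crude sketch, assuming you order context chunks most-relevant-first and estimate tokens at roughly four characters each - swap in your provider's real tokenizer if it ships one:

```python
def cap_context(chunks: list[str], max_tokens: int = 200_000) -> list[str]:
    """Keep the most relevant chunks that fit a fixed token budget.

    Token counts are estimated at ~4 characters per token, which is
    approximate; this is a sketch of the experiment, not a library call.
    """
    budget_chars = max_tokens * 4
    kept, used = [], 0
    for chunk in chunks:          # assumes most-relevant-first ordering
        if used + len(chunk) > budget_chars:
            break
        kept.append(chunk)
        used += len(chunk)
    return kept
```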
Signal 3: A2A protocol updates as a multi-agent perimeter change
Google has been moving toward agent interoperability standards for two quarters. Any A2A protocol announcement at I/O reshapes the question of which authorized agents are allowed to talk to which other agents.
This is the question most engineering teams treat as an edge case until production forces it. Agents are still mostly designed as endpoints with their own auth. The graph view - what agent A is permitted to call agent B for, under what conditions, with what data - is rarely written down before a multi-agent stack hits real users.
Engineering action at I/O: If an A2A spec or protocol lands, write the team's permission policy for agent-to-agent communication on the same day. One page. It does not have to be production-ready. It has to exist before the protocol shows up in the SDK you already use.
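The page can literally be a default-deny table. A sketch with hypothetical agent names, purposes, and data classes:

```python
# One-page policy as data, not prose. Every entry below is a placeholder.
A2A_POLICY = {
    # (caller, callee): what the call is allowed to be for, and with what data
    ("scheduler-agent", "email-agent"): {
        "purposes": {"send_meeting_invite"},
        "data_classes": {"calendar"},          # explicitly not "pii"
    },
}

def a2a_allowed(caller: str, callee: str, purpose: str, data_class: str) -> bool:
    """Default-deny: agent pairs not listed in the policy cannot talk."""
    rule = A2A_POLICY.get((caller, callee))
    if rule is None:
        return False
    return purpose in rule["purposes"] and data_class in rule["data_classes"]

assert a2a_allowed("scheduler-agent", "email-agent", "send_meeting_invite", "calendar")
assert not a2a_allowed("scheduler-agent", "email-agent", "send_meeting_invite", "pii")
```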
Signal 4: Vendor market position drift
Axios reported on May 13 that Anthropic has overtaken OpenAI in workplace adoption. Take a moment with that. If your team's vendor selection rationale was written when OpenAI was the dominant workplace AI, your approval rationale references a market position that is no longer current.
Most approval artifacts have no field for "market position when approved" and no scheduled review trigger for "market position has shifted since." The approval ages silently. Two years from now somebody will ask why the team is on the second-place vendor and the honest answer will be "nobody asked us to update the rationale when the market moved."
Engineering action at I/O: Add one field to your vendor approval template - Market position when approved. The field is the trigger. The next position shift now has a review path baked in. This costs five minutes and pays for itself the first time a budget review asks the question.
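If the approval template lives in a schema rather than a doc, the field is one line. Values here are illustrative:

```python
from dataclasses import dataclass
from datetime import date

@dataclass
class VendorApproval:
    vendor: str
    approved_on: date
    rationale: str
    # The new field. Free text is fine; the point is that the next
    # market shift makes this line visibly stale and triggers a review.
    market_position_when_approved: str

approval = VendorApproval(
    vendor="ExampleVendor",
    approved_on=date(2024, 3, 1),
    rationale="dominant workplace AI at time of selection",
    market_position_when_approved="#1 in workplace adoption",
)
```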
Signal 5: Vendor product retirement as an agent migration event
Amazon announced this week it is retiring Rufus and replacing it with the Alexa shopping agent. This is the rehearsal for the next ten retirements. Every enterprise AI vendor will eventually retire the chatbot you authorized and ship an agent successor with a different operating envelope.
Most engineering teams treat the successor as a drop-in. It is not. The operating envelope is different. The data flow is different. The failure mode is different. If your team authorized the predecessor, the successor needs a fresh review - not a carryover.
Engineering action at I/O: Watch session notes for "deprecated", "retired", "replaced by" language. Every retirement of a product your team authorized triggers a fresh review pass on the successor. Block the calendar before the announcement, not after.
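Even a grep-grade scan catches most of these. A sketch, assuming you dump session notes into plain text - the product names are placeholders for whatever your team has actually authorized:

```python
import re

RETIREMENT_LANGUAGE = re.compile(r"\b(deprecated|retired|replaced by)\b", re.IGNORECASE)

# Products your team has authorized; placeholder names.
AUTHORIZED_PRODUCTS = {"Rufus", "SomeVendorChatbot"}

def retirement_hits(session_notes: str) -> list[str]:
    """Flag authorized products mentioned on the same line as retirement language."""
    hits = []
    for line in session_notes.splitlines():
        if RETIREMENT_LANGUAGE.search(line):
            hits.extend(p for p in AUTHORIZED_PRODUCTS if p in line)
    return hits
```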
The reusable shape
The five signals are not the point. The point is that writing them down on Friday changes how the keynote reads on Tuesday. The keynote stops being news and becomes a scheduled review trigger with a defined output - an updated stack policy.
The same five categories apply to AWS re:Invent, Microsoft Build, OpenAI DevDay. OS-layer delivery. Model capability delta. A2A perimeter. Vendor position drift. Product retirement. Five rows. One engineering action per row. Re-read four times a year.
The thirty thousand Claude-certified consultants PwC just announced make the point sharper, not weaker. Every keynote lands against a workforce that already has the tooling. The watchlist is not optional anymore.
Question
If you maintain a watchlist for major vendor keynotes - or have ever wished you had one - what signal would you add as the sixth row? I want to compile what you all post into a single follow-up before the keynote.
Tags: #ai #googleio #productivity #discuss