I track AI releases as part of my day jo I have to, because missing a major model update or a new API capability can mean building on stale foundations. December 2025 was the kind of month where I had a tab graveyard of 40+ articles by the 20th. Every major lab shipped something significant. Some of it was genuinely important. Some of it was noise dressed up as progress.
This is my attempt to cut through both. What actually shipped, what it means for developers, and what I'm actually paying attention to going into 2026.
If you want the ongoing daily version of this, the AI & Tech News Blog has been doing solid real-time coverage. But for the month's biggest picture? Here's what I'd tell a colleague over coffee.
The Stat That Framed the Whole Month
Before getting into specific releases: OpenAI's "State of Enterprise AI" report dropped in December and the numbers in it are hard to ignore. Enterprise ChatGPT weekly usage is up 8x year-over-year. Custom GPT usage and Projects usage are up 19x. And 75% of workers using these tools report meaningfully improved output — saving something in the range of 40-60 minutes per day.
The Menlo Ventures data told a similar story: $37B in global enterprise AI spend in 2025, 3.2x what it was in 2024. Eighty percent of enterprises are now buying AI solutions rather than building in-house.
That context matters because it explains the December release frenzy. These companies aren't shipping updates because they feel like it they're responding to genuine demand pressure and competing hard for enterprise budget that is now very real.
Google Had a Strong Month (and I Didn't Expect to Say That)
Honestly? My priors on Google AI have been "late and incremental" for a while. December 2025 updated those priors.
Gemini 3 Flash shipped mid-month. It's a speed-optimized model, but "speed" undersells it, it beat every Gemini 2.5 variant and Gemini 3 Pro on SWE-bench Verified (the coding benchmark developers actually care about). For any developer building apps that need fast reasoning, complex video analysis, or data extraction in real time, this is worth evaluating. The token cost is also lower than the heavier Gemini 3 models, which matters at scale.
The thing that surprised me more than Gemini 3 Flash: Google's A2UI project. This is an open-source tool that generates contextually relevant UIs on the fly based on what an AI agent needs in the current conversation. The example they gave was a restaurant booking agent — instead of going back and forth in text, A2UI generates an actual input form with party size, date, dietary requirements. It's not finished product-level stuff, but the concept of agents dynamically generating their own interfaces is a direction I'm watching closely.
And then there's Antigravity, Google's new agentic development platform built on a VS Code fork from the Windsurf acquisition. Two views: an Editor view for hands-on AI-assisted coding, and a Manager view for orchestrating multiple agents running in parallel across workspaces. The part I keep coming back to: it supports Gemini 3, Claude Sonnet, and OpenAI models simultaneously. Google built a multi-model platform and put their own model in it alongside competitors. That's a strategic bet worth noting.
OpenAI: Platform Move, Not Just Model Move
OpenAI's headline in December was GPT-5.2-Codex, a version of GPT-5.2 specifically optimized for the Codex coding agent. Stronger on large refactors, better context compaction for long-horizon work, meaningfully improved Windows support, and enhanced cybersecurity capabilities (the last one comes with an invite-only program for vetted security researchers — intentionally limited rollout).
But the bigger story isn't the model. It's the ChatGPT App Store.
OpenAI opened the submission portal on December 17, letting external developers get apps reviewed and listed in an in-ChatGPT App Directory. Adobe, Gmail, Replit, those are already in. The store opens to ChatGPT's 800 million users in early 2026. If you build developer tools, this is a distribution channel you need to be thinking about now, not after it launches widely.
The architecture here is clear: OpenAI is building a platform, not just an assistant. The API is for builders who want control. ChatGPT is increasingly becoming the consumer-facing app store. These are two different bets on where the value accrues, and they're running them simultaneously.
Anthropic's Quiet but Important Play: Open Standards
Anthropic didn't have the flashiest December announcement, but what they shipped has long-term implications that I think are underappreciated.
Agent Skills reusable instruction sets that teach Claude specific workflows, domain knowledge, brand standards, tool behaviors, got released as an open standard. Not Anthropic-proprietary. Open. And OpenAI has already adopted structurally identical architecture in ChatGPT and Codex.
This is the MCP (Model Context Protocol) playbook repeating. Anthropic publishes a standard, the ecosystem adopts it, everyone benefits from interoperability. If you're building enterprise workflows on Claude Skills today, there's a good chance those Skills transfer to other platforms. That's a real ROI argument for investing in building them now.
The enterprise implications are immediate: Skills available from Canva, Notion, Figma, Atlassian, Cloudflare, Stripe, and Zapier are in the directory already. If you're doing AI in website development or any kind of workflow automation, this is the place to look before building custom integrations from scratch.
Amazon Quietly Shipped a Lot at re:Invent
AWS re:Invent (December 4–7) gets overshadowed by the OpenAI/Google/Anthropic noise cycle, but Amazon shipped a full model portfolio in December.
Nova 2 lineup: Nova 2 Lite (fast, cost-efficient, handles text/image/video), Nova 2 Pro (high-capacity multimodal for complex tasks), Nova 2 Sonic (speech-to-speech), and Nova 2 Omni (text + images + audio + video simultaneously). All Nova 2 models have built-in web access and code execution baked in.
These are available through Amazon Bedrock. If you're already on AWS infrastructure, this is a genuinely interesting option, especially Nova 2 Omni for multimodal applications where you don't want to stitch together multiple APIs.
I haven't run my own benchmarks against Nova 2 yet, but the price-to-performance positioning is clearly aimed at teams that are hitting cost ceilings on GPT-4-class models. Worth putting on the evaluation list.
The Legal Story That Isn't Going Away
I can't write a December AI recap and skip the copyright litigation, even though it's not my favorite topic.
Investigative journalist John Carreyrou, alongside five other authors, filed suit in December against OpenAI, Google, xAI, Meta, Anthropic, and Perplexity, alleging their copyrighted books were used to train models without permission. What makes this one different from the class action lawsuits we've seen: it's an individual suit, specifically structured to avoid class settlement dynamics and seek up to $150,000 per infringed work.
This matters for developers building AI-powered products because the legal landscape around training data is still genuinely unsettled. Anthropic settled a prior class action for $1.5B. The question of whether using copyrighted material for training constitutes fair use hasn't been definitively resolved. If you're building anything that involves AI-generated content at scale, particularly in media, publishing, or legal contexts, this is a risk to monitor.
The Bigger Pattern: Interoperability Is the Real Story
Step back from any individual release and December 2025 reads as the month the AI industry started betting on interoperability over lock-in.
Anthropic's Agent Skills are an open standard. OpenAI adopted it. Google's Antigravity supports Claude and GPT alongside Gemini. MCP is now implemented across most major platforms. The major providers are making the same calculation: ecosystem growth benefits them more than keeping everything proprietary.
For developers, the practical implication is significant. The "which platform should I build on" question is becoming less binary. You can build integrations and workflows that travel across providers. That's new. And it changes how I'd think about where to put engineering time.
As covered well in Tech and AI News, this platform convergence pattern is showing up across more than just the model layer, it's hitting tooling, infrastructure, and deployment frameworks simultaneously.
What I'm Actually Doing Differently in January
A few things December changed in my personal workflow:
I'm evaluating Gemini 3 Flash for anything latency-sensitive. The SWE-bench numbers earned it a real test, not just a glance.
I'm building at least one Agent Skill for my most-repeated workflows. The interoperability argument is compelling enough that the time investment makes sense now.
I'm watching the ChatGPT App Store closely. If your tools solve developer problems, this is a distribution surface worth preparing for before the full rollout.
And honestly, the pace isn't slowing down. If December felt intense, it's because it was. According to one analysis, it was "the most concentrated burst of AI capability we have ever seen." I don't think January will be quieter.
FAQ
Which December release matters most for web developers specifically?
Practically speaking: Gemini 3 Flash for latency-sensitive AI in website development use cases, and Anthropic's Agent Skills for automating repetitive workflow steps. If you're doing artificial intelligence in web development — generating components, automating content pipelines, building chat interfaces — both are immediately applicable. The ChatGPT App Store is worth watching if you build tools that could reach consumers.
Is the ChatGPT App Store actually open to any developer?
It opened for submissions on December 17, 2025, but it's a review-and-approval process — not instant listing. OpenAI vets apps for compliance and safety. Approved apps go live to users in early 2026. It's not "anyone can publish," but it's much more open than the original closed plugin ecosystem.
Should I be worried about the copyright lawsuits affecting AI tools I already use?
Monitor, but don't panic. These cases take years to resolve, and most AI providers have legal teams specifically handling this. The more practical concern is understanding what your AI provider's terms say about data retention and training — particularly if you're building enterprise applications where your users' data is involved. That's within your control right now; the litigation outcomes aren't.
Is Amazon Nova 2 worth evaluating if I'm already on OpenAI or Anthropic APIs?
Yes, at least for cost benchmarking. If you're running high-volume inference, the price-to-performance on Nova 2 Lite could be meaningful. I'd run a parallel evaluation on your actual workload rather than trusting benchmarks, real-world results vary a lot depending on task type.

Top comments (0)