DEV Community

Cover image for Gemini 3.1 Pro: A smarter model for your most complex tasks
Logan Kilpatrick for Google AI

Posted on • Originally published at blog.google

Gemini 3.1 Pro: A smarter model for your most complex tasks

Last week, we released a major update to Gemini 3 Deep Think to solve modern challenges across science, research and engineering. Today, we’re releasing the upgraded core intelligence that makes those breakthroughs possible: Gemini 3.1 Pro. We are shipping 3.1 Pro across our consumer and developer products to bring this progress in intelligence to your everyday applications.

Starting today, 3.1 Pro is rolling out:

Building on the Gemini 3 series, 3.1 Pro represents a step forward in core reasoning. 3.1 Pro is a smarter, more capable baseline for complex problem-solving. This is reflected in our progress on rigorous benchmarks. On ARC-AGI-2, a benchmark that evaluates a model’s ability to solve entirely new logic patterns, 3.1 Pro achieved a verified score of 77.1%. This is more than double the reasoning performance of 3 Pro.
Side-by-side comparison of different benchmarks for AI models.

Intelligence applied

3.1 Pro is designed for tasks where a simple answer isn’t enough, taking advanced reasoning and making it useful for your hardest challenges. This improved intelligence can help in practical applications — whether you’re looking for a clear, visual explanation of a complex topic, a way to synthesize data into a single view, or bringing a creative project to life.

Code-based animation: 3.1 Pro can generate website-ready, animated SVGs directly from a text prompt. Because these are built in pure code rather than pixels, they remain crisp at any scale and maintain incredibly small file sizes compared to traditional video.

Complex system synthesis: 3.1 Pro utilizes advanced reasoning to bridge the gap between complex APIs and user-friendly design. In this example, the model built a live aerospace dashboard, successfully configuring a public telemetry stream to visualize the International Space Station’s orbit.

Interactive design: 3.1 Pro codes a complex 3D starling murmuration. It doesn't just generate the visual code; it builds an immersive experience where users can manipulate the flock with hand-tracking and listen to a generative score that shifts based on the birds’ movement. For researchers and designers, this provides a powerful way to prototype sensory-rich interfaces.

Creative coding: 3.1 Pro can translate literary themes into functional code. When prompted to build a modern personal portfolio for Emily Brontë’s "Wuthering Heights," the model didn’t just summarize the text. It reasoned through the novel’s atmospheric tone to design a sleek, contemporary interface, creating a website that captures the essence of the protagonist.

What’s next

Since releasing Gemini 3 Pro in November, your feedback and the pace of progress have driven these rapid improvements. We are releasing 3.1 Pro in preview today to validate these updates and continue to make further advancements in areas such as ambitious agentic workflows before we make it generally available soon.

Starting today, Gemini 3.1 Pro in the Gemini app is rolling out with higher limits for users with the Google AI Pro and Ultra plans. 3.1 Pro is also now available on NotebookLM exclusively for Pro and Ultra users. And developers and enterprises can access 3.1 Pro now in preview in the Gemini API via AI Studio, Antigravity, Vertex AI, Gemini Enterprise, Gemini CLI and Android Studio.

We can’t wait to see what you build and discover with it.

Top comments (7)

Collapse
 
youngfra profile image
Fraser Young

Really excited to test this on svg animations like that

Collapse
 
ben profile image
Ben Halpern

Giving it some run in Antigravity, so far so good

Collapse
 
harsh2644 profile image
Harsh

The Antigravity integration is interesting — but what really caught my attention is how 3.1 Pro handles multi-step reasoning compared to 2.5 Pro.

For agentic workflows where the model needs to call tools sequentially (think: search → analyze → write → validate), are there improvements in maintaining context across steps? Would love to see benchmarks on tool-calling accuracy.

@ben since you're testing in Antigravity, have you noticed any difference in how it handles multi-turn tool calls vs 2.5 Pro?

Collapse
 
harsh2644 profile image
Harsh

The Vertex AI integration is what caught my attention. For teams building agentic workflows, having 3.1 Pro natively available in the same ecosystem as our training pipelines is huge.

Curious about the context window improvements — is it still 1M tokens? Also, any changes in function calling reliability compared to 2.5 Pro?

Collapse
 
theycallmeswift profile image
Swift

That jump in reasoning performance is massive! Can't wait to try it out!

Collapse
 
lukeponga profile image
Luke Ponga

Really cool to play around with!

Collapse
 
bingkahu profile image
bingkahu (Matteo)

Looks great, can't wait to play around with it!