bebechien for Google AI

Posted on Jun 10 • Originally published at bebechien.github.io

How we built AIventure, an AI-Powered Retro Dungeon

#gamedev #gemini #gemma #ai

Grab a refreshing slice of cold watermelon (because it’s summer in Tokyo and I love it!), and let me tell you a story about how a small team of us turned a wild idea into a living, breathing web game.

If you’ve spent any time in Google I/O over the last few years, you might remember Adventure—the virtual conference technology that powered major Google events during the pandemic. It was vast, beautifully illustrated, and collaborative. But it was also heavy, transient, and required an army of human moderators.

A few months ago, I and Tom started asking ourselves a question: What if we re-imagine Adventure? Instead of a one-off conference, what if we build a cozy, retro persistent world that teach developers the principles of Generative AI not through dry documentation, but through "vibe coding" and gameplay?

That was the spark for AIventure. Here is the tale of how we actually built it.

Step 1: Scribbles and Prototyping in Gemini Canvas

Every good game starts with a prototype. For AIventure, we didn’t start by writing complex engine code. We started by playing around with concepts inside Gemini.

We knew we want a split UI layout: a high-quality frontend framework (Angular) to manage the UI panels and layout, wrapped around a retro canvas powered by Phaser.JS for the actual game engine. This is the same approach we did for our previous I/O demo, the "Living Canvas".

We first built a simple educational puzzle game using various styles. We eventually landed on a top-down retro dungeon approach because it just felt right for the "vibe" we wanted. It was at this point that Tom brought in the brilliant idea of reviving the spirit and aesthetics of his previous project, "Adventure," to give the world its cohesive identity.

Screenshots of Initial prototypes in Gemini Canvas

Step 2: Designing the Rooms (Vibe Coding & Agentic Robots)

Once we knew the foundational logic worked, Tom and I began brainstorming more advanced puzzle mechanics. We didn't want the AI to just act as a lock-and-key mechanism; we wanted to showcase what makes modern LLMs so magical.

That led to two of our favorite creations:

The Vibe Coding Room: We wanted to introduce players to the concept of AI-assisted UI design. In this room, the game opens an iFrame tab. The player gives prompts like, "build a todo app for the chicken that only have eat and sleep buttons". The AI builds a web page on the fly, and the player can immediately see their code and update inside the game environment.

The Agent Puzzle: We introduced a little robot NPC character that accepts natural language instructions and executes them inside the Phaser grid world. If a player says, "Go flip the switch" then the model uses tool-calling and reasoning to break the instruction down into concrete, step-by-step game actions.

Step 3: From Gemini to Gemma

During our initial build and testing phases, we relied on cloud endpoints—specifically pushing prompt requests out to Gemini models via Gemini API to handle player commands and code generation. It worked flawlessly, but we wanted to push the boundaries of what a modern web app could do. We wanted this game to be entirely self-contained and accessible.

Enter Gemma 4.

We began testing the puzzles with Gemma 4 models. We experimented with different sizes (E2B and E4B) and frameworks, including Ollama, LM Studio, Transformers, and Vertex AI, tracking how well the model could handle function-calling and agentic workflows.

Step 4: Bringing it All to the Browser with MediaPipe

The final piece of the puzzle was deployment. How do we share an AI-powered game with millions of developers on the Google Developers site without forcing them to configure cloud backend architectures or input private API keys?

The answer lay in running the model locally, right inside the user's browser.

We used MediaPipe and the LiteRT team's pre-converted formats to host and serve the model distribution right from the website interface. By utilizing client-side web technologies, the entire experience runs locally on your machine. When a player prompts an NPC or writes code in the game, the inference happens entirely client-side. The frontend routes the request through a simple Event Bus directly to the browser-loaded model.

For developers exploring the source code at home, we built toggles so you can easily point the same game loop to a locally hosted LM Studio endpoint or Vertex AI!.

One more thing…

In addition to our current puzzles, we are exploring the multimodal capabilities of Gemma. Although this work is currently in progress, you can experiment with it yourself by cloning our GitHub repository.

What Will You Create Next?

Building AIventure was a journey of bridging old-school game design with modern client-side AI. Seeing a retro game talk to a local open LLM to vibe code inside a dungeon puzzle still feels like magic to us.

We’ve published the demo, complete with developer profile badges to unlock along the way. If you want to check out the solution, inspect our full architecture, or look through our prompts, you can explore the official page right here:

👉 Check out the AIventure Solution

Grab your sword, open up your console, and happy building! 🍉🎮

Top comments (7)

Megz Lawther • Jun 17 • Edited

Hello, I recall AIventure had come out a few months ago, as I viewed the original post and this current presentation here on Dev.to just reminded me!! I wonder — Have you iterated since the original development, implemented any other function calls or any other features? Very cool with Gemma integration. Simple, nostalgic art and a great concept.

bebechien Google AI • Jun 30

Thanks! Since the initial release, I've been testing the multimodal puzzle, and as you can see from the screenshot in this post, it works great. I'll definitely keep exploring new ideas from here. 🫡

Megz Lawther • Jun 30

Cool! 👏🏼:)

Hossein Yazdi • Jun 10 • Edited

This is actually a pretty cool use of AI. I like that the AI isn't just a chatbot here, it's part of the gameplay itself. The agent puzzle with natural language commands sounds especially fun!

Also reminds me of some interesting AI and game dev tools I've come across including Flexbox Froggy and Grid Garden.

Mykola Kondratiuk • Jun 17

Shifting from conference moderators to AI-powered NPCs moves the moderation problem, it does not remove it. You swap human room moderators for guardrail authors watching for prompt exploits. Surface area might be smaller but the failure modes are weirder.

Eva • Jun 11

I use AI to work for me 24*7 for free.

Some comments may only be visible to logged-in visitors. Sign in to view all comments.