DEV Community

Cover image for Gemini Tales: Turning Screen Time Into Active Adventure๐Ÿงธ
 Veronika Kashtanova
Veronika Kashtanova

Posted on • Edited on

Gemini Tales: Turning Screen Time Into Active Adventure๐Ÿงธ

Education Track: Build Multi-Agent Systems with ADK

This post is my submission for DEV Education Track: Build Multi-Agent Systems with ADK.

What I Built

I built Gemini Tales, an interactive storytelling experience that blends real-time AI conversation with physical activity. It solves a haunting statistic: 80% of children today don't move enough. While technology is often seen as the cause of sedentary behavior, I wanted to turn the screen into a catalyst for movement.

๐Ÿ“น Watch the Vision: See how we turn sedentary screen time into an active adventure.

Gemini Tales doesn't just tell a storyโ€”it sees your child, hears their voice, and asks them to ACT. Every physical movement becomes part of the magic.

Wizard casting magic with children in a cozy living room, golden sparkles and stars filling the air.

Cloud Run Embed

The project is currently running in Google Cloud Run:


๐Ÿงš The Experience: Live Multimodal Storytelling

The frontend is a direct bridge to Gemini Live API, enabling unified Voice + Vision interaction in real-time.

Features That Create Magic โœจ

Feature What It Does Tech Stack
๐ŸŽ™๏ธ Stable Voice Live Interruption-aware, low-latency conversation. Child speaks and changes story path anytime. Gemini Live API
๐Ÿ“ธ Visual Awareness Real-time video stream (1 FPS) lets AI "see" costumes, toys, movements. Gemini 2.5 Flash Native Vision
๐ŸŽจ Dynamic Illustrations Watercolor-style art that evolves with the plot. Gemini 3.1 Flash Preview
โšก Agent-Driven Context Before the story starts, Storysmith agents research & craft unique plot. Google ADK Backend
๐ŸŽฎ Interactive Challenges AI pauses story for "Hero's Challenges"โ€”physical actions detected via video. Computer Vision

๐Ÿค– The Brain: Multi-Agent Story Engine

The backend is a distributed multi-agent system built with the Google Agent Development Kit (ADK) and the A2A (Agent-to-Agent) protocol. This ensures specialization, reliability, and scalability.

๐ŸŽญ Meet the Agents

Agent Role Specialty Model
๐Ÿ” Adventure Seeker Physical activity planning Multi-step reasoning for movement-based quests Gemini 2.5 Flash + Google Search
โš–๏ธ Guardian of Balance Safety gatekeeper Strict age-appropriate & movement-focused validation Gemini 2.5 Flash (Temp 0.1)
โœ๏ธ Storysmith Narrative weaver Advanced storytelling & character depth Gemini 2.5 Pro
๐Ÿช„ Orchestrator Pipeline coordinator Manages state & flow between agents Gemini 2.5 Flash

Architecture Highlights

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚    Frontend (React + Live API)       โ”‚
โ”‚  Voice โ€ข Vision โ€ข Real-time Feedback โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
                 โ”‚ WebSocket
โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ–ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚   API Gateway (FastAPI)              โ”‚
โ”‚   Authentication & Rate Limiting     โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
                 โ”‚ A2A Protocol
    โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
    โ”‚            โ”‚            โ”‚            โ”‚
โ”Œโ”€โ”€โ”€โ–ผโ”€โ”€โ”  โ”Œโ”€โ”€โ”€โ”€โ”€โ–ผโ”€โ”€โ”  โ”Œโ”€โ”€โ”€โ”€โ”€โ–ผโ”€โ”€โ”  โ”Œโ”€โ”€โ”€โ”€โ–ผโ”€โ”€โ”€โ”
โ”‚Seekerโ”‚  โ”‚Guardianโ”‚  โ”‚Storysmth   Orch.   โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”˜  โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜  โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜  โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜
Enter fullscreen mode Exit fullscreen mode

For a detailed deep-dive into the system design, see ARCHITECTURE.md.

Young knight Lily with sword on a magical meadow path, mushroom houses and flowers surrounding her.


๐Ÿš€ From Tutorial to Hackathon Reality

This project started as a journey through the Build Multi-Agent Systems with ADK track. The tutorial taught me Agent-to-Agent (A2A) communication and state-based orchestrationโ€”the exact foundation I needed for my entry in the Gemini Live Agent Challenge ๐ŸŽฌ

I took those core architectural patterns and pivoted toward something bigger: an AI Nanny that inspires children to move. The same ADK logic that powers educational tools now powers interactive, movement-based storytelling.


๐Ÿ› ๏ธ Tech Stack

Layer Technology
Frontend React 19, Vite, TypeScript, Tailwind CSS
AI Intelligence Gemini Live API, Gemini 2.5 Flash/Pro, Gemini 3.1 Flash Preview
Backend Framework Google ADK, Agent-to-Agent (A2A) Protocol
Infrastructure FastAPI (Python 3.12), WebSockets, Google Cloud Run
Observability OpenTelemetry, Google Cloud Trace
Dev Tools Antigravity IDE for agentic orchestration

๐Ÿ“‚ Getting Started

Prerequisites

  • Python 3.10+ & Node.js 20+
  • uv for Python dependency management
  • Google Cloud Project with Vertex AI enabled

1. Backend Launch

Easy Mode (Windows):

# Starts all 5 services with automatic cleanup
.\run_local.ps1
Enter fullscreen mode Exit fullscreen mode

Manual Launcher:

# Start distributed agents (Leaf Services)
uv run shared/adk_app.py agents/researcher --port 8001 --a2a
uv run shared/adk_app.py agents/judge --port 8002 --a2a
uv run shared/adk_app.py agents/content_builder --port 8003 --a2a

# Start Orchestrator & API Gateway
uv run shared/adk_app.py agents/orchestrator --port 8004
uv run app/main.py
Enter fullscreen mode Exit fullscreen mode

2. Frontend Launch

cd frontend
npm install
npm run dev
Enter fullscreen mode Exit fullscreen mode

Visit http://localhost:5173 and start creating stories! โœจ


๐Ÿ“š Key Learnings

Specialization > Monoliths

I was surprised at how much more reliable the system became when I stopped relying on one giant prompt and started treating agents like a specialized team with distinct responsibilities.

The Power of A2A Protocol

Implementing Agent-to-Agent communication was challenging, especially handling Google Application Default Credentials (ADC) on Windows. But once it clicked, the elegance of distributed agents became clear.

Movement Changes Everything

The most rewarding part? Seeing a child leap off the couch when the AI asked them to "show me how you jump." Screen time transformed from sedentary consumption into active play.


๐Ÿ“‚ Open Source & Reproducibility

The full source codeโ€”including ADK orchestration logic, deployment scripts, and frontendโ€”is available on GitHub:

๐Ÿ‘‰ GitHub: vero-code/gemini-tales

Features:

  • โœ… Full Docker & Cloud Run deployment
  • โœ… Multi-agent architecture with A2A protocol
  • โœ… Live API integration with fallbacks
  • โœ… Comprehensive ARCHITECTURE.md for deep-dives

๐Ÿšง Current Status & What's Next

Gemini Tales is in active development for the Gemini Live Agent Challenge. The multi-agent backend is fully operational, and I'm continuously refining:

  • ๐ŸŽฌ Multimodal streaming: Synchronizing text, generated images, and voice.
  • ๐ŸŽฏ Real-time movement detection: Analyzing video feed for physical actions.
  • ๐Ÿ† Gamification System: Badges and rewards to motivate physical activity.

New updates coming soon:

  • ๐Ÿ›ก๏ธ Strict Interaction Logic: Implementing "anti-cheat" verification so the agent visually confirms actions instead of relying on voice.
  • โšก Performance: Optimizing agent loading times for instant storytelling.
  • ๐ŸŽฅ Live demo video showcasing the full experience.

Stay tuned as we turn screen time into move time! ๐Ÿš€๐Ÿงธ


๐ŸŽฏ Why This Matters

Technology is often the villain in this story. But what if it could be the hero?

Gemini Tales proves that with the right architecture and intention, we can build AI experiences that:

  • โœ… Entertain (magical storytelling)
  • โœ… Engage (real-time interaction)
  • โœ… Activate (physical movement required)
  • โœ… Educate (safe, age-appropriate learning)

This is technology in service of human health.


๐Ÿ“œ License

MIT โ€” See LICENSE

Created with โค๏ธ for the next generation of active explorers.


Tags: #GeminiLiveAgentChallenge #GoogleAI #MultiAgentSystems #ADK #ChildHealth #InteractiveTech

Top comments (2)

Collapse
 
jess profile image
Jess Lee

How's it going?

Collapse
 
vero-code profile image
Veronika Kashtanova

Great, just added the embed + dev label. Thanks for checking in! ๐Ÿ™Œ