DEV Community

Rohit Mane

🎭 DreamLens – Turn Any Story Into a Mini Movie

Google AI Challenge Submission

What I Built

I built DreamLens, a multimodal applet powered by Google AI Studio that transforms any idea, story, or doodle into a short animated movie.

Everyone has imagination: kids tell fantasy tales, writers create worlds, gamers describe epic battles. But most people can’t turn those ideas into visuals and sound. DreamLens addresses this by using text, voice, and image understanding to automatically generate storyboards, narration, and background audio.

How I Built It

Frontend: React (simple input box + doodle/image upload + voice mic)

Backend: Python Flask on Cloud Run

AI: Gemini 2.5 Pro for multimodal input

Text + Voice processing via Gemini Live API

Image/doodle understanding via Gemini 2.5 Flash Image

Script/narration generation via Gemini text model

Deployment: Google Cloud Run

Other Tools: Tailwind (UI), Firebase (story saves), optional text-to-speech
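
As a rough illustration of how the three input types listed above could be merged before the script/narration step, here is a minimal sketch. It is not the actual DreamLens code and does not call the Gemini SDK: the `StoryRequest` fields and the `build_story_prompt` helper are hypothetical names, and the voice transcript and doodle description are assumed to have been produced by earlier model calls.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class StoryRequest:
    """Hypothetical request shape: any combination of the three inputs."""
    text: Optional[str] = None
    voice_transcript: Optional[str] = None    # assumed output of the voice step
    doodle_description: Optional[str] = None  # assumed output of the image step


def build_story_prompt(req: StoryRequest, style: str = "bedtime storyteller") -> str:
    """Merge whichever inputs are present into one prompt for the script model."""
    parts = []
    if req.text:
        parts.append(f"Typed story: {req.text.strip()}")
    if req.voice_transcript:
        parts.append(f"Spoken story: {req.voice_transcript.strip()}")
    if req.doodle_description:
        parts.append(f"Doodle shows: {req.doodle_description.strip()}")
    if not parts:
        raise ValueError("at least one of text, voice, or doodle is required")
    header = f"Write a short narrated script in a {style} style."
    return "\n".join([header, *parts])
```

Keeping the prompt assembly in one pure function like this would make it easy to swap narration styles (funny, dramatic, bedtime storyteller) without touching the model call itself.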

Key Features

📝 Story-to-Movie → User types or narrates a story, Gemini generates visuals + narration.

🎤 Voice-to-Story → Kids or users can simply tell a story and see it animated.

🖼️ Doodle-to-Scene → Upload a doodle/sketch, Gemini turns it into a polished scene.

🎙️ Custom Narration Styles → Funny, dramatic, bedtime storyteller, etc.

🎶 Background Music → Automatically matched to story mood (epic, spooky, calm).

🔄 Interactive Regeneration → Users say “Add a villain” or “Make it funnier” and the movie updates instantly.
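
The post does not say how the background music is matched to the story mood. One very simple possibility is keyword scoring, sketched below. The three mood names come from the feature list; the keyword sets and the `pick_mood` function are invented for illustration, and a real system might instead ask the model to label the mood directly.

```python
import re

# Hypothetical keyword lists; only the mood names come from the post.
MOOD_KEYWORDS = {
    "epic": {"battle", "dragon", "hero", "sword", "quest"},
    "spooky": {"ghost", "haunted", "dark", "shadow", "witch"},
    "calm": {"sleep", "meadow", "gentle", "quiet", "moon"},
}


def pick_mood(story: str, default: str = "calm") -> str:
    """Return the mood whose keywords appear most often in the story."""
    words = re.findall(r"[a-z']+", story.lower())
    scores = {
        mood: sum(word in keywords for word in words)
        for mood, keywords in MOOD_KEYWORDS.items()
    }
    best = max(scores, key=scores.get)
    # Fall back to the default when no keyword matched at all.
    return best if scores[best] > 0 else default
```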

Challenges I Faced

Synchronizing story + visuals + narration into one flow.

Handling latency in image + audio generation without breaking immersion.

Designing an interface that’s both simple for kids and powerful for creators.
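
The first two challenges can be sketched together under some assumptions: request each scene's image and narration concurrently to reduce waiting, then let the narration length decide how long each scene stays on screen. This is only an illustration, not the DreamLens implementation. `fake_image` and `fake_narration` are stand-ins for the real model calls, and the 0.4 seconds per word is an arbitrary placeholder.

```python
import asyncio
from dataclasses import dataclass


@dataclass
class SceneAssets:
    text: str
    image: str         # placeholder for image bytes or a URL
    audio_secs: float  # narration length decides how long the scene is shown


async def fake_image(text: str) -> str:
    await asyncio.sleep(0.01)  # stands in for a slow image-generation call
    return f"image({text})"


async def fake_narration(text: str) -> float:
    await asyncio.sleep(0.01)  # stands in for a slow audio-generation call
    return round(len(text.split()) * 0.4, 2)  # assumed 0.4 s per word


async def build_scene(text: str) -> SceneAssets:
    # Image and narration for one scene are requested at the same time.
    image, secs = await asyncio.gather(fake_image(text), fake_narration(text))
    return SceneAssets(text, image, secs)


async def build_timeline(scenes: list[str]) -> list[tuple[float, SceneAssets]]:
    # All scenes run concurrently; gather returns results in input order.
    assets = await asyncio.gather(*(build_scene(s) for s in scenes))
    timeline, start = [], 0.0
    for scene in assets:
        timeline.append((start, scene))  # scene starts when the previous ends
        start = round(start + scene.audio_secs, 2)
    return timeline
```

Calling `asyncio.run(build_timeline(["A dragon wakes up", "It bakes a cake"]))` yields a list of `(start_time, SceneAssets)` pairs that a player could step through, with visuals and narration sharing one clock.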

What’s Next

📚 Education Mode → Teachers can create animated lessons (history, science).

🎮 Gamers Mode → Auto-generate cinematic recaps of role-playing campaigns.

📱 Mobile App → “Tell a story, watch it come alive” in seconds on phones.

🌍 Community Sharing → Users can share their AI mini-movies with friends.

Submission Category

🔹 Innovation and Creativity

🔹 Technical Implementation

🔹 User Experience

🔹 Effective Use of Multimodal Features

Team Members

Rohit Mane
