<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Rohit Mane</title>
    <description>The latest articles on DEV Community by Rohit Mane (@rohit_mane_0c56c05bfdcc75).</description>
    <link>https://dev.to/rohit_mane_0c56c05bfdcc75</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F1909428%2F2be2b02f-74da-4be0-8d1f-88cf1d894891.jpg</url>
      <title>DEV Community: Rohit Mane</title>
      <link>https://dev.to/rohit_mane_0c56c05bfdcc75</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/rohit_mane_0c56c05bfdcc75"/>
    <language>en</language>
    <item>
      <title>🎭 DreamLens – Turn Any Story Into a Mini Movie What I Built</title>
      <dc:creator>Rohit Mane</dc:creator>
      <pubDate>Mon, 08 Sep 2025 12:05:52 +0000</pubDate>
      <link>https://dev.to/rohit_mane_0c56c05bfdcc75/dreamlens-turn-any-story-into-a-mini-moviewhat-i-built-45b5</link>
      <guid>https://dev.to/rohit_mane_0c56c05bfdcc75/dreamlens-turn-any-story-into-a-mini-moviewhat-i-built-45b5</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fv99irjwuwptretlcwnnr.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fv99irjwuwptretlcwnnr.png" alt=" " width="800" height="800"&gt;&lt;/a&gt;I built DreamLens, a multimodal applet powered by Google AI Studio that transforms any idea, story, or doodle into a short animated movie.&lt;/p&gt;

&lt;p&gt;Everyone has imagination—kids tell fantasy tales, writers create worlds, gamers describe epic battles—but most can’t turn them into visuals and sound. DreamLens solves this by using text, voice, and image understanding to automatically generate storyboards, narration, and background audio.&lt;/p&gt;

&lt;p&gt;How I Built It&lt;/p&gt;

&lt;p&gt;Frontend: React (simple input box + doodle/image upload + voice mic)&lt;/p&gt;

&lt;p&gt;Backend: Python Flask on Cloud Run&lt;/p&gt;

&lt;p&gt;AI: Gemini 2.5 Pro for multimodal input&lt;/p&gt;

&lt;p&gt;Text + Voice processing via Gemini Live API&lt;/p&gt;

&lt;p&gt;Image/doodle understanding via Gemini 2.5 Flash Image&lt;/p&gt;

&lt;p&gt;Script/narration generation via Gemini text model&lt;/p&gt;

&lt;p&gt;Deployment: Google Cloud Run&lt;/p&gt;

&lt;p&gt;Other Tools: Tailwind (UI), Firebase (story saves), optional text-to-speech&lt;/p&gt;

&lt;p&gt;Key Features&lt;/p&gt;

&lt;p&gt;📝 Story-to-Movie → User types or narrates a story, Gemini generates visuals + narration.&lt;/p&gt;

&lt;p&gt;🎤 Voice-to-Story → Kids or users can simply tell a story and see it animated.&lt;/p&gt;

&lt;p&gt;🖼️ Doodle-to-Scene → Upload a doodle/sketch, Gemini turns it into a polished scene.&lt;/p&gt;

&lt;p&gt;🎙️ Custom Narration Styles → Funny, dramatic, bedtime storyteller, etc.&lt;/p&gt;

&lt;p&gt;🎶 Background Music → Automatically matched to story mood (epic, spooky, calm).&lt;/p&gt;

&lt;p&gt;🔄 Interactive Regeneration → Users say “Add a villain,” “Make it funnier” → instantly updated.&lt;/p&gt;

&lt;p&gt;Challenges I Faced&lt;/p&gt;

&lt;p&gt;Synchronizing story + visuals + narration into one flow.&lt;/p&gt;

&lt;p&gt;Handling latency in image + audio generation without breaking immersion.&lt;/p&gt;

&lt;p&gt;Designing an interface that’s both simple for kids and powerful for creators.&lt;/p&gt;

&lt;p&gt;What’s Next&lt;/p&gt;

&lt;p&gt;📚 Education Mode → Teachers can create animated lessons (history, science).&lt;/p&gt;

&lt;p&gt;🎮 Gamers Mode → Auto-generate cinematic recaps of role-playing campaigns.&lt;/p&gt;

&lt;p&gt;📱 Mobile App → “Tell a story, watch it come alive” in seconds on phones.&lt;/p&gt;

&lt;p&gt;🌍 Community Sharing → Users can share their AI mini-movies with friends.&lt;/p&gt;

&lt;p&gt;Submission Category&lt;/p&gt;

&lt;p&gt;🔹 Innovation and Creativity&lt;/p&gt;

&lt;p&gt;🔹 Technical Implementation&lt;/p&gt;

&lt;p&gt;🔹 User Experience&lt;/p&gt;

&lt;p&gt;🔹 Effective Use of Multimodal Features&lt;/p&gt;

&lt;p&gt;Team Members&lt;/p&gt;

&lt;p&gt;Rohit Mane&lt;/p&gt;

</description>
      <category>devchallenge</category>
      <category>googleaichallenge</category>
      <category>ai</category>
      <category>gemini</category>
    </item>
    <item>
      <title>LifeLens – My Multimodal Health &amp; Wellness Companion</title>
      <dc:creator>Rohit Mane</dc:creator>
      <pubDate>Sun, 07 Sep 2025 04:11:55 +0000</pubDate>
      <link>https://dev.to/rohit_mane_0c56c05bfdcc75/lifelens-my-multimodal-health-wellness-companion-42g4</link>
      <guid>https://dev.to/rohit_mane_0c56c05bfdcc75/lifelens-my-multimodal-health-wellness-companion-42g4</guid>
      <description>&lt;p&gt;🚀 LifeLens – My Multimodal Health &amp;amp; Wellness Companion&lt;/p&gt;

&lt;p&gt;This is a submission for the Google AI Studio Multimodal Challenge&lt;/p&gt;

&lt;p&gt;What I Built&lt;/p&gt;

&lt;p&gt;I built LifeLens, a personal health &amp;amp; wellness companion applet powered by Gemini’s multimodal intelligence.&lt;/p&gt;

&lt;p&gt;Instead of juggling separate apps for food logging, fitness, and mood tracking, LifeLens lets you:&lt;/p&gt;

&lt;p&gt;📸 Snap a meal photo → Get calorie &amp;amp; nutrition breakdown&lt;/p&gt;

&lt;p&gt;🏋️ Upload a workout photo/selfie → Detect exercise type and duration&lt;/p&gt;

&lt;p&gt;🎙 Record a voice note → Analyze mood, stress, or energy levels&lt;/p&gt;

&lt;p&gt;✍️ Write a quick journal entry → Extract lifestyle insights (e.g., sleep quality, focus)&lt;/p&gt;

&lt;p&gt;At the end of each day, Gemini compiles everything into a personalized daily health snapshot — a simple, infographic-style report that shows nutrition, activity, and mood in one place.&lt;/p&gt;

&lt;p&gt;The goal: make health tracking frictionless and holistic, using natural inputs (photo, voice, text).&lt;/p&gt;

&lt;p&gt;Demo&lt;/p&gt;

&lt;p&gt;🌐 Live App (Cloud Run):&lt;/p&gt;

&lt;p&gt;🎥 Demo Video: &lt;/p&gt;

&lt;p&gt;🖼 Screenshots:&lt;/p&gt;

&lt;p&gt;Meal photo → nutrition analysis&lt;/p&gt;

&lt;p&gt;Workout photo → activity detection&lt;/p&gt;

&lt;p&gt;Voice note → stress/mood insights&lt;/p&gt;

&lt;p&gt;Final daily health report (infographic)&lt;/p&gt;

&lt;p&gt;How I Used Google AI Studio&lt;/p&gt;

&lt;p&gt;I used Google AI Studio with Gemini 2.5 to power all the multimodal intelligence in LifeLens:&lt;/p&gt;

&lt;p&gt;Image understanding → Meal &amp;amp; workout detection&lt;/p&gt;

&lt;p&gt;Audio analysis → Emotional tone &amp;amp; mood recognition&lt;/p&gt;

&lt;p&gt;Text extraction → Journaling insights + summarization&lt;/p&gt;

&lt;p&gt;Multimodal fusion → Combine all three inputs into one personalized daily report&lt;/p&gt;

&lt;p&gt;The best part is that Gemini can process mixed inputs in a single prompt, so the final report feels contextual and cohesive rather than three disconnected analyses.&lt;/p&gt;

&lt;p&gt;Multimodal Features&lt;/p&gt;

&lt;p&gt;Here’s how multimodality shines in LifeLens:&lt;/p&gt;

&lt;p&gt;📸 Food recognition – Instead of manually typing calories, just take a picture.&lt;/p&gt;

&lt;p&gt;🏋️ Exercise recognition – Gemini detects workouts visually.&lt;/p&gt;

&lt;p&gt;🎙 Mood detection – Voice notes capture feelings more authentically than typing.&lt;/p&gt;

&lt;p&gt;✍️ Lifestyle journaling – Short reflections provide context Gemini can analyze.&lt;/p&gt;

&lt;p&gt;📊 Daily Health Report – Combines everything into a single, easy-to-read summary (optionally with an AI-generated infographic).&lt;/p&gt;

&lt;p&gt;Together, these make LifeLens a natural and engaging way to track health, lowering the barrier for consistent use.&lt;/p&gt;

&lt;p&gt;Why It Matters&lt;/p&gt;

&lt;p&gt;Most people abandon health apps because logging feels like work.&lt;/p&gt;

&lt;p&gt;LifeLens makes tracking feel like living your day normally: take a picture, talk to your phone, write a sentence.&lt;/p&gt;

&lt;p&gt;Gemini does the heavy lifting, turning it into insightful, actionable feedback.&lt;/p&gt;

&lt;p&gt;Roadmap&lt;/p&gt;

&lt;p&gt;If I expand this applet further, I’d add:&lt;/p&gt;

&lt;p&gt;📱 Wearable integration (steps, heart rate, sleep)&lt;/p&gt;

&lt;p&gt;🎮 Gamification with streaks &amp;amp; achievements&lt;/p&gt;

&lt;p&gt;🌍 Multilingual support for accessibility&lt;/p&gt;

&lt;p&gt;🤝 Community features for sharing anonymized health highlights&lt;/p&gt;

&lt;p&gt;Team&lt;/p&gt;

&lt;p&gt;Solo project by @rohitsmane57-glitch&lt;/p&gt;

</description>
      <category>devchallenge</category>
      <category>googleaichallenge</category>
      <category>ai</category>
      <category>gemini</category>
    </item>
  </channel>
</rss>
