William Henry King

Posted on Sep 14

Echo Location Project [Google AI Studio Multimodal Challenge]

#devchallenge #googleaichallenge #ai #gemini

Google AI Challenge Submission

🎯 This is a submission for the Google AI Studio Multimodal Challenge 🚀

🌿🔍 Echo Location Project 🦅✨

🚀 What I Built

I built 🌟 Echo Location 🌟, a web app that reframes the animal identifier as an interactive, purpose-driven conservation quest! 🎯🌍 The experience isn't just about answering "What animal is this?" 🤔; it's about addressing the deeper disconnect between humanity and the natural world! 💚🌱

🚨 It's designed primarily for mobile, but it also works on desktop and tablet. 📱💻📲

🎪✨🌟⭐💫⚡🔥💥🎉🎊🌈✨

🌟 The Experience 🌟

Echo Location transforms every user into an "Eco-Scout" 🧭🌿 on a mission! When a user uploads a photo 📸, video 🎥, or even an audio clip 🎵 of wildlife, they're not just getting a name. They're filing a "Sighting Report" 📋 that initiates an immersive learning journey! 🎓✨

🤖 TECH MAGIC: The app uses Gemini 2.5 Pro and Flash 🧠⚡ to perform a deep multimodal analysis!

🎯 Generated Field Report Features:

🦆 The animal's story

🚨 Official conservation status

⚠️ Primary threats

📍 Simulated geolocation based on its environment

🎮 Gamification Elements:

This journey is gamified through a "Ranger's Field Journal" 📖 where users:

📈 Level up
🏅 Earn badges for completing ecosystem collections
🔓 Unlock "Hope Spotlights"—real stories of conservation success! 🌟
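To make the journal mechanics concrete, here is a minimal sketch of how leveling and collection badges could work. The XP values, the level formula, and the two example collections are assumptions made for illustration; the post does not describe the app's actual rules, and Hope Spotlights are left out.

```python
from dataclasses import dataclass, field

# Hypothetical values: the post does not say how XP or levels are computed.
XP_PER_SIGHTING = 50
XP_PER_LEVEL = 200

# Hypothetical ecosystem collections; a badge needs every species in the set.
COLLECTIONS = {
    "Backyard Birds": {"Northern Cardinal", "Blue Jay"},
    "Ocean Guardians": {"Green Sea Turtle", "Sockeye Salmon"},
}


@dataclass
class FieldJournal:
    sightings: set[str] = field(default_factory=set)
    xp: int = 0

    @property
    def level(self) -> int:
        # Level 1 at 0 XP, plus one level for every XP_PER_LEVEL earned.
        return 1 + self.xp // XP_PER_LEVEL

    def log_sighting(self, species: str) -> None:
        # Only first-time sightings award XP, so repeat uploads do not add levels.
        if species not in self.sightings:
            self.sightings.add(species)
            self.xp += XP_PER_SIGHTING

    def badges(self) -> list[str]:
        # A badge is earned once its whole collection has been logged.
        return sorted(
            name for name, species in COLLECTIONS.items()
            if species <= self.sightings
        )
```

Calling `log_sighting` for each species in a collection and then `badges()` returns the names of the completed collections.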

🎪 MOST IMPORTANTLY: Echo Location bridges the digital-to-real-world gap by issuing actionable "Field Missions," like plastic cleanups 🗑️ or pollinator pledges 🐝, turning learning into tangible, positive environmental action! 💪🌍

🎬 Demo

📱 Deployed Link: https://echo-location-849419792496.us-west1.run.app/

📸 Screenshots:

✨🌟⭐💫⚡🔥💥🎉🎊🎪🌈✨

🧠 How I Used Google AI Studio

Google AI Studio was the central nervous system 🧬 for developing Echo Location! Gemini 2.5 Pro 🤖 is not just a feature; it is the brain 🧠 and the heart ❤️ of the entire application!

🎯 WORKFLOW MAGIC: My primary workflow revolved around extensive prompt engineering in AI Studio! 🛠️✨

I crafted a detailed system prompt that establishes the persona of "Gem," 💎 our AI Field Biologist! This prompt instructs Gemini to act as an enthusiastic guide 🗺️ and to structure all its responses in the format of our "Field Report." 📋
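As a rough illustration of what such a system prompt might look like, the sketch below assembles one from a persona name and a fixed list of report sections. The wording and the section names are assumptions based on the features listed in this post, not the actual prompt used in the app.

```python
# Hypothetical section names, based on the report features listed in the post.
REPORT_SECTIONS = [
    "Species (common and scientific name)",
    "Conservation status",
    "Primary threats",
    "Probable ecosystem and location",
    "Story",
    "Field Mission",
]


def build_system_prompt(persona: str = "Gem") -> str:
    """Return a system prompt that sets the persona and fixes the report layout."""
    numbered = "\n".join(
        f"{i}. {name}" for i, name in enumerate(REPORT_SECTIONS, start=1)
    )
    return (
        f"You are {persona}, an enthusiastic AI Field Biologist guiding an Eco-Scout.\n"
        "Always answer as a Field Report with exactly these sections, in order:\n"
        f"{numbered}\n"
        "If you cannot identify the species, say so rather than guessing."
    )
```

Keeping the section list in one place makes it easy to try different layouts in AI Studio without rewriting the whole prompt.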

🚀 The Process

🔄 THE MAGIC FLOW:

📱 User uploads media (image, video, or audio)
        ↓
🚀 Backend sends to Gemini 2.5 Pro and Flash  
        ↓
🧠 Prompt directs chain-of-thought analysis
        ↓  
📊 Generates immersive Field Report!
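The upload-to-model step in this flow can be sketched as a function that wraps the uploaded bytes in a `generateContent`-style JSON body with the media inlined as base64. This is a sketch under assumptions: the extension table, the function name `build_request`, and the instruction text are hypothetical, and large videos would likely need a file-upload path rather than inlining.

```python
import base64
import os

# Assumed mapping; the post only says images, video, and audio are accepted.
MIME_BY_EXTENSION = {
    ".jpg": "image/jpeg",
    ".jpeg": "image/jpeg",
    ".png": "image/png",
    ".mp4": "video/mp4",
    ".mp3": "audio/mpeg",
    ".wav": "audio/wav",
}


def build_request(filename: str, data: bytes, system_prompt: str) -> dict:
    """Build a request body with the system prompt and one inline media part."""
    extension = os.path.splitext(filename)[1].lower()
    if extension not in MIME_BY_EXTENSION:
        raise ValueError(f"Unsupported media type: {filename}")
    return {
        "system_instruction": {"parts": [{"text": system_prompt}]},
        "contents": [{
            "parts": [
                {"inline_data": {
                    "mime_type": MIME_BY_EXTENSION[extension],
                    # JSON cannot carry raw bytes, so the media is base64-encoded.
                    "data": base64.b64encode(data).decode("ascii"),
                }},
                # Hypothetical instruction text accompanying the upload.
                {"text": "File a Sighting Report for this upload."},
            ],
        }],
    }
```

The returned dictionary can be serialized with `json.dumps` and sent by whatever HTTP client the backend uses; rejecting unknown extensions early avoids sending a request the model cannot interpret.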

🎯 Chain-of-Thought Analysis Breakdown:

🔍 Identify: Determine the species, its scientific name, and its conservation status

🌿 Analyze: Examine the surrounding environment, flora, and context to deduce a probable ecosystem and geolocation

📖 Narrate: Weave this data into an engaging, educational story in Gem's persona

⚡ Act: Based on the species and its threats, generate a relevant, actionable "Field Mission" for the user
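One way to make the output of these four steps usable by the front end is to ask the model for JSON and validate it before rendering. The sketch below assumes a hypothetical set of field names and a hypothetical `parse_field_report` helper; the post does not say which response format the app actually uses.

```python
import json
from dataclasses import dataclass, fields


@dataclass
class FieldReport:
    # Hypothetical field names covering the Identify, Analyze, Narrate, and Act steps.
    common_name: str
    scientific_name: str
    conservation_status: str
    ecosystem: str
    story: str
    field_mission: str


def parse_field_report(raw: str) -> FieldReport:
    """Parse a JSON model response and fail loudly if a step's output is missing."""
    data = json.loads(raw)
    names = [f.name for f in fields(FieldReport)]
    missing = [name for name in names if name not in data]
    if missing:
        raise ValueError(f"Field Report is missing: {', '.join(missing)}")
    # Extra keys in the response are ignored; known keys are coerced to strings.
    return FieldReport(**{name: str(data[name]) for name in names})
```

Validating up front means an incomplete response surfaces as a clear error instead of a half-empty report on screen.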

🎪 PRO TIP: Google AI Studio was indispensable for rapidly testing and refining these complex prompts! Being able to quickly iterate on inputs and outputs and then deploy the app via Cloud Run made the entire development cycle seamless! 🌟

🎨 Multimodal Features

The multimodal capabilities of Gemini 2.5 Pro are what elevate Echo Location from a simple app to an immersive experience! 🎪✨

🌍 Holistic Scene and Video Understanding

🎯 This is the core input! The app doesn't just recognize an animal; it understands the scene! 🎬 When a user uploads a video of a bear 🐻 catching a salmon 🐟, Gemini comprehends:

🎯 The action (hunting)

🤝 The interaction between species

📊 Context differences (e.g., a lion resting 😴 vs. a lion on the prowl 👀)

🌟 MAGIC MOMENT: This contextual understanding allows "Gem" to generate narratives that are incredibly rich and specific to the moment captured by the user, making every Field Report unique and insightful! 💫

🎵 Eco-Acoustic Analysis

🎪 A standout feature is the ability to accept audio uploads! 🎙️ A user can record:

🐦 Bird calls in their backyard

🦗 The sound of insects at night

🤖 GEMINI MAGIC: Gemini analyzes this soundscape to identify potential species ("That distinct call belongs to a Northern Cardinal!" 🐦) and describe the health of the local ecosystem! 🌿

This turns the user's own environment into a subject of discovery and makes them feel like a true field biologist 🔬 using advanced tools! 🛠️

🔄 Context-Driven Content Generation

The app demonstrates a powerful multimodal feedback loop! 🌪️ The visual 👁️ or audio 👂 input is the catalyst for all generated content!

🎯 Examples in Action:

🐢 Sea Turtle Image → doesn't just trigger a story about turtles; it triggers the generation of the "Plastic Patrol Mission" 🗑️

🐝 Garden Bee Audio → triggers the generation of the "Pollinator Pledge Mission" 🌻

🎪 THE SECRET SAUCE: This ensures that the conservation message and the call-to-action are always directly relevant to the user's discovery, creating a powerful and persuasive user experience that drives the app's core mission forward! 🚀💚
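In the app the mission is generated by Gemini, but the same threat-to-mission link can be illustrated with a small lookup, which could also serve as a fallback if the model returns no mission. The threat labels and the `pick_mission` helper are hypothetical; only the two mission names come from this post.

```python
from typing import Optional

# Hypothetical threat labels; only the two mission names appear in the post.
MISSIONS_BY_THREAT = {
    "plastic pollution": "Plastic Patrol Mission",
    "pollinator decline": "Pollinator Pledge Mission",
}


def pick_mission(threats: list[str]) -> Optional[str]:
    """Return the mission for the first recognized threat, or None if none match."""
    for threat in threats:
        # Normalize case and whitespace so "Plastic Pollution " still matches.
        mission = MISSIONS_BY_THREAT.get(threat.strip().lower())
        if mission is not None:
            return mission
    return None
```

Returning `None` for unknown threats leaves it to the caller to decide whether to show a generic mission or none at all.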

✨🌟⭐💫⚡🔥💥🎉🎊🎪🌈✨

Made with ❤️ by @williamhenryking and @linfordlee14! 👨‍💻👨‍💻

🙏 Thanks for reading our submission! 🎉🚀

Top comments (3)

Linford • Sep 14

It was amazing collaborating on this project! Learned a lot about multimodal AI and Google AI Studio along the way. Proud of what we achieved together!

Linford • Sep 14

Echo Location Project is such a creative application of AI! Projects like this show how much potential there is in multimodal AI for real-world solutions.

SS • Sep 15

Very innovative!