This is a submission for the Google AI Studio Multimodal Challenge.
What I Built
I built Travel360, an app that offers guided tours of a city’s top landmarks through Google Maps’ immersive 3D view. Along the journey, users can capture AI-powered selfies at iconic spots.
Travel360 turns repetitive selfies into unique, shareable moments and makes city exploration fun, accessible, and engaging — without the cost or limits of physical travel.
With a camera capture or a simple drag & drop upload, Gemini’s Nano Banana image model places users in front of world-famous landmarks such as Times Square, Tower Bridge, or the Statue of Liberty.
Demo
Here’s a walkthrough of Travel360 in action:
1) Navigate to a city in immersive view: https://travel360-50134736379.us-west1.run.app/
2) Click "Start tour".
3) Upload a selfie or take one with the camera.
4) Generate a realistic selfie at the selected place (using the Gemini Nano Banana API).
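One way to picture the tour behind these steps is an ordered list of landmark stops that the app walks through one at a time. The sketch below is illustrative only and is not taken from the Travel360 code: `TourStop`, `Tour`, and `NEW_YORK_TOUR` are hypothetical names, and the coordinates are approximate.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class TourStop:
    """One landmark on the guided tour (hypothetical structure)."""
    name: str
    lat: float  # latitude in degrees, approximate
    lng: float  # longitude in degrees, approximate


# Hypothetical sample tour; coordinates are approximate.
NEW_YORK_TOUR = [
    TourStop("Times Square", 40.7580, -73.9855),
    TourStop("Statue of Liberty", 40.6892, -74.0445),
]


class Tour:
    """Tracks which stop the user is currently viewing."""

    def __init__(self, stops: List[TourStop]) -> None:
        if not stops:
            raise ValueError("a tour needs at least one stop")
        self._stops = list(stops)
        self._index = 0

    @property
    def current(self) -> TourStop:
        return self._stops[self._index]

    def advance(self) -> Optional[TourStop]:
        """Move to the next stop and return it, or None at the end of the tour."""
        if self._index + 1 >= len(self._stops):
            return None
        self._index += 1
        return self.current
```

In a design like this, the current stop's coordinates would drive the map camera, and its name would give the scene context for the selfie step.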
Presentation & demo:
View live demo
How I Used Google AI Studio
I leveraged Google AI Studio to integrate Gemini’s multimodal capabilities into Travel360 and to accelerate development:
Code & Debugging Support: Gemini helped me generate code snippets, resolve bugs, and streamline integration with Google Maps immersive view.
Smart Place Suggestions: Gemini suggested the best landmarks to feature in each city, along with their geographic coordinates, so I could build a realistic guided tour.
Selfie Generation: With Gemini Nano Banana, I combined user inputs (camera or drag & drop) with scene context to instantly generate realistic selfies at landmarks.
Multimodal Orchestration: Gemini handled both image + text inputs and produced AI-generated outputs, ensuring a seamless user flow between navigation, photo upload, and immersive rendering.
This combination of AI-assisted development and multimodal user experience made Travel360 both faster to build and more engaging to use.
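To make the selfie generation step more concrete, here is a minimal sketch of how an image and a text instruction could be combined into one multimodal request body. It is not the app's actual code. It assumes the Gemini REST `generateContent` JSON layout, and the model ID, the prompt wording, and the `build_selfie_request` helper are all assumptions made for illustration.

```python
import base64

# Assumption: a model ID associated with "Nano Banana" at the time of writing;
# check the current Gemini documentation before relying on it.
MODEL = "gemini-2.5-flash-image-preview"

# Assumption: image types this sketch chooses to accept.
ALLOWED_MIME_TYPES = {"image/jpeg", "image/png", "image/webp"}


def build_selfie_request(image_bytes: bytes, mime_type: str, landmark: str) -> dict:
    """Build a JSON-serializable request body with one text part and one image part."""
    if mime_type not in ALLOWED_MIME_TYPES:
        raise ValueError(f"unsupported image type: {mime_type}")
    # Hypothetical prompt; the real app's wording is not given in this post.
    prompt = (
        f"Create a realistic selfie of the person in this photo standing in "
        f"front of {landmark}. Keep the person's face and clothing unchanged."
    )
    return {
        "contents": [
            {
                "parts": [
                    {"text": prompt},
                    {
                        "inline_data": {
                            "mime_type": mime_type,
                            # Image bytes are sent as base64 text.
                            "data": base64.b64encode(image_bytes).decode("ascii"),
                        }
                    },
                ]
            }
        ]
    }
```

The resulting dictionary would be posted as JSON to the `generateContent` endpoint for `MODEL`, with the API key kept on the server rather than in the browser.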
Multimodal Features
Travel360 enhances the experience through multimodality in several ways:
Visual + Text Inputs: Users interact via photos and instructions.
Immersive Map + AI Output: Combines Google Maps 3D data with AI-generated images.
Social Layer: Share-ready outputs designed for virality on TikTok, Instagram, and beyond.
This fusion of navigation, creativity, and AI-powered storytelling creates a unique, playful, and viral user experience — making every city feel instantly accessible.
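On the output side, a share-ready image has to be pulled out of the model response before it can be shown or downloaded. The sketch below assumes the response follows the Gemini REST shape, where generated images arrive as base64 data inside `candidates[].content.parts[]`. The `extract_image` helper is hypothetical, and it accepts both `inlineData` and `inline_data` spellings as a precaution.

```python
import base64
from typing import Optional


def extract_image(response: dict) -> Optional[bytes]:
    """Return the bytes of the first generated image, or None if there is none."""
    for candidate in response.get("candidates", []):
        parts = candidate.get("content", {}).get("parts", [])
        for part in parts:
            # Text parts have no inline data and are skipped.
            inline = part.get("inlineData") or part.get("inline_data")
            if inline and inline.get("data"):
                return base64.b64decode(inline["data"])
    return None
```

Returning `None` instead of raising lets the interface show a "try again" message when the model replies with text only.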
Only me.