Nizzad

Posted on Sep 15

🏡 DreamNest.AI: AI-Powered House Design, 2D & 3D Plan Audio & Video Walkthroughs & Smart E-Commerce

#devchallenge #googleaichallenge #ai #gemini

Google AI Challenge Submission

This is a submission for the Google AI Studio Multimodal Challenge

The Vision Behind DreamNest.ai - AI-Powered Home Designer & Marketplace

As someone passionate about democratizing design and making home planning accessible to everyone, my goal with this submission is to showcase the transformative power of Google AI Studio and demonstrate how its multimodal Gemini capabilities can revolutionize the home design industry. DreamNest.ai is a comprehensive AI-powered platform that transforms simple user requirements into complete, visualized home design packages - proving that sophisticated architectural visualization is no longer limited to expensive design firms.

Home design has always been an exclusive, expensive process requiring specialized knowledge and professional expertise. Most people struggle to visualize their dream home from basic floor plans or communicate their vision effectively to architects. I wanted to build something that breaks down these barriers - an intuitive, accessible platform that lets anyone input their requirements and instantly receive professional-quality design assets including photorealistic renderings, detailed blueprints, immersive video tours, and curated shopping lists.

In exploring Google AI Studio and its multimodal models, I discovered they can dramatically streamline the entire design workflow:

🏠 Generate photorealistic home renderings from simple text descriptions using Imagen 4.0, producing both exterior and interior visualizations that rival professional architectural renders

House' Front View

House's view from Inside

📐 Create detailed 2D and 3D floor plans with AI-generated blueprints and dollhouse-style cutaway views that help users understand spatial relationships

🎬 Produce cinematic 3D video tours using Veo 2.0 that provide immersive walkthroughs of the designed spaces

🗣️ Generate eloquent voice-guided walkthroughs with Gemini 2.5 Flash that describe the home's features in compelling, real estate agent style

🛍️ Curate personalized shopping lists with structured JSON output that matches furniture and decor to the home's architectural style
⚡ Deliver instant results through parallel processing of multiple AI models, providing a complete design package in minutes rather than weeks

🎨 Maintain design coherence across all generated assets by using consistent prompts and style guidelines throughout the generation pipeline

This submission demonstrates that Google AI Studio isn't just a research tool but a practical platform for building production-ready applications that can disrupt traditional industries and make professional services accessible to everyone.

Through a comprehensive technical implementation, I hope to inspire developers, designers, and entrepreneurs to explore multimodal AI for creating tools that genuinely democratize expertise and empower users to bring their visions to life.

1️⃣ What I Built
2️⃣ Live Demo
3️⃣ How I Used Google AI Studio
4️⃣ Multimodal Features
5️⃣ Real-World Problem Solving
6️⃣ Application Features & Best Practices
7️⃣ Development & Deployment Details
8️⃣ Challenge Compliance
9️⃣ Future Enhancements
🔟 Lessons Learned
✅ Conclusion

1️⃣ What I Built

DreamNest.ai is a comprehensive AI-powered home design platform that transforms the traditional architectural design process. Instead of requiring expensive consultations, technical drawings, and weeks of back-and-forth communication, users can simply input their requirements through an intuitive form and receive a complete design package within minutes.

The platform solves the critical problem of design accessibility and visualization. It addresses:

Visualization Gap: Most people can't interpret technical blueprints or imagine spaces from 2D plans
Cost Barrier: Professional architectural visualization is expensive and time-consuming
Communication Challenge: Translating personal vision into technical requirements is difficult
Decision Paralysis: Without seeing options, users struggle to make design decisions

The experience DreamNest creates is one of empowerment and inspiration. By transforming complex architectural concepts into beautiful, understandable visuals, it helps users make confident decisions about their most important investment - their home.

2️⃣ Demo

Live Application: DreamNest.ai - AI Home Designer

Video Demo of DreamNest.AI

Github Repository

mohamednizzad / DreamNest.AI

The Vision Behind DreamNest.ai - AI-Powered Home Designer & Marketplace

DreamNest.ai – AI-Powered Home Designer & Marketplace

🌟 Core Idea

DreamNest is an innovative AI platform that transforms user-defined house requirements into a complete, visualized home design package. Users can input parameters like plot size, number of rooms, and architectural style, and instantly receive AI-generated blueprints, photorealistic images, a voice-guided walkthrough, a 3D video tour, and a curated shopping list for materials and furniture.

This project serves as a powerful demonstration of the multimodal capabilities of the Google Gemini API, integrating text, image, and video generation into a cohesive and user-friendly web application.

✨ Features

Intuitive Input Form: A simple, multi-section form for users to specify their dream home parameters.
AI-Powered Generation: Leverages the Gemini API for a suite of generated assets
- Photorealistic Renderings: High-quality interior and exterior images generated by Imagen.
- Blueprint Description: A detailed textual layout of the home's floor plan.
- Voice-Guided Walkthrough: An eloquent, AI-generated script describing…

View on GitHub

Key Features Showcase

1. Intuitive Input Form: Clean, multi-section interface for specifying home requirements including plot dimensions, architectural style, room counts, and special features.

2. Real-time Generation Progress: Dynamic loading interface with stage-by-stage updates showing users exactly what's being generated.

3. Comprehensive Results Display:

Photorealistic exterior and interior renderings
Detailed 2D architectural blueprints
3D dollhouse-style floor plans
Cinematic video walkthrough tours
Voice-guided property descriptions with text-to-speech
Curated shopping lists with furniture and decor recommendations

4. Interactive Features:

Day/night theme toggle for user comfort
Downloadable design assets for offline use
Responsive design optimized for all devices
Error handling with graceful degradation

3️⃣ How I Used Google AI Studio

Google AI Studio Usage Demo

Multi-Model Orchestration Strategy

The entire DreamNest platform is built around a sophisticated orchestration of multiple Google AI Studio models, each optimized for specific tasks:

// Core generation pipeline using multiple Gemini models
const generateDesignAssets = async (formData: FormData): Promise<DesignAssets> => {
    const basePrompt = generatePrompt(formData);

    // Parallel text generation with Gemini 2.5 Flash
    const [walkthroughScript, shoppingList] = await Promise.all([
        generateWalkthroughScript(basePrompt),
        generateShoppingList(basePrompt), // Uses structured JSON output
    ]);

    // Image generation with Imagen 4.0
    const images = await generateImages(basePrompt);

    // Floor plan generation (2D and 3D)
    const [plan2D, plan3D] = await Promise.all([
        generate2DPlan(basePrompt),
        generate3DPlan(basePrompt),
    ]);

    // Video generation with Veo 2.0 (rate-limited)
    const videoUrl = await generateVideo(basePrompt);

    return { images, videoUrl, walkthroughScript, shoppingList, plan2D, plan3D };
};

Model-Specific Implementations

Gemini 2.5 Flash for Structured Data:

   // Shopping list generation with JSON schema
   const response = await ai.models.generateContent({
       model: 'gemini-2.5-flash',
       contents: shoppingPrompt,
       config: {
           responseMimeType: 'application/json',
           responseSchema: {
               type: Type.ARRAY,
               items: {
                   type: Type.OBJECT,
                   properties: {
                       name: { type: Type.STRING },
                       description: { type: Type.STRING },
                       priceRange: { type: Type.STRING },
                   },
                   required: ["name", "description", "priceRange"],
               },
           },
       },
   });

Imagen 4.0 for Photorealistic Renderings:

   const imageResults = await Promise.all([
       ai.models.generateImages({
           model: 'imagen-4.0-generate-001',
           prompt: `Photorealistic, ultra-detailed exterior view of ${prompt}. Cinematic lighting, 8k resolution.`,
           config: { numberOfImages: 1, aspectRatio: "16:9" },
       }),
       ai.models.generateImages({
           model: 'imagen-4.0-generate-001', 
           prompt: `Photorealistic interior view of living room and kitchen. Natural lighting, warm and inviting.`,
           config: { numberOfImages: 1, aspectRatio: "16:9" },
       })
   ]);

Veo 2.0 for Cinematic Video Tours:

   const operation = await ai.models.generateVideos({
       model: 'veo-2.0-generate-001',
       prompt: `Aerial and cinematic 3D architectural walkthrough of ${prompt}. Hyperrealistic rendering.`,
       config: { numberOfVideos: 1 }
   });

Cost Management & Rate Limiting

Implemented intelligent rate limiting for expensive operations:

// Video generation cooldown to manage API costs
const VIDEO_GENERATION_COOLDOWN_MS = 60 * 60 * 1000; // 1 hour
const lastVideoGenTimestamp = localStorage.getItem('lastVideoGenerationTimestamp');

if (timeSinceLastGen < VIDEO_GENERATION_COOLDOWN_MS) {
    updateStatus(setGenerationStatus, 'Video Generation', 
        `Skipped: Hourly limit active. Try again in ${minutesRemaining} mins.`);
}

4️⃣ Multimodal Features

DreamNest showcases the full spectrum of Google AI Studio's multimodal capabilities through an integrated design pipeline:

Cross-Modal Content Generation

Text-to-Image Pipeline: User text requirements → Detailed prompts → Photorealistic renderings
Text-to-Video Pipeline: Architectural descriptions → Cinematic walkthrough videos
Text-to-Structured Data: Design requirements → JSON-formatted shopping lists
Multi-Format Blueprints: Single prompt → Both 2D technical drawings and 3D visualizations

Coherent Multi-Asset Generation

The platform maintains design coherence across all generated assets by:

Using consistent base prompts derived from user input
Applying style-specific modifiers to each generation type
Ensuring architectural accuracy across 2D plans, 3D views, and photorealistic renders
Matching furniture recommendations to the generated visual style

Progressive Enhancement

Parallel Processing: Multiple AI models work simultaneously to reduce wait times
Graceful Degradation: If video generation fails, other assets still provide value
Real-time Feedback: Users see progress updates as each asset is generated
Incremental Loading: Results display as soon as each component is ready

5️⃣ Real-World Problem Solving

DreamNest addresses fundamental challenges in the home design industry:

Democratizing Professional Design

Accessibility: Makes high-quality architectural visualization available to everyone, not just those who can afford professional designers
Speed: Reduces design iteration time from weeks to minutes
Cost: Eliminates expensive consultation fees for initial design exploration
Understanding: Transforms technical blueprints into intuitive visual representations

Bridging the Visualization Gap

Spatial Understanding: 3D floor plans help users understand room relationships and flow
Style Exploration: Photorealistic renders let users see different architectural styles applied to their specific requirements
Material Planning: Shopping lists connect design vision to actionable purchasing decisions
Communication: Generated assets provide a common visual language for discussing design with contractors and family

Creative Multimodal Applications

Architectural Storytelling: Voice-guided walkthroughs create emotional connections to spaces before they're built
Cross-Modal Validation: 2D blueprints, 3D visualizations, and photorealistic renders provide multiple perspectives on the same design
Contextual Commerce: Shopping recommendations are generated based on visual style analysis, not just text descriptions
Progressive Disclosure: Information is revealed in stages (form → progress → results) to prevent cognitive overload

6️⃣ Application Features & Best Practices

Core Features Implemented

Multi-Section Input Form: Comprehensive form capturing plot dimensions, architectural style, room requirements, and special features

Real-time Progress Tracking: Dynamic status updates with stage-specific messaging during generation

Comprehensive Asset Display: Organized presentation of all generated content with intuitive navigation
Voice Integration: Text-to-speech functionality for walkthrough scripts using browser APIs
Theme Management: Persistent dark/light mode with smooth transitions
Download Capabilities: Users can save generated images and plans for offline use
Error Handling: Graceful failure modes with clear user feedback and recovery options

Engineering Excellence

// Structured error handling with user-friendly messages
try {
    const assets = await generateDesignAssets(formData, setGenerationStatus);
    setDesignAssets(assets);
} catch (err) {
    const errorMessage = err instanceof Error ? err.message : 'An unknown error occurred.';
    setError(`Failed to generate design. ${errorMessage}`);
} finally {
    setIsLoading(false);
}

Performance Optimizations

Parallel API Calls: Independent operations run simultaneously to minimize total generation time
Progressive Loading: UI updates as each asset becomes available
Memory Management: Blob URLs are properly managed for video content
State Management: Clean separation of loading, error, and success states

User Experience Design

Intuitive Flow: Clear progression from input → generation → results
Visual Feedback: Loading spinners, progress messages, and status indicators
Responsive Design: Optimized for mobile, tablet, and desktop viewing
Accessibility: ARIA labels and semantic HTML for screen reader compatibility

7️⃣ Development & Deployment Details

Technology Stack

Frontend: React 19 + TypeScript for type-safe component development
Styling: Tailwind CSS for utility-first responsive design
AI Integration: Google AI Studio SDK (@google/genai) for all multimodal operations
State Management: React Hooks with custom status management
Build System: Vite for fast development and optimized production builds

API Integration Architecture

// Service layer abstraction for clean separation of concerns
export const generateDesignAssets = async (
    formData: FormData, 
    setGenerationStatus: React.Dispatch<React.SetStateAction<GenerationStatus>>
): Promise<DesignAssets> => {
    // Orchestrate multiple AI model calls with progress tracking
    const basePrompt = generatePrompt(formData);

    // Sequential and parallel operations optimized for performance
    updateStatus(setGenerationStatus, 'Initialization', 'Crafting design brief...');

    const [walkthroughScript, shoppingList] = await Promise.all([
        generateWalkthroughScript(basePrompt),
        generateShoppingList(basePrompt),
    ]);

    // Continue with image and video generation...
};

Deployment Configuration

Platform: Google Cloud Run for serverless scaling
Environment: Node.js runtime with environment variable management
Security: API keys managed through secure environment variables
Monitoring: Built-in error tracking and performance monitoring

Code Organization

src/
├── components/          # Reusable React components
│   ├── Header.tsx      # Navigation and theme toggle
│   ├── InputForm.tsx   # User requirement collection
│   ├── GenerationProgress.tsx  # Real-time status display
│   ├── ResultsDisplay.tsx      # Generated asset presentation
│   └── ProductCard.tsx         # Shopping list item display
├── services/           # External service integrations
│   └── geminiService.ts       # Google AI Studio API wrapper
├── types.ts           # TypeScript interface definitions
└── App.tsx           # Main application orchestration

8️⃣ Challenge Compliance

This submission fully meets all requirements of the "Build and Deploy a Multimodal Applet" challenge:

✅ Built on Google AI Studio - Developed using Google AI Studio's comprehensive API suite with multiple model integrations

✅ Deployed using Cloud Run - Production deployment on Google Cloud Run with auto-scaling and high availability

✅ Multimodal Functionality - Implements multiple Gemini capabilities:

Gemini 2.5 Flash for text generation (walkthrough scripts, blueprint descriptions) and structured JSON output (shopping lists)
Imagen 4.0 for photorealistic image generation (exterior/interior renderings, 2D/3D floor plans)
Veo 2.0 for cinematic video generation (3D architectural walkthroughs)
**Voice Generation

// Hypothetical future API
const generateVoiceNarration = async (script: string) => {
    const response = await ai.models.generateAudio({
        model: 'text-to-speech-model',
        text: script,
        voice: 'professional-narrator'
    });
};

Beyond Basic Requirements

Production-Ready: Full error handling, rate limiting, and user experience optimization
Cost-Conscious: Intelligent rate limiting for expensive operations
Scalable Architecture: Clean separation of concerns and modular design
Real-World Application: Addresses genuine market needs in the home design industry

9️⃣ Future Enhancements

Advanced AI Features

Audio Integration: Voice input for describing design preferences using Gemini's audio understanding
Interactive Refinement: Conversational AI for iterating on designs based on user feedback
Style Transfer: Apply different architectural styles to existing designs
Sustainability Analysis: AI-powered recommendations for eco-friendly materials and energy efficiency

Platform Expansion

Marketplace Integration: Connect users with real contractors and suppliers
VR/AR Visualization: Immersive viewing of generated designs
Collaborative Features: Multi-user design sessions and sharing capabilities
Mobile App: Native iOS/Android applications with camera integration

Business Intelligence

Cost Estimation: AI-powered construction cost analysis
Permit Assistance: Automated building code compliance checking
Timeline Planning: Project scheduling and milestone tracking
Vendor Matching: AI-powered contractor and supplier recommendations

🔟 Lessons Learned

Building DreamNest.ai provided invaluable insights into multimodal AI development and production deployment:

Technical Insights

🔄 Orchestration Complexity: Managing multiple AI models requires careful coordination of API calls, error handling, and progress tracking
💰 Cost Management: Expensive operations like video generation need intelligent rate limiting and user communication
⚡ Performance Optimization: Parallel processing dramatically improves user experience but requires careful resource management
🎯 Prompt Engineering: Consistent, detailed prompts across models ensure coherent output quality

User Experience Learnings

📱 Progressive Disclosure: Breaking complex generation into stages prevents user anxiety during long operations
🎨 Visual Coherence: Maintaining consistent style across different AI-generated assets requires careful prompt design
🔄 Error Recovery: Graceful degradation and clear error messages are crucial for user trust
⏱️ Expectation Management: Real-time progress updates significantly improve perceived performance

Business Insights

🏠 Market Validation: Strong user interest in accessible design tools confirms market demand
🤖 AI as Enabler: AI doesn't replace human creativity but dramatically amplifies accessibility and speed
💡 Democratization Impact: Making professional tools accessible can disrupt traditional service industries
🔮 Future Potential: Multimodal AI opens possibilities for entirely new user experiences and business models

Development Best Practices

🏗️ Architecture Matters: Clean separation between AI services and UI components enables rapid iteration
📊 Structured Output: Using JSON schemas for AI responses eliminates parsing errors and improves reliability
🛡️ Defensive Programming: Comprehensive error handling and fallbacks are essential for production AI applications
📈 Scalable Design: Building for scale from the beginning prevents costly refactoring later

✅ Conclusion

Building DreamNest.ai has been an incredible journey that showcases the transformative potential of Google AI Studio's multimodal capabilities. This project demonstrates that AI isn't just about automating existing processes - it's about creating entirely new possibilities and democratizing access to professional expertise.

By combining Gemini 2.5 Flash, Imagen 4.0, and Veo 2.0 into a cohesive platform, DreamNest transforms the traditionally exclusive world of architectural design into an accessible, intuitive experience. Users can now visualize their dream homes with professional-quality renderings, understand spatial relationships through multiple visual formats, and make informed decisions with AI-generated guidance.

The technical implementation proves that multimodal AI can power production-ready applications that solve real-world problems. Through careful orchestration of multiple models, intelligent cost management, and user-centric design, we've created a platform that genuinely empowers users while demonstrating the practical potential of AI technology.

This project represents more than just a technical achievement - it's a glimpse into a future where AI democratizes expertise, making professional-quality services accessible to everyone. My hope is that DreamNest inspires other developers to explore the creative possibilities of multimodal AI and build applications that genuinely improve people's lives.

The future of AI isn't just about smarter algorithms - it's about creating more accessible, more human, and more empowering experiences. DreamNest.ai is just the beginning.

References and Resources

🏠 Experience DreamNest.ai Today!

Transform your home design dreams into reality with the power of AI.

👉 Launch DreamNest.ai - Powered by Google AI Studio

Built with ❤️ for Dev.to — Powered by Google AI Studio

Top comments (53)

Nizzad • Sep 15

ATTENTION: I have unbilled this project from google account billing in order to minimize the cost. Since, unprecedented reach of this post within the publication of few hours, I have already occurred nearly $25.

Therefore, now I have made this project unbilled from my GCP hence, the live applet link mentioned above cannot be accessed. However, I have provided the demo and my work in detail.

However, if inaccessibility will adversely affect judging, kindly let me know so that I will try to find any alternative to keep it accessible live.

Over to you @ben @noahbrinker @jess

I spent several days and hours into come up with this submission and I don't want this issue hinder judging.

Thank you in advance.

Ben Halpern • Sep 15

Thank you for the notice and we will take this into account

Nizzad • Sep 15

Much appreciated for your immediate response. If you want to see it working live, please do not hesitate to let me know.

Nikoloz Turazashvili (@axrisi) • Sep 16

can you please add link to AI studio applet? this way we can test it without you running it on GCP? :)

Comment deleted

Nikoloz Turazashvili (@axrisi) • Sep 20

thank you :) testing now

Ahamed Ahnaf • Sep 15

This is an absolutely brilliant project! DreamNest.ai captures exactly what the future of design tools should be: powerful, inclusive, and easy to use. The way you’ve harnessed Google AI Studio’s multimodal features to deliver 2D/3D plans, photorealistic renders, video walkthroughs, voice-guided tours, and curated shopping lists all from simple inputs is very impressive. It’s inspiring to see how you’re democratizing architectural visualization, making what used to be a luxury accessible to everyone. Kudos for both the technical depth and the thoughtful user-experience design. Looking forward to seeing how this evolves! 👏

Nizzad • Sep 15

Thanks a lot for your detailed response.

Shan F • Sep 15

Wow. Absolutely brilliant idea. I like the 3D rendering and how thoroughly you've organized the contents.

Nizzad • Sep 15

Thank you.

Fayaz • Sep 15

Cool!

You've done a lot of work with this!
All the best.

Nizzad • Sep 16

Great. Thanks

Ruwan Guna • Sep 19

An exceptional submission with in-depth details. Advanced Congratulations.

Nizzad • Sep 23

Thank you.

Anne Rose • Sep 20

Overwhelmed by the quality of your work and the details.

Nizzad • Sep 23

Thank you.

Joseph • Sep 21

Let's collaborate to take this for the next level

Yousuf Mohamed • Sep 22

Awesome work

Nizzad • Sep 23

Thank you.

IT Plus Info • Sep 22

Great work sir

Nizzad • Sep 23

Thank you.

Parag Nandy Roy • Sep 17

This is quite cool..

Nizzad • Sep 23

Thank you.

View full discussion (53 comments)

Some comments may only be visible to logged-in visitors. Sign in to view all comments. Some comments have been hidden by the post's author - find out more

The Vision Behind DreamNest.ai - AI-Powered Home Designer & Marketplace

Table of Contents

1️⃣ What I Built

2️⃣ Demo

Video Demo of DreamNest.AI

Github Repository

mohamednizzad / DreamNest.AI

The Vision Behind DreamNest.ai - AI-Powered Home Designer & Marketplace

DreamNest.ai – AI-Powered Home Designer & Marketplace

🌟 Core Idea

✨ Features

Key Features Showcase

3️⃣ How I Used Google AI Studio

Google AI Studio Usage Demo

Multi-Model Orchestration Strategy

Model-Specific Implementations

Cost Management & Rate Limiting

4️⃣ Multimodal Features

Cross-Modal Content Generation

Coherent Multi-Asset Generation

Progressive Enhancement

5️⃣ Real-World Problem Solving

Democratizing Professional Design

Bridging the Visualization Gap

Creative Multimodal Applications

6️⃣ Application Features & Best Practices

Core Features Implemented

Engineering Excellence

Performance Optimizations

User Experience Design

7️⃣ Development & Deployment Details

Technology Stack

API Integration Architecture

Deployment Configuration

Code Organization

8️⃣ Challenge Compliance

Beyond Basic Requirements

9️⃣ Future Enhancements

Advanced AI Features

Platform Expansion

Business Intelligence

🔟 Lessons Learned

Technical Insights

User Experience Learnings

Business Insights

Development Best Practices

✅ Conclusion

References and Resources

🏠 Experience DreamNest.ai Today!