Vikas Sahani

Posted on Apr 22

Google Stitch 2.0: Senior-Level UI in Seconds, But Editing Still Breaks

#ai #google #uidesign #nocode

Google Stitch 2.0 can generate production-quality UI in seconds.

But the moment you try to refine a single component, the system starts to break.

This post is a hands-on evaluation of Stitch while building a real product flow, focusing on where it excels and where it fails under practical usage.

TL;DR

UI generation quality is exceptionally high (near senior-level output)
Iteration speed with Gemini 3 Flash is extremely fast
DESIGN.md improves consistency across outputs
Component-level editing is unreliable
Agent can become unresponsive with no feedback

Conclusion: Strong for generation, weak for controlled refinement

Demo Video

Context

This evaluation was done while building a real product flow for an AI-powered agriculture application.

The workflow included:

Prompt → UI generation
Multi-screen navigation
Component-level edits using "Edit with AI"
Export pipeline testing

What Works Well

High-quality UI generation

Using Gemini 3.1 Pro ("Thinking" mode), Stitch produces layouts with:

Strong visual hierarchy
Clean spacing and alignment
Coherent component structuring

The output often matches what a mid-to-senior designer would produce as a first iteration.

Fast iteration loop

With Gemini 3 Flash:

Multiple UI variations are generated within seconds
Exploration cost is significantly reduced

This compresses early-stage design cycles into a single interaction loop.

Accurate instruction-to-design translation

Compared to earlier versions:

Prompts are interpreted more consistently
Layout intent is preserved
Component grouping is largely correct

DESIGN.md improves consistency

Defining design rules such as:

Colors
Typography
Component constraints

results in more consistent outputs across iterations.

Export layer is well structured

Available outputs include:

AI Studio
Figma
Code export

This creates a clear bridge between design and implementation.

Where It Breaks

Case Study: Scan Button Component

While refining a floating "Scan Crop" button, the following behavior was observed.

Intended behavior

Default state: circular icon button
First interaction: expands into pill with "Scan" label
Second interaction: triggers scan action

Step 1: Edit with AI

Result:

Button collapsed into circular form (expected)
Scan icon was removed (unexpected)

This resulted in a visually ambiguous control with no clear affordance.

Step 2: Targeted correction

Instruction:

"Add scan icon while keeping all other properties unchanged"

Result:

No visible update
No feedback
No error signal

Step 3: Repeated attempts

Same instruction issued multiple times
No change observed

Outcome

Edit pipeline became unresponsive
Component could not be refined further
Required abandoning the edit flow

Analysis

This behavior indicates a system-level limitation rather than a simple generation error.

Weak component-level editing

The system struggles with scoped modifications such as:

Modify one attribute while preserving structure

Lack of deterministic control

Edits are:

Non-reliable
Not precisely applied

Agent state inconsistency

The system appears unable to:

Maintain prior edit context
Apply incremental changes

Silent failure

There is no:

Error feedback
Retry mechanism
Recovery guidance

The system simply stops responding.

Why This Matters

This directly impacts practical usability:

Reduces trust in iterative refinement
Forces regeneration instead of controlled editing
Slows down later-stage prototyping

Stitch is strong in generation, but unreliable in refinement.

Missing Capabilities

For production-level workflows, the following are required:

Deterministic edit mode

Apply scoped, predictable changes.

Component locking

Preserve structure while editing specific attributes.

Feedback system

Provide clear signals when edits fail.

Visual diff

Show before and after changes.

Retry mechanism

Automatically handle failed edits.

Strategic Observation

Stitch 2.0 has largely solved the generation problem.

The remaining challenge is control.

Final Assessment

Strengths:

High-quality UI generation
Fast iteration
Strong direction toward AI-native design workflows

Limitations:

Fragile editing
Lack of deterministic control
Agent reliability issues

Conclusion

Stitch is already a highly capable prototyping system.

However, until component-level editing becomes reliable, it cannot replace traditional design workflows for production use.

For now:

Use it for rapid generation
Avoid relying on it for precise refinement

DEV Community