Lalit

Posted on May 23

Beyond Prompts: Structuring AI Workflows for Real Frontend Engineering

#ai #frontend #productivity #react

This is a field-tested workflow for using AI effectively in production-grade, enterprise level React codebases in a monrepo setup — beyond toy Todo apps.

Everyone is using AI to boost development productivity these days and organization also pushing the developers to use more and more AI to increase productivity and efficiency. But after weeks of trial and error in a, I cab sat I I have sort of discovered that the real gains don't only come from better prompts or fancier models. They come from structured workflows.

This isn't a theoretical guide. It's my actual day-to-day workflow inside Visual Studio Code using GitHub Copilot (Enterprise version) on a production codebase with multiple apps, shared UI packages, and messy state flows. I use it for bug fixing, root cause analysis, feature addition, refactoring, and navigating codebases where "finding where this logic lives" used to take hours.

Here's how I made AI actually useful for real frontend engineering.

The Biggest Mistake: One Chat For Everything

The most common anti-pattern I see? Using a single chat session to debug, generate features, refactor, and ask architecture questions all at once.

In large codebases, this destroys context. The AI gets confused, starts hallucinating, and loses track of the actual problem. The fix is simple but powerful: separate agent-style workflows — not necessarily official Copilot Agents, but mentally distinct sessions with singular purposes.

And Yes — It's Cheaper Too

Here's something that everyone should be carefull of: running everything in one massive chat is actually more expensive.

When you dump investigation, debugging, refactoring, and implementation into the same conversation, the context window balloons. Every subsequent prompt carries the entire history — failed hypotheses, wrong turns, tangential questions, and code snippets from three different problems. You're paying tokens for noise.

With separate chats:

Investigation and planning chats are short and focused. Once you get your RCA, you close it. Those tokens don't follow you into implementation.
Implementation starts with only relevant context. No baggage from the five wrong theories the AI explored during debugging.
You avoid the "spiral" effect. One-chat workflows often devolve into 20+ message threads trying to course-correct a confused AI. That's 20x the tokens for worse output.
Skills and Hooks reduce prompt size. Persistent instructions mean you don't need to repeat your team's rules every single message.

In practice, I have increased tickets resolution with same amount of tokens monthly.

My Multi-Agent Workflow

I run three distinct mental "agents." Each gets its own chat context, its own rules, and its own goal.

1. The Investigation Agent (Most Used)

Purpose: Understand context or logic before start fixing.

This agent is not allowed to write code initially. Its only job is to trace, analyze, and explain to me whats going on.

Example Prompt:

You are a debugging assistant.
Do not suggest a fix immediately.
First, understand the issue deeply.

Trace:
- Where state originates from
- Where state updates happen
- Are there any Async flow that and what are they
- Component re-renders
- API calls, where, when and how
- Memoizations
- Stale closure possibilities

Then explain possible root causes ranked by probability.

Why this works: Default AI behavior is to suggest a fix without completely understanding the problem and then will to multiple iteration to make that write and so many changes. That's dangerous in production for an enterprise grade app. By forcing investigation first, the AI behaves like an actual engineer.

Lets take a real Real Example:

Issue: Table not updating after mutation.

Bad prompt: "Why is the table not updating? Fix this."

Good prompt:

After a successful mutation response, the UI does not update until page refresh.

Stack:
- React Query
- Shared table component
- Optimistic updates disabled

Investigate:
- Cache invalidation flow
- Stale query possibilities
- Memoized rows
- Dependency issues
- Selector issues

Do not suggest a fix immediately.

Now the AI starts reasoning instead of guessing.

2. The Fix Implementation Agent

Once root cause is clear, I start a new chat. Fresh context. No baggage from the investigation.

Example Prompt:

Implement a minimal fix only.

Constraints:
- Preserve existing architecture
- No large refactors
- Maintain strict TypeScript typings
- Avoid changing shared components
- Keep the PR small and reviewable

Issue: React Query cache invalidation is missing after mutation success.

This gives dramatically better output than mixing investigation and implementation in one giant prompt.

3. The Refactor Agent

Another fresh chat. Used for cleanup, not fixes.

Example Prompt:

Refactor this component carefully.

Goals:
- Reduce unnecessary re-renders
- Avoid unnecessary useEffect
- Preserve exact behavior
- Improve readability
- Keep the same public API

Do not rewrite the entire component.

This is critical because AI loves rewriting entire files. In production apps, that's a recipe for regressions.

The Most Underrated Feature: Skills

Copilot's "Skills" (or custom instructions) let you define persistent behavior. Instead of repeating rules every session, you teach the AI how your team works.

My React Skill

# React Frontend Rules

- Prefer derived state over useEffect syncing
- Avoid unnecessary useEffect
- Keep components pure where possible
- Preserve existing architecture
- Use existing shared UI components
- Maintain strict TypeScript typings
- Avoid introducing new libraries
- Follow current folder structure
- Prefer composition over prop drilling
- Avoid premature memoization
- Preserve accessibility attributes

With this, the AI stops generating tutorial-level React code. Production React and YouTube React are quite a different thing.

My Debugging Skill

# Debugging Rules

- Investigate before fixing
- Explain root cause clearly
- Rank possible issues by probability
- Preserve current architecture
- Avoid speculative fixes
- Mention edge cases
- Mention async timing issues
- Analyze re-render behavior

I also maintain skills for TypeScript, API integration, and PR review rules. Think of it as onboarding the AI to your engineering culture.

Hooks: Automatic Context Injection

Hooks automatically inject context when AI touches specific file types. This saves enormous time.

React Component Hook:

Whenever editing a React component, automatically remind:
- Preserve TypeScript typings
- Avoid unnecessary re-renders
- Use existing hooks and patterns
- Follow shared component conventions

API File Hook:

When touching API files:
- Preserve API contract
- Don't break response typing
- Maintain error handling structure
- Preserve retry logic

Individually, these are tiny reminders. Together, they compound into a massive productivity boost and far fewer review comments.

Model Selection Matters

Copilot Enterprise gives access to multiple models. I don't use one for everything.

Claude

Understanding huge files
Root cause analysis
Architecture reasoning
Debugging messy logic

Claude excels at reading ugly production code, especially large React components with tangled state.

GPT Models

Implementation and repetitive coding
Refactoring
TypeScript-heavy generation
Utility creation

GPT is usually faster during active feature implementation. I also use Copilot's auto model selection to save tokens on simpler tasks.

Lets take a full Real-World Workflow

Issue: Search filters reset randomly.

Step 1 — Investigation (Claude)

Investigate:
- Where filter state lives
- Persistence flow
- what triggering re-render
- URL sync behavior
- Async effects
- Remount possibilities

Step 2 — Root Cause Analysis

AI identifies:

Component remounts on route change
Local state resets
Missing URL persistence

Step 3 — Implementation (Fresh Chat, GPT)

Implement minimal fix.
Persist filters through URL query params.

Constraints:
- Preserve current architecture
- Avoid global state
- Maintain backward compatibility
- Avoid affecting other pages

Result: Scoped, reviewable, correct. No accidental refactors. No mystery bugs.

The Real Productivity Gain

It's not autocomplete. It's faster understanding.

In large codebases, the hardest problem is often: "Where is this logic even coming from?"

With a structured AI workflow, I can:

Trace state flows in minutes, not hours
Summarize sprawling files instantly
Find impact areas before touching code
Understand old, undocumented code

That alone saves an insane amount of time for me.

What's your AI workflow? I'd love to hear how others are structuring context in there apps.

DEV Community

Beyond Prompts: Structuring AI Workflows for Real Frontend Engineering

The Biggest Mistake: One Chat For Everything

And Yes — It's Cheaper Too

My Multi-Agent Workflow

1. The Investigation Agent (Most Used)

2. The Fix Implementation Agent

3. The Refactor Agent

The Most Underrated Feature: Skills

My React Skill

My Debugging Skill

Hooks: Automatic Context Injection

Model Selection Matters

Claude

GPT Models

Lets take a full Real-World Workflow

Step 1 — Investigation (Claude)

Step 2 — Root Cause Analysis

Step 3 — Implementation (Fresh Chat, GPT)

The Real Productivity Gain

Top comments (0)