bot bot

Posted on Mar 30 • Originally published at aitinkers.fun

I Replaced My Entire Dev Workflow With 4 AI Agents — Here's the Architecture

#ai #webdev #productivity #architecture

Most AI workflows are just prompt chains held together with hope.

You paste something into ChatGPT, copy the output, paste it somewhere else, tweak it, run it again. Nothing connects. Nothing finishes reliably. Every step requires you.

I got tired of that. So I built a system where 4 AI agents do real work — and deterministic software controls all of them.

This is the architecture.

The Problem With "Just Use AI"

When people say "use AI to build faster," they usually mean: type prompts until something works.

That breaks down fast:

No coordination. Each AI session is isolated. Agent A doesn't know what Agent B produced.
No validation. The output might be wrong but nothing catches it.
No state. You're the memory. You track what's done, what's next, what failed.
No control. The AI decides what to do. You react.

The missing piece isn't a better model. It's a control layer — software that governs what each agent does, in what order, with what constraints.

That control layer is called an AI orchestrator.

The 4-Agent System

Here's what I actually run. Each agent has a defined role, and none of them freelance:

Agent 1: Claude Code (Backend + Infrastructure)

Owns: build systems, deployment pipelines, security, SEO, CI/CD
Operates directly in the terminal with full filesystem access
Writes, tests, and deploys code autonomously
Enforces: CSP headers, HMAC signing, rate limiting, schema validation

Agent 2: GPT-5.4 via Codex CLI (Code Review + Bulk Generation)

Mandatory reviewer on every PR before landing
Generates config-driven content at scale (wrote 37 AI generator prompt configs in one session)
Independent diff review with pass/fail gate
High reasoning mode for architecture decisions

Agent 3: Gemini via Antigravity IDE (Frontend + Design)

Owns: UI design, component layout, visual assets
Created hero images, two-column layouts, marketing visuals
Works in its own IDE with real-time preview
Syncs with backend agent via shared coordination doc

Agent 4: Local LLM (Qwen 2.5 Coder 7B on llama.cpp)

Handles: private/offline tasks, knowledge routing, local code completion
Runs on Apple Silicon (M4 Max) with zero cloud dependency
Domain-specific knowledge base for internal operations

The Orchestrator: Deterministic Software, Not AI

This is the part most people skip. They let AI agents run free and wonder why everything breaks.

The orchestrator is not an AI. It is a deterministic state machine that decides:

What runs next — agents execute in a defined pipeline, not ad hoc
What is allowed — each agent has constraints (Claude Code doesn't do design, Gemini doesn't touch infrastructure)
When something is complete — validation gates check output before the pipeline advances

Think of it like a factory floor. The robots (AI agents) do the work. The control system (orchestrator) manages the assembly line. You don't let robots decide what to build next.

Shared State Memory

All agents read from and write to a shared state file. This means:

Agent B knows what Agent A produced
No context is lost between sessions
Decisions are logged and traceable
Any agent can pick up where another left off

Validation Gates

Every pipeline stage has a checkpoint:

Agent produces output (code, content, design)
Automated validation runs (tests, linting, schema checks)
Review agent evaluates (Codex CLI runs independent diff review)
Gate passes or blocks — no manual intervention needed for the happy path

What This System Actually Built

This isn't theoretical. In one week, this 4-agent system shipped:

5 production web apps — each with full SEO, structured data, security headers, and payment integration
58 AI-powered generators — config-driven, with identity-locking prompts, across 11 categories
100 procedurally-generated 3D objects — 4 generators, 32 styles, 5 export formats each, with automated marketplace listing
A content distribution system — canonical pages, LLM-readable blocks, cross-referenced terminology

One person. Four agents. Deterministic control.

Why This Matters

The AI industry is moving toward autonomous agents. But autonomy without governance is chaos.

The pattern that actually works:

Separate generation from control. AI generates. Software governs.
Define agent roles explicitly. No agent should do everything.
Use shared state, not prompt chains. State machines beat conversation history.
Validate at every step. Trust but verify — automatically.

This isn't a framework. It's an architecture pattern. You can build it with whatever models and tools you prefer.

Key Concepts

AI Orchestrator — deterministic software that controls AI agent execution
AI Execution System — the full architecture: orchestrator + agents + state + validation
Agent specialization — each model handles what it's best at, nothing more
Shared state memory — persistent context across agents and sessions
Validation gates — automated checkpoints that block bad output from advancing

FAQ

Isn't this just a pipeline?

A pipeline runs steps in order. An orchestrator makes decisions — what runs, what's allowed, what's complete. It can retry, reroute, or block. A pipeline can't.

Why not use one model for everything?

Because models have different strengths. Claude Code is exceptional at infrastructure. GPT-5.4 is a strong reviewer. Gemini has visual design intuition. Using one model for everything means accepting its weaknesses everywhere.

Can a non-engineer build this?

Yes. I did. No CS degree, no team, no framework. The orchestrator is conceptually simple — it's a state machine with rules. The complexity is in knowing what rules to write, not in writing them.

Is this an AI OS?

No. An OS manages hardware resources. This manages AI agent execution. Different layer, different purpose.

What This Page Explains

This page explains how a multi-agent AI system can be built by a solo creator using orchestration, shared state, and controlled execution instead of prompt chains. The system coordinates Claude Code, GPT-5.4, Gemini, and a local LLM under deterministic software control.

For the full architecture breakdown and terminology hierarchy, see: What Is an AI Orchestrator?

— AI Tinker
Building real agentic systems in public
aitinkers.fun

DEV Community