<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: vikash ruhil</title>
    <description>The latest articles on DEV Community by vikash ruhil (@vikash_ruhil_a43b452d4a88).</description>
    <link>https://dev.to/vikash_ruhil_a43b452d4a88</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3760336%2F0557af6d-a39f-4caf-a4d9-da64eedb1b1a.png</url>
      <title>DEV Community: vikash ruhil</title>
      <link>https://dev.to/vikash_ruhil_a43b452d4a88</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/vikash_ruhil_a43b452d4a88"/>
    <language>en</language>
    <item>
      <title>How I Built an 8-Agent AI Team That Ships Code Autonomously</title>
      <dc:creator>vikash ruhil</dc:creator>
      <pubDate>Sun, 08 Feb 2026 16:50:54 +0000</pubDate>
      <link>https://dev.to/vikash_ruhil_a43b452d4a88/how-i-built-an-8-agent-ai-team-that-ships-code-autonomously-2j0h</link>
      <guid>https://dev.to/vikash_ruhil_a43b452d4a88/how-i-built-an-8-agent-ai-team-that-ships-code-autonomously-2j0h</guid>
      <description>&lt;h2&gt;
  
  
  A plugin system where AI agents plan, build, review, and deploy — like a real engineering team
&lt;/h2&gt;




&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flux5vekcqyk41r0z8hdb.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Flux5vekcqyk41r0z8hdb.jpg" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h3&gt;
  
  
  The Problem Nobody Talks About
&lt;/h3&gt;

&lt;p&gt;Every developer using AI assistants hits the same wall.&lt;/p&gt;

&lt;p&gt;You ask an AI to "add user authentication." It generates code. Maybe good code. But it doesn't know your project's patterns. It doesn't check if the code is secure. It doesn't create a pull request. It doesn't track what was done or what comes next.&lt;/p&gt;

&lt;p&gt;You're still the glue holding everything together.&lt;/p&gt;

&lt;p&gt;I wanted something different. I wanted AI agents that work like a real engineering team — where one agent plans the work, another builds it, another reviews it, and another tries to break it. Where knowledge accumulates across sessions. Where "add user authentication" triggers an entire workflow that ends with a reviewed, tested pull request.&lt;/p&gt;

&lt;p&gt;So I built &lt;strong&gt;AI Agent Manager&lt;/strong&gt;.&lt;/p&gt;




&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fro4tzy66vsa0x7618ywu.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fro4tzy66vsa0x7618ywu.jpg" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h3&gt;
  
  
  What Is AI Agent Manager?
&lt;/h3&gt;

&lt;p&gt;AI Agent Manager is a &lt;strong&gt;Claude Code plugin&lt;/strong&gt; that provides 8 specialized AI agents for software development. Each agent has a distinct role, distinct tools, and a distinct personality — just like teammates on a real engineering team.&lt;/p&gt;

&lt;p&gt;It's not a framework. It's not a SaaS product. It's a set of Markdown prompt files with YAML frontmatter that plug directly into Claude Code's CLI. No servers. No APIs. No infrastructure. Just agents that understand your codebase and get work done.&lt;/p&gt;

&lt;p&gt;Here's what makes it different from "just using AI":&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Agents have roles.&lt;/strong&gt; A Code Reviewer doesn't write features. A Worker doesn't do planning.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Agents have memory.&lt;/strong&gt; They remember your project's patterns across sessions.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Agents collaborate.&lt;/strong&gt; The output of one becomes the input of the next.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Work is tracked.&lt;/strong&gt; Every task, subtask, and review decision is recorded.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Execution is parallel.&lt;/strong&gt; Multiple workers build simultaneously using git worktrees.&lt;/li&gt;
&lt;/ul&gt;




&lt;h3&gt;
  
  
  Meet the Team
&lt;/h3&gt;




&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fgh56ukwmze4q31ndowyv.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fgh56ukwmze4q31ndowyv.jpg" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h4&gt;
  
  
  1. Launch Pad — The Readiness Planner
&lt;/h4&gt;

&lt;p&gt;Before any code is written, Launch Pad takes your raw goal and prepares it for execution. It runs a 6-phase process:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;VALIDATE&lt;/strong&gt; &amp;gt; &lt;strong&gt;DISCOVER&lt;/strong&gt; &amp;gt; &lt;strong&gt;ANALYZE&lt;/strong&gt; &amp;gt; &lt;strong&gt;DECOMPOSE&lt;/strong&gt; &amp;gt; &lt;strong&gt;PACKAGE&lt;/strong&gt; &amp;gt; &lt;strong&gt;REFINE &amp;amp; SAVE&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Give it "add user authentication" and it will:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Scan your codebase to understand existing patterns&lt;/li&gt;
&lt;li&gt;Identify which files will be impacted&lt;/li&gt;
&lt;li&gt;Estimate parallelism opportunities&lt;/li&gt;
&lt;li&gt;Produce a "Supervisor-Ready Brief" — a structured document that tells the Supervisor exactly what to do&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Think of it as the architect who draws blueprints before construction begins.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;/launch-pad goal: "add JWT authentication with refresh tokens"
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  2. Supervisor — The Parallel Orchestrator
&lt;/h4&gt;

&lt;p&gt;The Supervisor is the project manager. It takes a task (or a Launch Pad brief) and drives it to completion through 6 phases:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;INIT&lt;/strong&gt; &amp;gt; &lt;strong&gt;ACQUIRE&lt;/strong&gt; &amp;gt; &lt;strong&gt;PLAN&lt;/strong&gt; &amp;gt; &lt;strong&gt;EXECUTE&lt;/strong&gt; &amp;gt; &lt;strong&gt;FINALIZE&lt;/strong&gt; &amp;gt; &lt;strong&gt;LOOP&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The magic is in the EXECUTE phase. The Supervisor analyzes subtasks, identifies which can run in parallel, spins up isolated &lt;strong&gt;git worktrees&lt;/strong&gt;, and dispatches Workers to build simultaneously. No file conflicts. No merge chaos.&lt;/p&gt;

&lt;p&gt;When workers finish, it merges everything sequentially, runs reviews, creates a PR, and moves to the next task.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;/supervisor job: .supervisor/jobs/2025-01-15-jwt-auth.md
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  3. Context-Keeper — The Memory Manager
&lt;/h4&gt;

&lt;p&gt;Every system needs state management. The Context-Keeper is the sole writer of the Supervisor's state file. It runs on a fast, lightweight model (Haiku) with a 3-turn limit — just enough to read, update, and confirm.&lt;/p&gt;

&lt;p&gt;This externalized state means the Supervisor itself holds only ~800 tokens of context, leaving room for actual work.&lt;/p&gt;

&lt;h4&gt;
  
  
  4. Worker — The Builder
&lt;/h4&gt;

&lt;p&gt;Workers are the hands. Each one operates in an isolated git worktree, implementing a single subtask. They have no access to git operations (no commits, no pushes) — they just write code. This isolation is intentional: a Worker can't accidentally break the main branch.&lt;/p&gt;

&lt;p&gt;When done, they produce a structured &lt;code&gt;WORKER_RESULT&lt;/code&gt; block that the Supervisor validates.&lt;/p&gt;

&lt;h4&gt;
  
  
  5. Product Owner — The Requirements Translator
&lt;/h4&gt;

&lt;p&gt;Vague requirements kill projects. The Product Owner takes business problems and translates them into structured user stories with acceptance criteria in Given/When/Then format.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;/product-owner problem: "users are abandoning checkout"
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;It reads your domain context, runs discovery, and produces stories that any developer (human or AI) can implement.&lt;/p&gt;

&lt;h4&gt;
  
  
  6. Orchestrator — The Task Architect
&lt;/h4&gt;

&lt;p&gt;The Orchestrator breaks goals into dependency graphs:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;EPIC&lt;/strong&gt; — The big feature&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;TASK&lt;/strong&gt; — Implementation work (30-60 minutes each)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;SUBTASK&lt;/strong&gt; — Review gates that block the next task&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Every implementation task gets a paired review subtask. You can't skip reviews. This is quality enforcement by design.&lt;/p&gt;

&lt;h4&gt;
  
  
  7. Code Reviewer — The Quality Gatekeeper
&lt;/h4&gt;

&lt;p&gt;After code is written, the Code Reviewer inspects it against your project's patterns. It outputs one of three decisions:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;PASS&lt;/strong&gt; — Ship it&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;FAIL&lt;/strong&gt; — Fix these issues and come back&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;NEEDS_HUMAN&lt;/strong&gt; — I found something that requires human judgment&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;It checks type safety, security, performance, test coverage, and pattern alignment. And because it has persistent memory, it gets better at reviewing your specific codebase over time.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;/code-reviewer src/auth/
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h4&gt;
  
  
  8. Red Team Reviewer — The Adversarial Auditor
&lt;/h4&gt;

&lt;p&gt;The Red Team Reviewer's job is to break things. It attacks assumptions, verifies claims against actual documentation, and explores 6 attack vectors to find what would fail in production.&lt;/p&gt;

&lt;p&gt;Findings are rated: &lt;strong&gt;FATAL&lt;/strong&gt;, &lt;strong&gt;CRITICAL&lt;/strong&gt;, &lt;strong&gt;WARNING&lt;/strong&gt;, &lt;strong&gt;WEAKNESS&lt;/strong&gt;.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;/red-team-reviewer --focus security
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;






&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fid0h6ycvnryuy2c0jxsu.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fid0h6ycvnryuy2c0jxsu.jpg" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h3&gt;
  
  
  The Architecture: Surprisingly Simple
&lt;/h3&gt;

&lt;p&gt;Here's what surprises people: the entire system is just &lt;strong&gt;Markdown files&lt;/strong&gt;.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;ai-agent-manager-plugin/
├── agents/           # 8 Markdown prompt files
├── commands/         # Slash command entry points
├── skills/           # 35 focused implementation guides
└── hooks/            # Quality gate automation
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Each agent is a &lt;code&gt;.md&lt;/code&gt; file with YAML frontmatter that specifies:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Which tools it can use&lt;/li&gt;
&lt;li&gt;Which model it runs on&lt;/li&gt;
&lt;li&gt;Which skills are pre-loaded&lt;/li&gt;
&lt;li&gt;Whether it has persistent memory
&lt;/li&gt;
&lt;/ul&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight yaml"&gt;&lt;code&gt;&lt;span class="nn"&gt;---&lt;/span&gt;
&lt;span class="na"&gt;tools&lt;/span&gt;&lt;span class="pi"&gt;:&lt;/span&gt; &lt;span class="pi"&gt;[&lt;/span&gt;&lt;span class="nv"&gt;Read&lt;/span&gt;&lt;span class="pi"&gt;,&lt;/span&gt; &lt;span class="nv"&gt;Glob&lt;/span&gt;&lt;span class="pi"&gt;,&lt;/span&gt; &lt;span class="nv"&gt;Grep&lt;/span&gt;&lt;span class="pi"&gt;,&lt;/span&gt; &lt;span class="nv"&gt;Bash&lt;/span&gt;&lt;span class="pi"&gt;,&lt;/span&gt; &lt;span class="nv"&gt;Write&lt;/span&gt;&lt;span class="pi"&gt;,&lt;/span&gt; &lt;span class="nv"&gt;Edit&lt;/span&gt;&lt;span class="pi"&gt;]&lt;/span&gt;
&lt;span class="na"&gt;model&lt;/span&gt;&lt;span class="pi"&gt;:&lt;/span&gt; &lt;span class="s"&gt;sonnet&lt;/span&gt;
&lt;span class="na"&gt;memory&lt;/span&gt;&lt;span class="pi"&gt;:&lt;/span&gt; &lt;span class="s"&gt;project&lt;/span&gt;
&lt;span class="na"&gt;skills&lt;/span&gt;&lt;span class="pi"&gt;:&lt;/span&gt;
  &lt;span class="pi"&gt;-&lt;/span&gt; &lt;span class="s"&gt;supervisor-readiness&lt;/span&gt;
  &lt;span class="pi"&gt;-&lt;/span&gt; &lt;span class="s"&gt;context-setup&lt;/span&gt;
  &lt;span class="pi"&gt;-&lt;/span&gt; &lt;span class="s"&gt;quality-checklist&lt;/span&gt;
&lt;span class="nn"&gt;---&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;No Docker. No Kubernetes. No microservices. Just prompt engineering with structure.&lt;/p&gt;




&lt;h3&gt;
  
  
  Skills: Reusable Knowledge Packets
&lt;/h3&gt;

&lt;p&gt;Skills are the secret weapon. Instead of agents re-discovering patterns every session, skills pre-inject focused knowledge at spawn time.&lt;/p&gt;

&lt;p&gt;There are 35 skills covering:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Framework patterns&lt;/strong&gt; — NestJS, Next.js, API Gateway&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Workflow patterns&lt;/strong&gt; — state management, async orchestration, context summarization&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Quality patterns&lt;/strong&gt; — commit conventions, review checklists, pattern detection&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Testing patterns&lt;/strong&gt; — Playwright E2E&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Database patterns&lt;/strong&gt; — TypeORM, Drizzle ORM, MySQL&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;When the Supervisor spawns, it already knows 5 skills. The Code Reviewer already knows quality criteria. No file reads needed. No context wasted.&lt;/p&gt;




&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2fkc4xe4okxxpeot5z0o.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2fkc4xe4okxxpeot5z0o.jpg" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h3&gt;
  
  
  Parallel Execution: The Git Worktree Trick
&lt;/h3&gt;

&lt;p&gt;This is the technical insight that makes everything work.&lt;/p&gt;

&lt;p&gt;When the Supervisor needs to run 3 workers simultaneously, it can't have them all editing &lt;code&gt;src/auth/login.ts&lt;/code&gt; at the same time. Traditional approaches use file locks or merge strategies. I used &lt;strong&gt;git worktrees&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;A git worktree is a separate working directory linked to the same repository. Each worker gets its own worktree, its own branch, its own filesystem. They can build in parallel without knowing about each other.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;project/                    # Main worktree (Supervisor)
../project-BD-12a/          # Worker A worktree
../project-BD-12b/          # Worker B worktree (blocked, waiting)
../project-BD-12c/          # Worker C worktree
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;When workers finish, the Supervisor merges branches sequentially into the feature branch. Conflicts are rare because the Orchestrator designed the subtasks to touch different files.&lt;/p&gt;

&lt;p&gt;And for simple tasks? A single subtask skips worktrees entirely and builds directly. No overhead when it's not needed.&lt;/p&gt;




&lt;h3&gt;
  
  
  Persistent Memory: Agents That Learn
&lt;/h3&gt;

&lt;p&gt;Four agents have persistent memory:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Launch Pad&lt;/strong&gt; — Remembers common file impacts per goal type&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Code Reviewer&lt;/strong&gt; — Remembers recurring issues and codebase conventions&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Red Team Reviewer&lt;/strong&gt; — Remembers past vulnerabilities and attack patterns&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Product Owner&lt;/strong&gt; — Remembers domain terminology and stakeholder preferences&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Memory is stored in &lt;code&gt;.claude/agent-memory/&lt;/code&gt; and accumulates automatically. The Code Reviewer that's seen your codebase 50 times catches things a fresh reviewer never would.&lt;/p&gt;




&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0r6qhzxxfbpeywjtd2ak.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F0r6qhzxxfbpeywjtd2ak.jpg" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h3&gt;
  
  
  Plan-First Philosophy
&lt;/h3&gt;

&lt;p&gt;The biggest lesson I learned building this system: &lt;strong&gt;planning is not overhead — it's the highest-leverage activity.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Early versions jumped straight into execution. The Supervisor would take a goal, generate tasks, and start building. It worked for simple things. For complex features, it produced fragmented, inconsistent code.&lt;/p&gt;

&lt;p&gt;The Launch Pad agent changed everything. By spending 60 seconds analyzing the codebase, estimating file impacts, and packaging a structured brief, the execution phase became dramatically more reliable.&lt;/p&gt;

&lt;p&gt;The workflow is now:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Raw goal → Launch Pad → Supervisor-Ready Brief → Supervisor → Shipped PR
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;That brief is typically 200-400 lines of structured analysis. It tells the Supervisor:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Exactly which files to touch&lt;/li&gt;
&lt;li&gt;Which subtasks can run in parallel&lt;/li&gt;
&lt;li&gt;What patterns to follow&lt;/li&gt;
&lt;li&gt;What risks to watch for&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The Supervisor then skips its own discovery phases (saving ~500 tokens of context) and goes straight to execution.&lt;/p&gt;




&lt;h3&gt;
  
  
  Quality Gates: Trust But Verify
&lt;/h3&gt;

&lt;p&gt;Every piece of code goes through at least two quality gates:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Gate 1 — Plugin Hooks&lt;/strong&gt; (automated, no extra agents):&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;code&gt;SubagentStop&lt;/code&gt;: Verifies workers produced valid results&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;TaskCompleted&lt;/code&gt;: Prevents premature task closure&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Gate 2 — Code Reviewer&lt;/strong&gt; (full review):&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Pattern matching against your codebase&lt;/li&gt;
&lt;li&gt;Security checks&lt;/li&gt;
&lt;li&gt;Clear PASS/FAIL/NEEDS_HUMAN decision&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Gate 3 — Red Team Reviewer&lt;/strong&gt; (optional, pre-launch):&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Adversarial audit&lt;/li&gt;
&lt;li&gt;6 attack vector exploration&lt;/li&gt;
&lt;li&gt;Severity-rated findings&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The hook system is lightweight — it uses a fast model with a 30-second timeout. Just enough to catch obvious failures before the expensive review step.&lt;/p&gt;




&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fw2y2bbwl5gqs1j7ry45e.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fw2y2bbwl5gqs1j7ry45e.jpg" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h3&gt;
  
  
  What I'd Do Differently
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;1. Start with the state management.&lt;/strong&gt; I retrofitted externalized state in v3. It should have been there from v1. The Context-Keeper pattern (dedicated agent for state mutations) solved coordination bugs that plagued earlier versions.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;2. Make skills smaller.&lt;/strong&gt; Some skills are 200+ lines. The best ones are under 100. Focused knowledge beats comprehensive documentation.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;3. Test the prompts, not just the code.&lt;/strong&gt; Agent behavior is determined by prompts, and prompts are code. I should have built prompt regression tests earlier.&lt;/p&gt;




&lt;h3&gt;
  
  
  Getting Started
&lt;/h3&gt;

&lt;p&gt;AI Agent Manager is open source and works with any Claude Code installation.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;GitHub:&lt;/strong&gt; &lt;a href="https://github.com/vikashruhilgit/ai-agent-manager" rel="noopener noreferrer"&gt;github.com/vikashruhilgit/ai-agent-manager&lt;/a&gt;&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# From the ai-agent-manager directory&lt;/span&gt;
/plugin marketplace add ./
/plugin &lt;span class="nb"&gt;install &lt;/span&gt;ai-agent-manager-plugin@ai-agent-manager-marketplace

&lt;span class="c"&gt;# In your project&lt;/span&gt;
/launch-pad goal: &lt;span class="s2"&gt;"describe what you want to build"&lt;/span&gt;
/supervisor  &lt;span class="c"&gt;# Let it run&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;It works with any programming language, any framework, any project structure. The agents read your &lt;code&gt;CLAUDE.md&lt;/code&gt; to understand your specific patterns.&lt;/p&gt;




&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fo4upmk9txrtukq5ml9qz.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fo4upmk9txrtukq5ml9qz.jpg" alt=" " width="800" height="800"&gt;&lt;/a&gt;&lt;/p&gt;




&lt;h3&gt;
  
  
  The Bigger Picture
&lt;/h3&gt;

&lt;p&gt;AI Agent Manager isn't about replacing developers. It's about giving developers a team.&lt;/p&gt;

&lt;p&gt;Most of us work solo or in small teams. We context-switch between planning, coding, reviewing, and debugging. Each switch costs focus. Each role requires a different mindset.&lt;/p&gt;

&lt;p&gt;What if your planning mindset was always available? What if code review happened instantly after every change? What if security audits were a command away, not a quarterly event?&lt;/p&gt;

&lt;p&gt;That's what an AI team gives you. Not replacement. Amplification.&lt;/p&gt;

&lt;p&gt;The agents aren't perfect. They hallucinate sometimes. They miss edge cases. They need human judgment for the hard decisions (that's what NEEDS_HUMAN is for).&lt;/p&gt;

&lt;p&gt;But they're tireless, consistent, and they remember everything. And they're getting better every session.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;AI Agent Manager is open source. Star it on GitHub:&lt;/em&gt; &lt;a href="https://github.com/vikashruhilgit/ai-agent-manager" rel="noopener noreferrer"&gt;github.com/vikashruhilgit/ai-agent-manager&lt;/a&gt;&lt;/p&gt;

</description>
      <category>agents</category>
      <category>ai</category>
      <category>automation</category>
      <category>showdev</category>
    </item>
  </channel>
</rss>
