UC Jung

Posted on Mar 19

Sub Agent that executes Claude AI CLI requirements in a work-pipeline

#claudecode #ai #agents #opensource

TL;DR

I built uctm (Universal Claude Task Manager) — an npm package that installs 6 specialized subagents into Claude Code. You type one request, and it automatically plans the work, builds the code, verifies it, and commits — all through structured XML messaging.

npm install -g uctm
cd your-project && uctm init
claude
# Type: [new-feature] Add user authentication with JWT
# → Pipeline runs automatically

GitHub: UCJung/uc-taskmanager-claude-agent
npm: uctm

The Problem

Claude Code is powerful, but when you ask it to build something complex, it tries to do everything in one shot. For simple tasks, that works fine. But for anything with multiple files, dependencies between components, or the need for proper testing — things get messy.

I wanted a system where:

Complex tasks get broken down into manageable pieces
Each piece gets built, tested, and committed individually
Context doesn't explode as tasks pile up
The whole thing runs with zero external dependencies

The Architecture: 6 Agents, 1 Pipeline

uctm uses 6 specialized subagents, each with a single responsibility:

User Request
  │
  Main Claude (orchestrator)
  │
  ├── router      → Analyzes request, picks execution mode
  ├── planner     → Decomposes into TASKs with dependency DAG
  ├── scheduler   → Manages DAG, dispatches ready TASKs
  ├── builder     → Writes the actual code
  ├── verifier    → Runs tests, lint, acceptance checks
  └── committer   → Writes result report, git commits

Key constraint: Claude Code subagents cannot call other subagents. So Main Claude (the CLI terminal itself) acts as the orchestrator, calling each agent in sequence and passing messages between them.

3 Execution Modes

Not every request needs all 6 agents. The router decides:

Mode	When	What Happens
direct	Simple change, no tests needed	Router handles everything alone
pipeline	Needs testing, single domain	router → builder → verifier → committer
full	Complex, multi-file, dependencies	All 6 agents with DAG management

The mode is determined by analyzing the request against rules in .agent/router_rule_config.json — fully customizable per project.

How Agents Talk: Structured XML

This is where it gets interesting. Instead of passing free-form text between agents, uctm uses structured XML messages.

Router → Builder (Dispatch XML)

<dispatch to="builder" work="WORK-05" task="TASK-00"
          execution-mode="pipeline">
  <context>
    <project>my-app</project>
    <language>ko</language>
    <plan-file>works/WORK-05/PLAN.md</plan-file>
  </context>
  <task-spec>
    <file>works/WORK-05/TASK-00.md</file>
    <title>Add JWT authentication middleware</title>
    <action>implement</action>
    <description>Create auth middleware with
    token validation...</description>
  </task-spec>
  <previous-results/>
</dispatch>

Builder → Main Claude (Task Result XML)

<task-result work="WORK-05" task="TASK-00"
             agent="builder" status="PASS">
  <summary>JWT auth middleware implemented</summary>
  <files-changed>
    <file action="created" path="src/middleware/auth.js">
      JWT verification middleware
    </file>
    <file action="created" path="tests/auth.test.js">
      12 test cases for auth flows
    </file>
  </files-changed>
  <verification>
    <check name="test" status="PASS">12/12 passed</check>
  </verification>
  <context-handoff from="builder" detail-level="FULL">
    <what>Auth middleware with JWT validation</what>
    <why>Used jsonwebtoken for RS256 support</why>
    <caution>Requires JWT_SECRET env variable</caution>
    <incomplete>None</incomplete>
  </context-handoff>
</task-result>

Why XML? It's parseable, validatable, and token-efficient. Each agent knows exactly what to expect and what to return.

The Sliding Window: Solving Context Explosion

Here's the problem with multi-task pipelines: if TASK-03 needs to know what TASK-00, TASK-01, and TASK-02 did, context grows linearly. With 10+ TASKs, you're burning tokens on history instead of actual work.

uctm solves this with a sliding window on context-handoff:

Distance	Detail Level	What's Passed
Previous TASK	FULL	what + why + caution + incomplete (~150 tokens)
2 TASKs ago	SUMMARY	what only (~50 tokens)
3+ TASKs ago	DROP	Nothing (0 tokens)

This keeps context transfer at a constant ~200 tokens regardless of how many TASKs have completed. In a 10-TASK WORK, this saves ~75,000 tokens compared to passing everything.

TASK-00 → TASK-01 → TASK-02 → TASK-03

TASK-03's builder receives:
  TASK-02 context-handoff  ← FULL   (what/why/caution/incomplete)
  TASK-01 context-handoff  ← SUMMARY (what only)
  TASK-00 context-handoff  ← DROP    (not transferred)

Real Pipeline Run: Simple Calculator

Here's an actual pipeline run I captured to verify the message format. The request: "Create a simple calculator with add, subtract, multiply, divide."

Step 1: Router

Input:  [new-feature] Create calc.js with 4 arithmetic functions
Output: execution-mode=pipeline + dispatch XML + PLAN.md + TASK-00.md

Router analyzed the request, determined it needs testing (pipeline mode), created the plan files, and generated dispatch XML for builder.

Step 2: Builder

Input:  dispatch XML from router
Output: task-result XML with context-handoff
        Created: calc.js (4 functions), calc.test.js (8 tests)
        Verification: 8/8 tests PASS

Step 3: Verifier

Input:  builder's task-result XML
Output: Independent verification — re-ran all 8 tests
        Checked: progress file, acceptance criteria, file existence
        Result: ALL PASS

Step 4: Committer

Input:  builder + verifier task-result XMLs
Output: TASK-00_result.md written
        Git commit: feat(TASK-00): calc.js ESM module
        Commit hash returned

Total: 136K tokens, 96 tool calls, 344 seconds. Every message between agents followed the exact XML spec.

Specification-Driven Development

uctm follows an SDD (Specification-Driven Development) philosophy:

Requirements → Architecture → Design are the real assets.
Code is just output.

The spec documents define the XML schemas, context-handoff rules, and pipeline behavior. Agents are .md files — pure prompt definitions. Zero runtime code, zero dependencies.

This means:

Portable: Works in any project with uctm init
Customizable: Edit .agent/router_rule_config.json to tune execution-mode rules
Transparent: Every WORK generates PLAN.md, TASK files, progress tracking, and result reports

Getting Started

# Install globally
npm install -g uctm

# Initialize in your project
cd your-project
uctm init

# Start Claude Code (bypass mode for uninterrupted pipeline)
claude --dangerously-skip-permissions

# Trigger the pipeline with a tagged request
# [new-feature] Add a REST API for user management
# [bugfix] Fix the race condition in data loader
# [enhancement] Improve search query performance

What uctm init does:

Copies 12 agent .md files to .claude/agents/
Creates .agent/router_rule_config.json
Appends agent invocation rules to CLAUDE.md
Creates works/ directory for WORK files

What's Next

MCP Server Integration (Phase 2 complete, Phase 3 planned) — HTTP transport for external tool integration
Multi-project orchestration — Run pipelines across dependent repos
Dashboard visualization — Real-time pipeline monitoring

If you work with Claude Code and find yourself managing complex multi-file tasks, give uctm a try. It's free, open-source, and takes 30 seconds to install.