David Hoze

Posted on Mar 8

Introducing claude-collab

#agents #ai #cli #showdev

Hi there.

I've been working on my project — bit — and wanted to work on several ideas at once. I tried doing it with 3 working directories, git pushing and pulling and merging... it really gave me a headache. So I started using agents in the same directory, and telling them not to interfere with each other. It worked OK, but at some point it got hard always coordinating them, making sure they don't step on each other's toes. That's why I built claude-collab.

Implemented in Haskell, it's a single binary CLI tool that gives multiple Claude Code agents a way to coordinate. No server, no database, just the filesystem. The idea is simple: give agents two primitives — claim a file and commit your work — and a reservation system for resources you can't share. That's basically it. The agents are smart enough to figure out the rest.

The Problem

Picture this: you have Agent A working on auth, Agent B working on the config system, and Agent C writing tests. Sounds great until Agent A and Agent B both decide to touch src/utils.hs at the same time. Or Agent C runs run-test-suite.sh while Agent A is mid-refactor and everything blows up. Or two agents commit at the same time and you get a mess.

You could just tell them "hey don't touch each other's files" but... they're agents. They don't always listen. And even when they do, there's no structured way for them to know what the other agents are doing.

How It Works

Setup: One Command

Run claude-collab install in your project root and you're done. It installs three Claude Code hooks that automate the entire workflow:

SessionStart — auto-registers each agent when a session opens
PreToolUse (Edit/Write) — auto-claims files before any edit
SessionEnd — auto-cleans up when a session closes

This means agents don't need to remember to register or claim files. The only manual step is committing — and that's intentional. You want agents to commit deliberately, after they finished a feature, not after every edit.

Agent Registration

Under the hood, each agent registers itself when its session starts:

claude-collab init --name auth-refactor

This gives it a unique hash (like a3f8b201) and a human-readable name. The name is stored in the registry and can be used as an alias for the hash in all subsequent commands. So instead of claude-collab commit a1b2c3d4 -m "msg" the agent can do claude-collab commit auth-refactor -m "msg".

Claiming Files

Before an agent edits a file, it claims it (automatically, via the hook):

claude-collab files claim a3f8b201 src/auth.ts

If nobody else has it — great, you got it. If another agent already claimed it? You get a rejection (exit code 1), which forces the agent to pause and negotiate. This is the key design choice: the friction is intentional. It creates a natural "hey, let's talk about this" moment.

The rejected agent can then message the other agent using Claude Code's native messaging and either:

Wait for the other agent to finish
Co-claim the file with --shared if they're working on different parts

# After negotiating with the other agent...
claude-collab files claim a3f8b201 src/auth.ts --shared

Committing

When an agent is done with its work:

claude-collab commit a3f8b201 -m "refactor auth validation"

This stages and commits only the files that agent claimed. No accidentally committing someone else's work. The committed files are automatically unclaimed, freeing them up for others.

But here's where it gets interesting — what about co-claimed files? If two agents both edited the same file, you can't just have them both commit separately.

So I used git's staging mechanism. Instead of trying to figure out how to commit only one agent's changes and not the other, claude-collab simply stages the first agent's files and waits for the other agent to finish. When the second agent commits, it triggers the real git commit with everyone's changes combined:

[a3f8b201] refactor token validation | [d4e5] add rate limiting

One clean commit with both agents' work. No merge conflicts, no manual intervention. And the first agent doesn't need to sit around waiting — it can claim new files, make more edits, and even run another commit for different work while the co-claimed files stay staged in the background.

Resource Reservations

Some operations can't run in parallel — two agents running the test suite at the same time would interfere with each other. So there's a reservation system:

# Reserve the test suite
claude-collab reserve a3f8b201 test

# Run your tests
run-test-suite.sh

# Release it
claude-collab release a3f8b201 test

After running tests, agents message each other with the results using Claude Code's native messaging, so others can skip redundant work.

If another agent already has the resource, reserve will wait (polling every 500ms) until it's free or times out. Each reservation has a TTL so if an agent crashes without releasing, the reservation expires. The TTL is configured per-resource in .claude/agents/resources.json.

Two resources come pre-configured: test and build. You can add your own by editing that file.

Under the Hood

Locking

All the coordination uses mkdir-based locks. mkdir is atomic on every major filesystem (NTFS, ext4, APFS), so it's a cheap and reliable mutex. There are two locks:

Git lock — serializes git commits so two agents can't commit simultaneously (30s stale timeout)
Reserve lock — protects reservation updates (5s stale timeout)

Crash Recovery

If an agent crashes mid-task, nothing deadlocks. Lock directories older than their stale timeout get force-broken by the next agent that needs them. Resource reservations expire after their TTL, so a crashed agent's reservation doesn't block others forever. And the SessionEnd hook runs cleanup automatically when a session closes — unclaiming files, releasing reservations, and removing the agent from the registry.

The one thing that doesn't auto-recover is staged co-claimed files. If Agent A stages changes for a two-phase commit and then crashes before Agent B commits, those files stay staged in the registry. But that's a deliberate trade-off — it's better to leave staged work intact for manual recovery than to silently discard it.

Atomic Writes

All JSON file writes (the registry, reservations, resources) use write-to-temp-then-rename. This prevents partial writes from corrupting the data if an agent crashes mid-write. On Windows, the destination is removed before renaming since Windows rename fails if the target already exists.

Putting It All Together

Here's what a typical multi-agent session looks like:

# Agent 1: auth work
claude-collab init --name auth-agent
claude-collab files claim auth-agent src/auth.ts src/session.ts
# ... edit files ...
claude-collab commit auth-agent -m "refactor auth flow"
claude-collab cleanup auth-agent

# Agent 2: tests (running concurrently)
claude-collab init --name test-agent
claude-collab reserve test-agent test
npm test
claude-collab release test-agent test
claude-collab cleanup test-agent

And if they need the same file? They negotiate through Claude Code's native messaging, co-claim, and the two-phase commit handles the rest.

The Design Philosophy

The whole thing is built on one idea: don't try to be smarter than the agents. Instead of building a complex locking system with priority queues and deadlock detection, just give agents simple tools and let them talk through Claude Code's native messaging. They're Claude Code instances — they can negotiate, delegate, and figure things out. The claim system creates the pause; the agents fill it with conversation.

A Working Example

Here's a real session where I used agents to fix test failures, and two of them ended up touching the same file. Pay attention to the bold note on binary-fixer — that's the co-claim and two-phase commit in action.

❯ OK, run another team to fix all failures

2 agents launched:

Agent	Task
`@export-fixer`	Fix export/import junction routing
`@binary-fixer`	Fix binary test edge cases

Team fix-failures is up with 3 agents:

Agent	Task	Failures
`export-fixer`	Export/import broken in junction mode	14 CLI test failures
`binary-fixer`	Binary test edge cases	4 failures (restore, mv subdir, merge -X theirs)
`init-fixer`	init-nested test 8	1 CLI test failure

Skipping network-remote.test (9 failures) — those require \\tsclient UNC path which is infrastructure-only.

That's 19 fixable failures across the three agents. I'll relay results as they come in.

❯ Don't skip network-remote.test.

Added network-fixer to handle the 9 network-remote.test failures. Now all 28 failures are covered:

Agent	Failures
`export-fixer`	14 (export + import)
`binary-fixer`	4 (restore, mv, merge)
`init-fixer`	1 (init-nested)
`network-fixer`	9 (network-remote)

@init-fixer — Fixed init-nested test 8, committed cf618cd. Test expectation was outdated for the gitfile change. ✅

@binary-fixer — All 3 binary test fixes done and verified. 1 real code bug (bit mv to subdirectory) and 2 test bugs. The Passthrough.hs commit is blocked on a co-claim with d6d55282 — likely export-fixer who also needs to edit Passthrough.hs. When export-fixer commits, both changes will land together. ✅

@export-fixer — Fixed export/import/init junction routing, all tests pass. The isBitOnlyCommand guard is the right approach — keeps the junction early-exit but carves out bit-specific commands. ✅

@network-fixer — network-remote.test fixed, 17/17 pass, committed. Network tests now degrade gracefully — pass with real UNC when available, skip cleanly when not. ✅

Worked for 15m 59s

Results: 28 failures → 0

Agent	Failures Fixed	Type
`init-fixer`	1 (init-nested)	Test expectation updated for gitfile
`binary-fixer`	4 (restore, mv subdir, merge -X theirs)	1 code bug + 2 test bugs
`export-fixer`	14 (export + import)	Code bug — junction early-exit skipping bit-only commands
`network-fixer`	9 (network-remote)	Added graceful skip when `\\tsclient` unavailable

Total: 28 failures → 0. The full CLI suite now passes clean (minus cloud tests needing specific remotes).

All agents shut down. 🧹

What's Next

You can grab a pre-built binary — Windows or Linux x86_64 — or build from source with cabal build && cabal install. Once it's on your PATH, run claude-collab install in your project root. It sets up CLAUDE_COLLAB.md, the hooks, and adds a few lines to CLAUDE.md so your agents know how to use it.

Happy collaborating!

Top comments (1)

Some comments may only be visible to logged-in visitors. Sign in to view all comments.