Gerus Lab

Posted on May 10

From Zero to Production: Building a Claude-Powered Agency Stack in One Weekend

#ai #claude #productivity #webdev

From Zero to Production: Building a Claude-Powered Agency Stack in One Weekend

You've decided to build your agency around Claude. Smart move. The question isn't whether Claude is capable — it is. The question is how you run it without burning through your budget on API costs, setting up infrastructure that breaks on Friday night, or losing track of which client account is doing what.

This is the guide I wish existed when I started. No fluff, no theory — just a concrete weekend plan to go from "thinking about it" to a production-ready Claude stack.

Why Most Agencies Get This Wrong

The typical path looks like this: someone on the team signs up for Claude's consumer subscription, shares the login, things work fine for a week, then Anthropic flags the account for suspicious activity. Or you set up direct API access, usage spikes during a client demo, and you get a surprise $400 bill. Or you spin up a reverse proxy on Cloudflare Workers, it works great for a month, then Claude updates their bot detection and the whole thing breaks on a Sunday.

The real problem isn't Claude — it's the infrastructure around it. Most teams treat it as an afterthought and then rebuild from scratch every few months.

Here's a better approach.

The Stack You Actually Need

A production Claude setup for an agency has four components:

A reliable Claude access layer (the proxy — this is where most people fail)
Account isolation (each client gets their own context, ideally their own access)
Usage visibility (you need to know what's being used before it becomes a problem)
A repeatable onboarding flow (so you can add new clients without touching infrastructure)

Let's build each one.

Saturday Morning: The Access Layer

This is the decision that shapes everything else.

Option A: Direct Anthropic API
You pay per token. Great for experimentation, terrible for production agency work. If you have five clients running Claude daily, token costs compound fast. A power user running 50 Nexus sessions a week can easily hit $150–200/month in API costs alone. Multiply by five clients and you've got a $1,000/month infrastructure cost before you've made a dollar.

Option B: DIY proxy
Technically possible. You run LiteLLM or a custom proxy server, route through your own Claude API key, and charge clients a markup. The economics can work, but you're now a hosting company. Someone has to maintain the server, handle rate limits, debug auth errors, and respond when it goes down at 2am.

Option C: Managed proxy via ShadoClaw
ShadoClaw is a managed Claude API proxy built specifically for Nexus users and agencies. Flat monthly rate. No per-token billing surprises. Multiple account slots so each client gets isolated access. Infrastructure maintained by someone else.

Pricing: Solo at $29/month (1 account), Pro at $79/month (5 accounts), Team at $179/month (20 accounts). There's a free 3-day trial at shadoclaw.com — start there.

For an agency running 5 clients: $79/month versus potentially $500–1,000/month in direct API costs. The math isn't close.

Saturday Afternoon: Account Isolation

This one is non-negotiable. Different clients should not share Claude context, access, or usage visibility.

If you're using ShadoClaw's Pro or Team plan, each slot is an isolated proxy endpoint. Client A's OpenClaw instance connects through one endpoint, Client B through another. They don't share tokens, history, or billing surface. If Client A hammers the API during a launch sprint, it doesn't affect Client B's response times.

The setup flow:

Create a ShadoClaw account at shadoclaw.com
Choose Pro or Team based on client count
For each client, generate an isolated proxy endpoint from your dashboard
Give that endpoint to the client to configure in their OpenClaw settings

Each endpoint behaves exactly like a direct Claude API — the client doesn't need to change anything in their Nexus config except the API base URL and key.

What not to do: Don't give all clients the same API key and tell yourself you'll "keep an eye on usage." You won't. And when something goes wrong, you'll have no idea which client caused it.

Saturday Evening: Usage Visibility

You need two layers of visibility: yours and your client's.

Your view: Through ShadoClaw's dashboard, you see per-account usage across all your client slots. You can spot if one account is unusually heavy before it becomes a conversation. You can also use this for honest billing — if you're charging clients a flat retainer that includes Claude access, you want to know if one client is using 10x more than others.

Client's view: Your clients don't need access to your ShadoClaw dashboard. What they need is to see that their Claude access is working and understand roughly what they're getting. A simple status page or monthly usage summary (exported from your dashboard) is enough for most clients.

For agencies running OpenClaw for clients, set up a lightweight weekly report: pull the usage summary, include it in your regular client report, flag anything unusual. This turns a potential surprise into a talking point.

Sunday Morning: The Onboarding Flow

Here's where most agencies lose time. Every new client becomes a bespoke setup project. You end up spending two hours configuring the same things you've configured five times before.

Build a checklist and stick to it:

Client onboarding checklist:

[ ] Create ShadoClaw slot for this client
[ ] Generate and store proxy endpoint + API key (use your password manager, not a Slack DM)
[ ] Configure Nexus for client: set API base URL + key
[ ] Verify connection with a test prompt
[ ] Set up basic usage monitoring (check dashboard weekly)
[ ] Document client's Nexus workspace location and primary use cases
[ ] Brief client on fair use (heavy file analysis, large codebases = more tokens)

This takes about 20 minutes per client once you've done it once. Document the exact steps, not just the checklist — what fields to fill in, what values to use, what the success state looks like.

The second client should take half the time of the first. The fifth client should take 15 minutes.

Sunday Afternoon: Stress Testing

Before you call it production-ready, run your stack through some realistic scenarios.

Test 1: High volume
Have one account run 20–30 Nexus prompts in quick succession. Watch for rate limit errors. If you're hitting limits, understand why and whether it's expected behavior or a configuration issue.

Test 2: Isolation check
Make sure Client A's configuration genuinely can't access Client B's data. With ShadoClaw, the isolation is at the proxy level, so this is handled for you — but verify it anyway by confirming that each client's endpoint returns errors when used with another client's context.

Test 3: Recovery
What happens when the connection drops? OpenClaw should handle reconnection gracefully. Test this by temporarily disabling a proxy endpoint and then re-enabling it. Know what the client experience looks like and what, if anything, they need to do.

The Economics at Scale

Let's do the math for a 10-client agency.

Direct API route: 10 clients × $150/month average API spend = $1,500/month. Plus your time managing billing alerts, debugging token costs, and fielding "why is this expensive" questions from clients.

ShadoClaw Team plan: $179/month for up to 20 accounts. Per client: $17.90/month in infrastructure cost.

If you're charging clients $300–500/month for Claude-powered services, your infrastructure cost is under 6% of revenue. That's a healthy margin.

The math gets even better as you add clients — the fixed cost spreads across more accounts while revenue scales linearly.

Common Mistakes to Avoid

Mistake 1: Starting with the cheapest option and migrating later
Migration is a pain. If you start 10 clients on a setup that doesn't scale, you'll spend a weekend migrating instead of building. Start with the right architecture.

Mistake 2: Shared credentials
One key for all clients is a support nightmare. When something breaks, you don't know who caused it. When you need to revoke access for one client, you affect everyone.

Mistake 3: No documentation
Six months from now, you'll have forgotten how this was set up. Write it down. Specifically: what endpoint each client uses, what their primary use cases are, and what "normal" usage looks like for them.

Mistake 4: Treating Claude access as a commodity
Claude isn't a commodity. How you configure it, what tools your clients have access to, what context they're working with — these create real differentiation. Don't just give clients a key and call it done. Build a workflow around it.

What You Have at the End of the Weekend

By Sunday evening, you should have:

A managed Claude proxy setup with isolated accounts per client
A documented onboarding checklist you can run in 20 minutes
A basic usage monitoring routine
A stress-tested stack you understand and can debug

This is the foundation. You'll refine it as you go — but you'll refine it from a stable base, not from chaos.

Getting Started

The free 3-day trial at shadoclaw.com is the right first step. Set up your own account, configure Nexus, run it through your typical workflows. If it works for you, add clients. If something doesn't fit, you'll know before you've committed to anything.

ShadoClaw is built by Gerus-lab — an engineering studio that works on OpenClaw integrations, AI tooling, and agency infrastructure. If you're running into something more complex than a standard setup, that's the right place to look.

The weekend project is achievable. Start Saturday morning with the proxy decision and you'll have something worth building on by Sunday night.

ShadoClaw — Managed Claude proxy for OpenClaw agencies. Flat rate. No billing surprises. shadoclaw.com

DEV Community

From Zero to Production: Building a Claude-Powered Agency Stack in One Weekend

From Zero to Production: Building a Claude-Powered Agency Stack in One Weekend

Why Most Agencies Get This Wrong

The Stack You Actually Need

Saturday Morning: The Access Layer

Saturday Afternoon: Account Isolation

Saturday Evening: Usage Visibility

Sunday Morning: The Onboarding Flow

Sunday Afternoon: Stress Testing

The Economics at Scale

Common Mistakes to Avoid

What You Have at the End of the Weekend

Getting Started

Top comments (0)