Every professional services firm is adopting AI right now. Most are seeing speed gains — drafts in minutes, research in hours, code in seconds. Almost none are seeing proportional profit gains. Some are seeing the opposite: delivery gets faster, revenue stays flat, and margin pressure grows.
Why?
Because speed of generation is not leverage. And leverage — where profit actually comes from in a professional firm — was explained with painful clarity by David Maister over 30 years ago. His model didn't predict AI. But it explains precisely what AI breaks, what it doesn't, and where the money actually moves.
If you run a practice, lead a delivery org, or make decisions about how your firm sells and staffs work, this matters more than your AI adoption roadmap.
A Quick Maister Refresher
For those who read Managing the Professional Service Firm a decade ago (and those who keep meaning to), here's the core of it.
Maister observed that all professional work falls into three types:
| | Brains | Gray Hair | Procedure |
|---|---|---|---|
| What the client buys | Rare expertise | Experience & judgment | Speed & reliability |
| Leverage | Low | Medium | High |
| Price sensitivity | Low | Medium | High |
| Typical pyramid | Flat | Medium | Wide |
Leverage is the ratio of junior staff to senior staff. A firm's profit per partner grows in two ways: either you sell more expensive work (move up the scale) or you deliver the same work with a wider pyramid (more juniors per senior). That's it. That's the entire economics of professional services in one sentence.
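To make the pyramid arithmetic concrete, here's a minimal sketch with invented numbers — the rates, costs, and hours are illustrative assumptions, not benchmarks:

```python
# Illustrative only: every rate, cost, and ratio below is invented.
def profit_per_partner(partners, juniors_per_partner,
                       partner_rate, junior_rate,
                       partner_cost, junior_cost,
                       billable_hours=1600):
    """Profit per partner = (billed revenue - staff costs) / partners."""
    juniors = partners * juniors_per_partner
    revenue = billable_hours * (partners * partner_rate + juniors * junior_rate)
    costs = partners * partner_cost + juniors * junior_cost
    return (revenue - costs) / partners

# Same partners, same rates — only the width of the pyramid changes.
flat = profit_per_partner(10, 2, 600, 150, 400_000, 120_000)  # 2 juniors/partner
wide = profit_per_partner(10, 6, 600, 150, 400_000, 120_000)  # 6 juniors/partner
```

With these made-up inputs, widening the pyramid from 2 to 6 juniors per partner lifts profit per partner from $800k to $1.28M — nothing else changed. That's leverage doing the work.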
And here's the part everyone forgets: Maister noted that work naturally drifts down the scale. What was once brain surgery becomes, over time, more standardized, more procedural. Senior expertise gets codified into checklists, templates, training programs.
AI doesn't change this dynamic. It accelerates it dramatically.
Three Things AI Actually Breaks
A. Not your cost base — the nature of the service itself
AI pushes chunks of work down the Brains → Gray Hair → Procedure scale. What required senior attention yesterday can be partially standardized today and handed to a system instead of a junior.
But — and this is critical — it doesn't push all work down. Real services become hybrid: the discovery and judgment parts stay Gray Hair or Brains, while the synthesis, drafting, comparison, and QA parts shift toward Procedure. Profit emerges at the seam between them.
B. Not your people — the carrier of leverage
In a classic firm, leverage lives in the pyramid: juniors at the bottom producing work, seniors at the top selling judgment. AI compresses the bottom of the pyramid. Extraction, drafting, classification, first-pass review — these are increasingly done by a system, not a team of associates.
This doesn't eliminate leverage. It changes what carries it. Leverage stops being "how many juniors can I put under one senior" and becomes "how much delivery can I move into reliable, repeatable digital workflows while keeping quality and accountability."
C. Not your margin — your billing model
Here's the most uncomfortable shift. AI often raises productivity faster than profitability. If your firm does the same work 3x faster but still sells hours, you've just cut your revenue by two thirds. AI creates value only when a firm knows how to repackage speed into price, scope, throughput, or share of wallet.
The most vulnerable thing in your firm isn't your margin. It's the billable hour.
Work as a Graph
Here's where it gets interesting.
Think about what actually happens inside a project. It's not "a team works for 500 hours." It's a chain of transformations: someone takes an input — client data, a document, a set of requirements — and turns it into an intermediate artifact. Someone else picks up that artifact, transforms it further, and passes it on. Eventually, a final deliverable comes out.
This isn't a metaphor. It's a structure. In technical terms, it's a directed graph: a network of tasks where each task takes an input, produces an output, and passes it to the next node. If you've ever seen a flowchart or a project dependency diagram, you've seen a graph.
In a traditional firm, this graph is executed by people arranged in a pyramid:
- Partner scopes the problem
- Manager coordinates the work
- Juniors produce the artifacts
- Partner validates the result
Leverage = how many juniors fit under one partner.
In an AI-native firm, the same graph looks different:
- Senior expert designs the graph
- System orchestrates execution
- Model executes a significant portion of intermediate nodes
- Humans sit at high-stakes checkpoints — verification, escalation, sign-off
- Firm monetizes the architecture of the graph, not just the effort
Leverage = how many nodes can run reliably without a human in the loop.
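The AI-native structure above can be sketched in a few lines. The node names and the leverage metric are illustrative assumptions, not a real orchestration framework:

```python
# Hypothetical sketch of a delivery graph: a chain of tasks, some of
# which require a human checkpoint. All names are invented.
from dataclasses import dataclass, field

@dataclass
class Node:
    name: str
    needs_human: bool                           # verification / sign-off checkpoint?
    inputs: list = field(default_factory=list)  # upstream node names

graph = [
    Node("scope_problem",  needs_human=True),                               # Brains
    Node("extract_facts",  needs_human=False, inputs=["scope_problem"]),    # Procedure
    Node("draft_analysis", needs_human=False, inputs=["extract_facts"]),    # Procedure
    Node("cross_check",    needs_human=False, inputs=["draft_analysis"]),   # Procedure
    Node("final_sign_off", needs_human=True,  inputs=["cross_check"]),      # Gray Hair
]

# Leverage in this framing: the share of nodes that run reliably
# without a human in the loop.
automated = sum(1 for n in graph if not n.needs_human)
leverage = automated / len(graph)  # 3 of 5 nodes -> 0.6
```

Note where the humans sit: at the framing node and the sign-off node — exactly the "Brains-framed, Gray-Hair-supervised, Procedure-executed" shape.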
This gives us a tighter formula for what a profitable AI-native service line actually looks like:
Brains-framed, Gray-Hair-supervised, Procedure-executed graph.
The expert frames the problem. Experienced judgment governs the critical decisions. AI executes the procedural middle. The firm charges for the architecture, the reliability, and the final sign-off — not for the hours it took.
But here's the catch: this doesn't give you infinite leverage. The graph has its own bottlenecks.
Decomposition. Someone has to break the problem into the right nodes. This is the new elite skill — and it's scarce.
Verification. The more powerful the generation, the more expensive it becomes to check the output. In high-stakes domains, the cost of validation doesn't disappear — it becomes central.
Exceptions. Graphs work beautifully on the standard path. Valuable client work often breaks the pattern. Exception handling snaps you right back into expensive senior judgment.
Context. Each node solves locally, but the client's problem requires global coherence. Stitching local outputs into a coherent answer is its own expensive function.
Trust. Even if 80% of the work was done by a pipeline, someone must sign the final result. In professional services, clients often pay precisely for that signature, that accountability, that trust.
The graph doesn't make leverage infinite. It moves the bottleneck from "production capacity" to "architecture, verification, exceptions, and accountability."
The Trap Everyone Misses
Now here's the question most firms get wrong.
The question every firm asks: Can AI do this work?
The question they should ask: Can we verify that AI did it right — cheaper than doing it ourselves?
This distinction changes everything. Consider:
| | Easy to verify | Hard to verify |
|---|---|---|
| Easy to produce | Commodity automation | False-friend zone |
| Hard to produce | AI-native sweet spot | Human-dominant |
Commodity automation (easy to produce, easy to verify): formatting, template extraction, rules-based classification. The money is real but commoditizes fast.
AI-native sweet spot (hard to produce, easy to verify): complex software with good test coverage, compliance mapping, structured due diligence, analytics with rubric-based outputs. Generation is expensive and valuable, but verification is cheap — you can build a reliable graph here.
Human-dominant (hard to produce, hard to verify): unique strategic decisions, bespoke negotiations, socially complex transformations. AI helps you think, but doesn't become the engine of delivery.
And then there's the dangerous quadrant.
False-friend zone (easy to produce, hard to verify): the model will happily produce a "convincing" strategy memo, a "solid" analysis, a "professional" report. It looks great in a demo. But verifying that it's actually right — materially correct, substantively sufficient, contextually appropriate — costs almost as much as writing it from scratch.
This is the zone of impressive demos and weak economics. The benefit of cheap generation gets eaten alive by expensive human review.
Here's a helpful litmus test: if your reviewer has to essentially re-think the entire piece to verify it, you're in the false-friend zone. Your AI is creating the illusion of leverage, not the reality.
Software delivery understood this early — not because writing code is easy, but because software has a rich verification layer: tests, types, linters, CI pipelines, staging environments, rollback. You can check correctness without re-doing the work.
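A toy example of why this matters — a sort illustrates the asymmetry. Producing a correct sort takes real work; certifying one takes two cheap mechanical checks, neither of which redoes the production:

```python
# Toy illustration of a cheap verification layer: checking a sort
# without re-sorting. The example is ours, not from the article's framework.
from collections import Counter

def verify_sort(inp, out):
    # Check 1: the output is in order (one linear pass).
    ordered = all(a <= b for a, b in zip(out, out[1:]))
    # Check 2: the output contains exactly the input's items (multiset equality).
    same_items = Counter(inp) == Counter(out)
    return ordered and same_items
```

The check certifies correctness against the *spec* rather than against a reviewer's re-derivation. That's the property most advisory deliverables lack — and why their verification cost stays human-priced.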
Most advisory work hasn't figured this out yet.
Where the Money Actually Is
If you overlay AI's impact onto Maister's three types of work, the picture is clear:
| Practice type | What AI does | Consequence |
|---|---|---|
| Procedure | Most graphable, most automatable | Price drops, transparency rises, margin compresses, consolidation accelerates |
| Gray Hair | Best segment for AI-native capture — client still needs a human, but most delivery lives in the graph, error cost is high, judgment premium holds | The sweet spot |
| Brains | Graph augments frontier thinking (more options explored, faster synthesis) but doesn't replace it | Stays premium, too narrow for mass money pool |
The main prize is Gray Hair work, translated into a managed graph.
Not full automation — that's Procedure, and the margins are heading to zero. Not pure Brains — that's too bespoke to scale. The sweet spot is the middle: work where the client still needs experienced judgment, but where a significant share of delivery can live inside a well-governed, verifiable graph.
What This Means for Your Firm
A few implications worth sitting with:
The transition won't be won by whoever adopts AI first. It will be won by whoever learns to build verifiable graphs — who can decompose expert work into nodes that are cheap to execute, cheap to check, and cheap to re-run.
Practice areas will be defined not just by domain, but by graph economics. The relevant question is no longer "do we have expertise in X?" but "can we build a reliable, verification-friendly delivery graph for X?"
The form of your deliverable becomes a strategic decision. A memo is hard to verify. A memo with a source map, an assumption ledger, and an evidence trail is much easier. Firms that redesign their outputs to be proof-carrying will have structurally better economics.
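What might a proof-carrying deliverable look like as a data structure? A hypothetical sketch — the field names are invented for illustration, not a prescribed schema:

```python
# Hypothetical shape for a "proof-carrying" memo: every claim carries
# the evidence a reviewer needs to verify it cheaply. Names are invented.
from dataclasses import dataclass, field

@dataclass
class Claim:
    text: str
    sources: list       # evidence trail: where the claim comes from
    assumptions: list   # assumption ledger: what must hold for it to be true

@dataclass
class Deliverable:
    title: str
    claims: list = field(default_factory=list)

    def unverifiable_claims(self):
        # The reviewer triages straight to claims with no evidence trail,
        # instead of re-thinking the entire memo line by line.
        return [c.text for c in self.claims if not c.sources]

memo = Deliverable("Market entry memo", [
    Claim("Segment is growing 12% YoY", sources=["analyst_report_p4"], assumptions=[]),
    Claim("Incumbents will not cut price", sources=[], assumptions=["rational pricing"]),
])
```

Here review effort concentrates on the one unsourced claim rather than spreading across the whole document — the structural change that makes verification cheap.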
Automating generation without automating verification is a trap. If you're making AI write drafts but still having seniors review them line by line, you've sped up production while keeping the most expensive bottleneck intact.
We've Built the Operating System for This
This article covers the lens. Behind it, there's a full operational framework.
A structured intake model for evaluating which assignments are actually good candidates for AI-native delivery — and which are false friends. A multi-level assessment protocol that doesn't require building the full graph upfront. A library of assignment archetypes and graph patterns. A set of reshape playbooks that turn "AI-assisted at best" work into graph-friendly delivery. And the automation tooling that brings it all to life.
We use this framework ourselves, and we implement it for firms that are serious about making AI-native delivery actually profitable — not just faster.
If you're running a practice and this resonated, let's talk.