Keith MacKay

Posted on Jun 8

AI Augmentation: Amazing. Replacement: A Rarity (AI Can't Do Your Whole Job).

#ai #career #management

AI Augmentation: Amazing. AI Replacement: A Rarity (It Can Only Do a Fraction of Your Job).

The "AI will take your job" prediction keeps getting the unit of analysis wrong. Jobs are bundles, and AI only handles part of the stack.

Your legal team just ran a document review that would have taken three paralegals two weeks. An AI completed it in four hours. Your CFO is now asking the obvious question: do we still need paralegals?

The question sounds reasonable. The answer is yes. The confusion about why reveals something important about what jobs actually are.

A Job Is Not a Task

When people say "AI will take jobs," they're collapsing two different things.

A task is a discrete unit of work: summarize this contract, identify anomalies in this dataset, generate a first draft of this email. A job is a bundle of dozens of tasks, plus the judgment that connects them, plus the relationships that give the output meaning, plus the accountability for when things go wrong.

AI is genuinely good at tasks. AI cannot hold a job.

Think about what a paralegal actually does over the course of a month. Document review is maybe 30% of it. The rest: advising attorneys on case strategy based on accumulated pattern recognition, managing client communication that requires tone-reading and discretion, deciding which documents in a production are strategically significant versus merely responsive, carrying institutional knowledge about the firm's risk tolerance and client history, and being accountable (in a professional and legal sense) for the work product.

The AI completed the document review. It cannot do the rest. The paralegal who now does less document review has more time to do the rest better.

A job has dimensionality. A task is one-dimensional.

The Dimension Stack

Think of every job as a stack of dimensions. Each dimension describes a type of work along a spectrum from "AI handles this reliably" to "AI struggles and a human is essential":

Volume and pattern recognition: AI wins, and it isn't close. Processing 200,000 documents, reading radiology scans for anomalies, flagging fraud transactions at scale: these are high-volume, pattern-rich tasks where AI outperforms humans on speed and consistency, especially at 2 AM.
Judgment under ambiguity: Humans win. When the facts are incomplete, the stakeholders are difficult, the situation has no clear precedent, and being wrong has real consequences, AI generates plausible-sounding answers. Humans know what they don't know. (Mostly.)
Relational complexity: Humans win. Negotiating a contract isn't just parsing terms: it's reading the room, understanding what the other party actually wants versus what they're asking for, and deciding how hard to push. AI can prepare you for that conversation. It cannot have it.
Accountability: Humans win by default. Someone has to own the outcome. AI doesn't hold a professional license, can't be sued, and can't make the judgment call about when a risk is worth taking. When AI-assisted work goes wrong, the human in the loop is still the one in front of the client or the regulator.
Novel framing: Humans win (for now). Identifying the right question (deciding which problem is worth solving before anyone has framed it) is still predominantly human territory.

Most jobs touch all five dimensions. AI currently handles the first well and struggles with the other four.

MIT economist Daron Acemoglu, in a 2024 working paper on the macroeconomics of AI, made a similar point with more precision [1]. His argument: AI's productivity gains are real, but they concentrate in a narrow slice of tasks within each occupation. He estimated that AI, in its current form, materially affects only about 5% of tasks in the average job: the high-volume, pattern-rich slice. The other 95%, requiring what he called multi-task fluidity (the ability to switch between judgment calls, relational work, novel situations, and domain-specific improvisation across a single workday), remains outside what current systems can handle reliably. His projected contribution to overall economic growth: roughly 0.07% annually. Nowhere near the 5-10% projections from the optimist camp. His 5% figure is the most conservative in the field; Goldman Sachs estimates 25% of all work tasks are eventually automatable, and Penn Wharton puts 40% of labor income in the exposure zone [2]. The right answer is somewhere in that range, which is large enough to be consequential and uncertain enough to warrant humility about any single projection.

The fluidity point is underappreciated. A paralegal doesn't spend eight hours on document review and then clock out. They spend 90 minutes on document review, then pivot to a client call that requires empathy and discretion, then draft a memo that requires strategic judgment, then field an unexpected question that requires institutional memory. The pivot itself, the reading of context to know which cognitive mode to engage, is something AI cannot do. The tasks are automatable in isolation. The job, the sequence of pivots across a day, is not.

What the Pairs Research Shows

A meta-analysis across medical, legal, and technical domains found a consistent performance staircase: human alone 68%, AI alone 77%, AI plus human 80%, full collaborative framework 88% [3]. The gap between AI alone and full human-AI collaboration is larger than the gap between AI alone and human alone. The pairing matters.

Gartner's May 2026 study of 350 executives reinforces the organizational stakes. Companies using AI to amplify workers outperform those using it to replace them. Gartner VP Helen Poitevin: "Workforce reductions may create budget room, but they do not create return" [4].

Radiologists working with AI-assisted anomaly detection have lower miss rates than either the AI or the radiologist working alone. The AI catches what tired human eyes miss during a 12-hour shift; the radiologist catches the anomaly that falls outside the AI's training distribution. Neither is redundant. Geoffrey Hinton declared radiologists would be obsolete in 2016. Their median salary is now $571K and growing [5]. A decade-long natural experiment: the AI took the routine scans; the radiologist salary rose because judgment and accountability became more valuable, not less.

In chess (where this research goes back decades), humans paired with AI assistance beat AI alone and unassisted grandmasters. The telling detail: the winning pairs weren't necessarily the grandmasters with the highest individual ratings. They were the humans who understood what the AI saw, what it missed, and when to trust it versus override it. Kasparov called these pairs "centaurs" and argued that the insight applies everywhere knowledge work meets computation [6].

A study of GitHub Copilot users found developers completed tasks 55% faster on average, with code that passed quality checks at equivalent rates [7]. The speed gain was largest for the kind of boilerplate work that senior engineers find most draining, which means senior engineers got more time for the architecture and debugging that actually requires them.

The bottleneck shifts upward. AI raises the floor. The ceiling (judgment, relationships, accountability) becomes the new constraint.

The Cognitive Surrender Trap

There is a version of augmentation that isn't augmentation. When AI handles the routine tasks, the natural human response is to do less: fewer deep reads, shallower research, faster decisions with less independent verification. That response is rational in the short run and corrosive over time.

A 2026 peer-reviewed study in Human Behavior and Emerging Technologies gave this dynamic a name and proved it empirically: the Paradox of Augmentation. Human performance initially rises with AI support. With sustained use, the curve eventually dips below baseline (the human performing worse than before they had the tool) [8]. The mechanism is straightforward. Skills not exercised atrophy. The AI handled the practice reps.

Cognitive skills require exercise. The radiologist who stops reading difficult scans because AI flags the obvious ones will, over time, lose the pattern recognition that makes them valuable on the edge cases. The lawyer who delegates all document review loses the intuition for what documents actually say and what they imply strategically. The engineer who never writes foundational code loses the feel for what the AI is generating and where it is likely to fail. A 2026 study found AI coding assistance lowers code comprehension scores by 17% and makes experienced developers 19% slower on debugging tasks (while they report feeling 20% faster) [9]. The confidence goes up. The capability goes down.

Augmentation requires deliberate reinvestment. The hours AI saves are not supposed to become idle time. They're supposed to become harder work. The paralegal freed from document review should be in the deposition room, not watching the hours tick by. The radiologist whose routine scan volume drops should be spending more time on the cases that don't fit the pattern. The engineer whose boilerplate writes itself should be designing the architecture.

There is also a generational dimension worth naming. A March 2026 Psychology Today analysis distinguishes two patterns: adults lose skills to AI, and children never build them [10]. Workers 46 and older offload tasks they already mastered; they lose capability but retain a foundation. Workers 17-25 offload tasks they were supposed to be learning. The 55% speed gain from Copilot is real for a senior engineer who understands what good code looks like. For the junior developer who never wrote the boilerplate, there is no foundation to fall back on.

Research in Scientific Reports (2026) adds a further wrinkle: AI collaboration enhances task performance but measurably undermines intrinsic motivation and sense of ownership [11]. Augmentation has costs beyond skill atrophy.

This is the real risk for organizations that automate without intent: you don't lose the job title, you lose the capability behind it. The work gets lighter, the judgment atrophies, and when the hard case arrives (the one that requires genuine expertise), the human who was supposed to be the backstop has spent two years exercising none of the muscles that would have caught it.

What Good Augmentation Looks Like in Practice

The Stanford HAI 2026 AI Index found developer employment for ages 22-25 fell nearly 20% since 2024, while developers 30 and older at the same companies grew [12]. The floor rises for those already above it. Access to the skills that get you to the ceiling is shrinking.

The practical question for any leader: where in your team's work does AI handle a dimension well, and what should that free people to do?

A mapping exercise worth running: list the recurring tasks in a given role. Estimate the time each consumes. Score each against the dimension stack: which are high-volume pattern tasks AI can accelerate, which require judgment, relationships, or accountability? The tasks where AI provides real leverage are candidates for offloading. The tasks that require the upper dimensions are where freed time should go.

A few patterns worth watching across industries:

In client-facing roles: AI handles research, briefing preparation, and follow-up documentation. The human handles the actual relationship. The ratio of meaningful client contact per professional increases, which is the point (and the thing that clients actually pay for).

In technical roles: AI handles implementation of known patterns. The human handles architecture, debugging novel failures, and deciding what is worth building. The quality bar on human decisions rises because implementation cost drops, making more ideas worth testing.

In analytical roles: AI surfaces patterns in data at a scale and speed no human team matches. The human decides which patterns matter, what they imply, and how to present findings to stakeholders who asked the wrong question. The analysis becomes cheap; the interpretation is the scarce resource.

In each case, the job survives because the job was never the task. The job was the bundle.

The Bottom Line

AI replaces tasks. It doesn't replace the judgment, relationships, and accountability that bundle tasks into jobs. The human who works alongside AI and invests the recovered time in harder work is more capable than either the AI alone or the human before the AI arrived.

The risk worth watching isn't replacement. It's atrophy. The document review AI completed in four hours freed three paralegals for two weeks of higher-dimension work. Or it gave them two weeks of lighter schedules and a gradual erosion of the skills that made them worth keeping. Which version your organization gets depends entirely on whether you're deliberate about it.

The bundle doesn't disappear. It thins, if you let it.

Where have you seen AI augmentation actually work, where the human genuinely got better because of the pairing rather than just faster? And where have you seen the atrophy trap play out? Both patterns are real, and the difference between them isn't the technology.

Related reading:

For the argument that AI should augment cognition rather than replace it, and why convenience is the enemy of capability: On LinkedIn
For how to think about AI as a capable colleague rather than a formula or tool, with implications for how much autonomy to grant: On LinkedIn | On Substack | On Medium
For the organizational strategy of putting humans before the loop rather than in it, and what that means for judgment-intensive work: On LinkedIn | On Substack | On Medium
For the reality behind AI-driven layoff announcements and whether jobs are actually being replaced or just tasks: On LinkedIn | On Substack | On Medium

References

The Simple Macroeconomics of AI — Acemoglu, D., NBER Working Paper 32487, 2024. Estimates AI materially affects roughly 5% of tasks in the average occupation; projects 0.07% annual TFP growth from current AI systems. Introduces the multi-task fluidity constraint on AI task substitution.
AI's Economic Potential: Goldman Sachs Responds to Daron Acemoglu — AEI, 2024. Goldman Sachs estimates 25% of all work tasks are eventually automatable; Penn Wharton analysis puts 40% of labor income in the exposure zone.
PMC Meta-Analysis: Human-AI Collaboration Performance — Meta-analysis across medical, legal, and technical domains. Human alone 68%, AI alone 77%, AI plus human 80%, full collaborative framework 88%.
Gartner: Autonomous Business and AI Layoffs May Create Budget Room but Do Not Deliver Returns — Gartner, May 2026. Study of 350 executives; companies using AI to amplify workers outperform those using it to replace them.
Godfather of AI Geoffrey Hinton, Radiologists, and the Future of Work — Fortune, May 2026. Radiologist median salary now $571K and growing a decade after Hinton's 2016 obsolescence prediction.
Deep Thinking: Where Machine Intelligence Ends and Human Creativity Begins — Kasparov, G., PublicAffairs, 2017. Kasparov's centaur chess research and the generalization to human-AI collaboration.
GitHub Copilot Research: The Impact of AI on Developer Productivity — GitHub, 2022. Controlled study: developers completed tasks 55% faster with Copilot assistance; code quality equivalent to unassisted work.
Paradox of Augmentation — Human Behavior and Emerging Technologies, 2026. Human performance initially rises with AI support, then dips below baseline with sustained use. Empirical evidence for skill atrophy under AI assistance.
Skill Atrophy in AI-Augmented Engineering — 2026. AI coding assistance lowers code comprehension scores by 17% and makes experienced developers 19% slower on debugging tasks, while developers report feeling 20% faster.
Adults Lose Skills to AI, Children Never Build Them — Psychology Today, March 2026. Distinguishes skill loss in workers 46+ (offloading mastered tasks) from skill formation failure in workers 17-25 (offloading tasks they were supposed to be learning).
AI Collaboration, Task Performance, and Intrinsic Motivation — Scientific Reports, 2026. AI collaboration enhances task performance but measurably undermines intrinsic motivation and sense of ownership.
The Real Job Destruction from AI Is Hitting Before Careers Can Start — Yale SOM / Stanford HAI 2026 AI Index. Developer employment ages 22-25 fell nearly 20% since 2024; developers 30 and older at the same companies grew over the same period.

Keith MacKay is a technology strategy consultant and CTO in EY-Parthenon's Software Strategy Group (SSG), specializing in AI disruption and technology diligence for private equity and corporate clients. SSG's AI Disruption Lab conducts rapid assessments of how AI transforms and threatens existing business models and value chains. Keith teaches at Northeastern University and writes about strategy, management, and AI/technology, with Claude Code and Codex as AI collaborators.