Saqueib Ansari

Posted on May 20 • Originally published at qcode.in

Claude Code or a script? Depends on what kind of change you're making

#automation #refactoring #codemods #developertools

If the change is truly mechanical, I do not want Claude Code making judgment calls. I want a script.

That is the real answer to Claude Code vs scripts for repo-wide changes, and it cuts against the current tooling mood a bit. Teams often reach for Claude Code because it feels more powerful, more flexible, and more capable of handling messy codebases. All of that can be true. It just does not make it the right first tool for sweeping mechanical edits.

For repo-wide transformations, the most important question is not “Which tool is smarter?” It is “Does this change require interpretation, or does it require consistency?”

If the job is mostly consistency, scripts usually win. If the job is mostly interpretation, Claude Code becomes much more valuable. The tricky part is that many large migrations start out looking deterministic and only reveal their semantic edge cases once you get deep enough into the diff.

That is why this comparison is worth making carefully. The wrong choice does not just waste time. It can increase review burden, widen blast radius, and turn a boring upgrade into an expensive cleanup project.

Start with the shape of the transformation, not the number of files

A lot of teams choose the tool based on scale alone.

They think:

small change, maybe use an agent
large change, definitely use an agent

That logic is backwards.

A change across 4,000 files can be a better fit for a script than a change across 40 files if the large one follows one deterministic rule and the small one depends on local code meaning. The deciding variable is not file count. It is uniformity.

Script-shaped changes

These are the repo-wide updates that can be explained almost like compiler passes:

rename one namespace to another everywhere
replace one config key with a new key
rewrite import paths from one package entrypoint to another
swap a deprecated helper for a one-to-one replacement
normalize generated annotations or docblock tags
update one constant name across a codebase

What matters here is not that the job is easy. It is that the transformation should behave the same way everywhere. If local variation appears, it is usually a bug in the migration, not a feature of the migration.

That kind of work is where scripts, codemods, or AST-based transforms are strongest. They are deterministic, rerunnable, and brutally consistent.

Claude Code-shaped changes

These are the updates that stop being safe the moment you try to treat them as pure search-and-replace:

one deprecated API has multiple valid replacements depending on calling context
the same helper name means different things in different layers of the app
old tests need different rewrites depending on fixture style or harness assumptions
some modules follow the “official” pattern, while others rely on old behavior that still matters
the transformation triggers nearby edits like constructor injection, assertion updates, or altered setup logic

In those cases, the change may still be broad, but it is no longer purely mechanical. Now you need local reasoning.

That is where Claude Code can genuinely help. It can read the file, inspect nearby code, infer which replacement pattern fits, and carry out a context-sensitive edit without you writing a giant tree of codemod conditions.

The trap is using that power where it was never needed.

Scripts win because determinism is easier to trust than intelligence at scale

When a change spans hundreds or thousands of files, the most underrated engineering property is not raw capability. It is auditability.

A deterministic script makes the rule explicit. That changes how the whole migration feels.

The rule is visible before the diff is visible

With a script or codemod, reviewers can inspect the transform logic directly. They can say:

yes, this only touches certain files
yes, this replacement is narrow
yes, this is the exact condition being matched
yes, this is safe to rerun

That is a powerful advantage.

Compare that with an agent-driven migration where the implicit rule is basically:

“Claude Code examined each file and seemed to make the right call.”

That is workable for nuanced refactors. It is weaker for mass mechanical edits because the reviewer now has to infer the rule from the output instead of inspecting the rule directly.

Rerun safety matters more than teams expect

Repo-wide migrations rarely land cleanly on the first try.

Typical reality looks more like this:

first pass misses files in an unexpected directory
CI fails on one environment-specific path
generated code reintroduces old patterns
another branch merges stale syntax back in
a release train forces the migration to happen in two stages

Good scripts are built for reruns. That is one of their superpowers.

A simple codemod can often be rerun with confidence after every rebase or CI failure. The exact same rule applies again. That is much harder to guarantee with an agent session, especially if the instructions are broad and the tool is making contextual decisions.

Consistency is the product

For truly mechanical changes, local variation is not sophistication. It is risk.

If 1,200 files need the exact same structural rewrite, then a tool that improvizes slightly different versions of the rewrite is not being helpful. It is creating inconsistency you now have to review, justify, and maintain.

This is where scripts beat Claude Code decisively. Scripts do not get clever. For this class of work, that is a strength.

Claude Code becomes valuable when the migration is only pretending to be mechanical

A surprising number of “simple” migrations fall apart once you inspect the real codebase.

This is the moment where a script-first mindset is still right, but a scripts-only mindset becomes expensive.

Imagine a broad update like this:

replace deprecated fetchUser() with the new repository layer

At first glance, that sounds like a codemod. Then you find the real world:

controllers should become $users->findVisible($id)
admin flows should become $users->findIncludingArchived($id)
background workers should become $users->findForProcessing($id)
tests should sometimes use a fake repository instead of any of those

Now there is no one safe replacement rule.

You can still write a codemod, but the cost rises sharply. You are no longer writing a transform. You are encoding local semantics.

This is Claude Code’s best repo-wide use case

Claude Code is strongest when the migration has a deterministic backbone plus contextual exceptions.

It helps with things like:

reading the call site and choosing the right replacement variant
applying a broad API shift while fixing adjacent fallout
rewriting tests differently depending on fixture setup
updating constructor signatures or imports after a local transform
explaining why a subset of files should be handled manually or in a separate batch

This is very different from “Claude Code should perform the whole migration.”

A better mental model is:

scripts handle the bulk rule; Claude Code handles the semantic tail.

That is a far more productive way to combine the two.

The best workflow is usually script first, Claude Code second

Most teams should not treat this as a binary choice. They should use a staged migration workflow.

Here is the pattern I would recommend in practice.

Step 1: isolate the deterministic core

Before you open Claude Code, ask a hard question:

What part of this migration can be expressed as a rule instead of as instructions?

If a large chunk can be expressed as code, do that first.

For example:

from pathlib import Path

for path in Path("src").rglob("*.php"):
    content = path.read_text()
    updated = content.replace("OldNamespace\\", "NewNamespace\\")
    if updated != content:
        path.write_text(updated)

Or better, an AST-based codemod for languages where syntax-aware changes matter.

The point is not elegance. It is narrowing the problem.

Step 2: run tests and static analysis immediately

Do not ask Claude Code to finish a migration before you know what the deterministic bulk broke.

Run:

unit tests
type checks
linters
static analysis
targeted integration tests if the changed area is risky

This gives you an exception set rather than a repo-sized cloud of uncertainty.

Step 3: classify the failures

Once the automated pass finishes, the remaining failures usually fall into a few buckets:

missed directories or file types
false positives from the script
genuine semantic exceptions
local fallout like imports, constructor updates, or test harness adjustments

This is the moment where Claude Code can become much more cost-effective.

Step 4: use Claude Code on the exception set only

Now the instructions become sharper.

Instead of:

“Migrate the whole repo to the new API.”

You can say:

“These 27 files failed after the codemod. Fix only these. Keep behavior unchanged. Prefer the new repository method that matches local visibility rules.”

That is a much healthier use of an agent. You are no longer paying for interpretation across 1,800 files. You are paying for interpretation exactly where automation stopped being safe.

Step 5: rerun the deterministic pass if needed

If more stale patterns reappear after rebases or generated code updates, rerun the script. That is part of the advantage of keeping the mechanical rule separate.

Claude Code is not a great substitute for rerunnable infrastructure.

The real tradeoff is auditability versus local judgment

The common framing — scripts are dumb, agents are smart — is too shallow to guide real migrations.

The better framing is this:

scripts maximize auditability
Claude Code maximizes local judgment

That leads to much better decisions.

When auditability should dominate

Prefer scripts first when:

the change must be uniform everywhere
the main risk is missed files, not semantic ambiguity
you want easy reruns after CI or rebases
reviewers should be able to understand the transformation rule directly
diff volume is large enough that local variation becomes dangerous

In other words, scripts are best when you want fewer decisions.

When local judgment should dominate

Prefer Claude Code earlier when:

the replacement depends on surrounding code
the repo contains inconsistent historical patterns
multiple valid replacements exist for the same old API
the migration requires nearby fixes that are cumbersome to encode in a codemod
expressing the rule in code would take longer than applying it intelligently

In other words, Claude Code is best when the migration contains irreducible interpretation.

Cost and review burden both push scripts upward for true mechanical work

There is also a simple economics argument here.

For genuinely mechanical changes, Claude Code is often the more expensive option even if it feels faster at first.

It costs more to supervise

A codemod lets you review the rule once and then review the outcome strategically.

An agent-driven migration often forces you to spot-check many local edits because you need confidence that it did not interpret similarly shaped files slightly differently. That review tax is real.

It costs more to reapply

Scripts are naturally rerunnable. Agent sessions are not. The moment a migration needs a second pass, the script’s value goes up sharply.

It increases blast radius more easily

Claude Code may decide to tidy adjacent code while it is in the file:

normalize style
simplify nearby logic
rename variables
clean unused imports
restructure a small helper

That may be nice in isolation. In a repo-wide mechanical change, it inflates diff noise and makes code review harder. Mechanical migrations benefit from narrowness, not taste.

This is one of the strongest arguments for scripts: they usually do not have “opinions” beyond the rule you encoded.

What I would choose in practice

If I am doing any of the following:

import path rewrites
namespace renames
config key migrations
generated annotation updates
exact helper replacements with the same semantics
bulk formatting or attribute normalization

I will reach for a script, codemod, or AST transform first almost automatically.

If I am doing something like:

deprecated API replacement with multiple semantic destinations
test migration with fixture-dependent rewrites
helper replacement that changes surrounding dependency wiring
bulk refactor where the last 10-20 percent depends on business meaning

I will bring Claude Code in sooner.

But even then, I still want a script handling the boring 80 percent if that 80 percent is real.

The rule of thumb that actually holds up

If the migration rule can be stated more clearly as code than as prose, start with a script.

If it can only be explained properly with examples, caveats, and “except in these cases,” then Claude Code becomes much more attractive.

That rule is practical because it maps directly to the nature of the transformation.

The conclusion most teams need

Claude Code is not the default winner for repo-wide changes just because repo-wide changes are large.

For truly mechanical transformations, local scripts usually win because they are:

more deterministic
easier to review
easier to rerun
cheaper to scale
less likely to drift semantically

Claude Code becomes the better tool when the migration stops being truly mechanical and starts depending on local interpretation.

So the practical takeaway is simple:

Use scripts for bulk certainty. Use Claude Code for semantic exceptions.

If you reach for the agent before checking whether a codemod can express the rule cleanly, you are probably choosing the more exciting tool instead of the better one.

And for sweeping mechanical edits, boring is usually the sign that you chose correctly.

Read the full post on QCode: https://qcode.in/claude-code-vs-local-scripts-for-repo-wide-mechanical-changes/

DEV Community