AI did not remove the engineering bottleneck. It moved it.
Code is cheaper than it has ever been. Prototypes appear in hours. Complex systems that once took weeks can now be assembled in a few days. That sounds like progress, but in many teams it creates a new failure mode: shipping the wrong thing faster and mistaking motion for impact.
That is the trap.
When implementation gets cheaper, decision quality becomes more important, not less. If a team does not know exactly what problem it is solving and how success will be measured, AI becomes a force multiplier for confusion. It helps you produce more code, more architecture, more internal tooling, and more maintenance burden without increasing the odds that any of it matters.
The discipline I keep returning to is simple:
- Build for one measurable goal.
- Ship to remove ambiguity.
Those two rules sound almost trivial. In practice, they cut through a surprising amount of waste.
AI Made Overbuilding Easier
Most engineers are not reckless. The problem is deeper than that.
Good engineers are usually system-oriented. They like coherent designs, scalable abstractions, elegant mechanisms, and solutions that feel future-proof. Those are useful instincts. But AI turns those instincts into a liability when the goal is still unclear.
Once the cost of implementation drops, overengineering becomes the default. It becomes easy to build:
- AI pipelines for problems that are still hypothetical
- generalized systems before the narrow use case is proven
- adaptive experiences that are impossible to evaluate cleanly
- automation layers that create more operational burden than user value
The question stops being "can we build this?" because now the answer is almost always yes.
The real questions are:
- Will this move the metric that matters?
- Can we prove that it moved it?
- Can we operate the system after the demo?
If those answers are unclear, speed is not helping. It is hiding the mistake until later.
Rule 1: Build for One Measurable Goal
A team needs one scoreboard.
Not "improve onboarding."
Not "make the product smarter."
Not "invest in AI capabilities."
One measurable goal.
For example:
Activation = percentage of new users who reach the first success moment within 10 minutes.
That definition does a lot of work immediately. It forces clarity. Every idea now has to answer a simple question: how does this move activation?
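A metric this concrete is also trivially computable. As a minimal sketch (the event names `signup` and `first_success` are illustrative, not from any particular analytics schema), activation can be derived directly from timestamped events:

```python
from datetime import datetime, timedelta

# Hypothetical event records: (user_id, event_name, timestamp).
# "signup" marks when the user starts; "first_success" is the
# product's first success moment. Both names are assumptions.
events = [
    ("u1", "signup", datetime(2024, 5, 1, 9, 0)),
    ("u1", "first_success", datetime(2024, 5, 1, 9, 7)),
    ("u2", "signup", datetime(2024, 5, 1, 9, 5)),
    ("u2", "first_success", datetime(2024, 5, 1, 9, 30)),  # too slow
    ("u3", "signup", datetime(2024, 5, 1, 10, 0)),         # never succeeds
]

def activation_rate(events, window=timedelta(minutes=10)):
    """Share of new users who reach first_success within the window."""
    starts, successes = {}, {}
    for user, name, ts in events:
        if name == "signup":
            starts[user] = ts
        elif name == "first_success":
            successes.setdefault(user, ts)  # keep the earliest success
    activated = sum(
        1 for user, start in starts.items()
        if user in successes and successes[user] - start <= window
    )
    return activated / len(starts) if starts else 0.0

print(activation_rate(events))  # u1 activated; u2 too slow; u3 dropped off
```

If the team cannot write a function like this against real events, the goal is not yet measurable, and the rule says it is not yet a goal.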
If the answer is vague, it should not be built yet.
This rule also removes a common failure mode: bundling multiple ideas into one release. The moment a team changes too many things at once, causality disappears. If results improve, nobody knows why. If results get worse, nobody knows what to undo. Teams end up replacing evidence with storytelling.
There is another important part here: engineering should not define the target alone. Business, product, or whoever owns the outcome needs to sign off on the metric. That alignment is not bureaucracy. It is protection against teams wandering into technically impressive work that nobody actually needed.
Rule 2: Ship to Remove Ambiguity
Shipping is often framed as delivering value. Early in a project, that is incomplete.
Early shipping is mostly about buying information.
Every product team works inside ambiguity. People have opinions about what users want, where friction lives, and what kind of experience will unlock growth. Most of those opinions are partly wrong. The only reliable way to reduce uncertainty is to ship something small enough that the result teaches you something specific.
That is why the second rule matters: ship to remove ambiguity.
The release should be designed to answer a question, not to satisfy everyone's wishlist.
If the team believes users are dropping out because account creation appears too early, the first release should test that idea as directly as possible. Not a redesign. Not a multi-quarter onboarding initiative. Just the smallest intervention that can confirm or reject the hypothesis.
Code is not the product in that moment. Code is the receipt for the information you bought.
A Concrete Example
Imagine a team with a growth problem. Installs are healthy, but new users do not reach the first success moment. They open the app, hesitate, and disappear.
The team defines the goal clearly:
Activation = reaching the first success moment within 10 minutes.
Then they find the biggest drop-off: forced account creation before the user gets any value.
There are two possible paths.
The first path is boring:
- add "Continue without account"
- shorten the path to the first useful action
- defer signup until after value is visible
This is not glamorous. It is narrow, unattractive, and easy to dismiss. But it is a clean test of the hypothesis that friction before value is suppressing activation.
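The mechanics of that clean test are deliberately small. One way to sketch it (assuming a hash-based bucketing scheme; the experiment name `guest_mode` and the helper names are hypothetical) is a deterministic flag that defers signup for half of new users:

```python
import hashlib

def variant(user_id: str, experiment: str = "guest_mode") -> str:
    """Deterministically assign a user to control or treatment.

    Hash-based bucketing keeps the assignment stable across sessions
    without storing any state. Names here are illustrative.
    """
    digest = hashlib.sha256(f"{experiment}:{user_id}".encode()).hexdigest()
    return "treatment" if int(digest, 16) % 2 == 0 else "control"

def requires_account(user_id: str) -> bool:
    # Treatment users get "Continue without account": signup is deferred
    # until after the first success moment. Control keeps the old flow.
    return variant(user_id) == "control"
```

Because assignment is a pure function of the user ID, the same user always sees the same flow, and the only thing that differs between groups is the one variable under test.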
The second path is seductive:
"Give me three days and I'll build an AI onboarding concierge."
Now the system will:
- infer user intent
- personalize the flow
- rewrite copy dynamically
- adapt the experience by persona
This sounds modern. It sounds strategic. It sounds like the kind of thing ambitious teams should be doing.
It is also where many teams get trapped.
The Three-Day AI Promise
The danger is not that the AI idea is impossible. The danger is that it is legible as impact before it is validated as impact.
An AI onboarding system does not arrive alone. It brings hidden operational weight:
- evaluation complexity
- model drift
- latency
- monitoring
- privacy review
- cost uncertainty
- versioning problems
- support burden
- on-call responsibility
What looked like a fast solution in a demo quietly becomes a long-term maintenance tax.
And if activation does not improve, the team often learns less than it would have from the dumb fix. Too many moving parts. Too many explanations. Too much room for rationalization.
That is the key point: AI makes it easier to create systems that are difficult to falsify.
Simple changes fail cleanly. Complex AI systems fail noisily.
Why Organizations Still Choose the Wrong Path
If the logic is this straightforward, why do teams still overbuild?
Because most organizations are not optimized for clarity. They are optimized for ownership, visibility, and perceived impact.
People do not just want to help the business. They want to own the thing that looks most important. And right now, few things look more important than an AI initiative.
That creates a predictable system effect:
- leadership wants AI on the roadmap
- performance systems reward visible AI work
- ambitious people gravitate toward AI-shaped projects
- skepticism gets mistaken for lack of vision
In that environment, the boring fix is not just boring. It is politically weak.
The person arguing for a small, measurable experiment can easily be framed as someone who is "thinking too small" or "not embracing AI." Meanwhile, the person proposing a broad AI platform looks strategic, ambitious, and aligned with executive excitement.
This is why the problem is not just technical judgment. It is incentive design.
People are often responding rationally to the system around them.
The Hard Part
There is no clean personal immunity to this.
I have fallen into the same trap. Not from incompetence, but from the same mix of ambition, curiosity, incentives, and genuine belief that a bigger AI solution might solve more than the narrow fix.
That is what makes this problem worth taking seriously. It does not only capture careless teams. It captures capable people who want to do meaningful work.
AI did not create bad judgment. It amplified the cost of unclear judgment.
A Better Standard
Before building the bigger system, teams should earn the right to build it.
That usually means:
- Pick one metric that matters.
- Identify the narrowest hypothesis connected to that metric.
- Ship the smallest change that can test it.
- Measure the result.
- Only add complexity if the simple version proves there is real lift to capture.
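"Measure the result" deserves to be just as concrete as the metric itself. A minimal, stdlib-only sketch of that step is a two-proportion z-test on activation counts (the sample numbers below are invented for illustration):

```python
from math import sqrt, erf

def lift_significance(ctrl_success, ctrl_total, trt_success, trt_total):
    """Two-proportion z-test: did the treatment move activation?

    Returns (absolute lift, two-sided p-value).
    """
    p1 = ctrl_success / ctrl_total
    p2 = trt_success / trt_total
    pooled = (ctrl_success + trt_success) / (ctrl_total + trt_total)
    se = sqrt(pooled * (1 - pooled) * (1 / ctrl_total + 1 / trt_total))
    z = (p2 - p1) / se
    # Two-sided p-value from the standard normal CDF.
    p_value = 2 * (1 - 0.5 * (1 + erf(abs(z) / sqrt(2))))
    return p2 - p1, p_value

# Hypothetical experiment: 24% activation in control, 30% in treatment.
lift, p = lift_significance(240, 1000, 300, 1000)
print(f"lift={lift:.3f}, p={p:.4f}")
```

Twenty lines of arithmetic is all it takes to replace storytelling with evidence. If a team is not willing to run this check, the "real lift to capture" in the last step is an opinion, not a result.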
If the simple fix works, great. You solved the problem cheaply.
If the simple fix partially works, maybe a more advanced solution is justified.
If the simple fix fails, that is also useful. You ruled out a hypothesis without dragging a large system into existence.
This is what disciplined speed looks like. Not building everything faster, but reducing uncertainty faster.
Measure More, Ship Less
The phrase sounds backward in a culture obsessed with output, but that is exactly why it matters.
Measure more.
Ship less.
Not because shipping is bad, but because unmeasured shipping is expensive theater.
AI does not change what wins. It changes what fails faster.
So when implementation becomes cheap, the standard should rise:
- clearer goals
- smaller tests
- tighter feedback loops
- less narrative, more evidence
Speed is useful only when you have a map.
Without that, you are not moving faster toward success.
You are just accelerating toward a more complicated failure.