MVPBuilder_io

Posted on May 25

I gave Claude the full spec. It gave me 47 files. I shipped nothing for 3 months.

#sideprojects #productivity #ai #buildinpublic

The plan was perfect.

I had a full spec. Clean architecture. Claude had generated a 47-file scaffold in about 12 minutes. We'd talked through edge cases. We'd handled auth, error boundaries, the deployment pipeline. I could feel the momentum.

The project sat untouched for three months.

Here's the thing I kept getting wrong: I thought having a great plan was the hard part.

It isn't.

AI is genuinely good at plans. It's frighteningly good. Give it your spec, your constraints, your tech stack, and it will hand you something that looks like you already built it. The architecture diagram looks complete. The file tree looks real. The README reads like you've already shipped.

And that feeling — I've started to recognize it now — that feeling is not momentum. It's a very convincing substitute.

Because the plan doesn't have a Day 4.

Day 4 is when you come home after work, open the IDE, look at the 47 files, and genuinely don't know where to start. Day 4 is when the scaffold no longer feels like a head start. It feels like someone else's project. Day 4 is when you close the laptop and tell yourself you'll pick it up on the weekend.

The AI has no idea.

This is not an AI problem

I want to be clear about something: AI tools are not the villain here. This is not a post about AI being bad or overhyped or harmful. I use AI every day. MVP Builder — the thing I'm building — uses AI to generate every daily prompt.

But there's a specific failure mode I want to name, because I fell into it hard.

I started treating AI as a substitute for execution, not just a tool for it. The output felt like progress. The conversation felt like work. Every session with Claude felt like a productive evening. And technically it was — I had generated real artifacts. The problem was that generation is not shipping.

At some point I had to sit down and actually build the thing. Type the code. Hit the wall. Be confused for 40 minutes about why the hook wasn't firing. That part doesn't get delegated.

What the data actually says

In July 2025, a randomized controlled trial was published on arxiv (METR, arxiv.org/abs/2507.09089). 16 experienced open-source developers, 246 tasks, Cursor Pro with Claude 3.5/3.7 Sonnet.

The result: developers with AI access took 19% longer than developers without it.

Before the study, those same developers predicted AI would make them 20-24% faster. The gap between expectation and reality was roughly 39 percentage points.

The researchers haven't fully explained it yet. But if you've spent any time building with AI, you have a theory. I have mine: generation creates cognitive overhead. You have more code to understand, more decisions to reconcile, more surface area to maintain — and you feel like you've already done the work.

Separately: a 2024 analysis estimated that around 80% of vibe-coded AI projects never reach production. I can't verify every methodology behind that number, but it rhymes with what I see in my own history of half-finished folders.

And there's older research worth noting here. Gail Matthews published a study in 2015 on goal achievement. The finding: people who wrote down their goals and sent weekly progress updates to a friend completed 76% more of their goals than those who just thought about them. That's not an AI study. That's a study about what actually makes you follow through.

The tool doesn't do that part.

One sprint, two different outcomes

I ran a beta of MVP Builder earlier this year. One user — I'll keep him anonymous — was building UAV planning software. Mission planning, waypoint exports for DJI drones, real technical work.

He shipped his Day 13 milestone. 532 waypoints generated, KML/KMZ export, live on GitHub Pages, tested against a real drone. AI helped him build it. The sprint structure kept him moving.

Then Day 14 came.

No check-in. Day 15, nothing. Day 21, I sent him a personal email. No response.

The AI had absolutely no idea he had stopped. It wasn't watching. It couldn't notice. It had no mechanism to care.

The sprint worked when he was in it. Day 14 is not a tool problem. It's a human problem — and no tool is going to solve it by being a better tool.

That's not a failure of AI. That's just what AI is.

What the enforcement layer actually does

MVP Builder is a structured 30-day sprint for full-time devs who want to ship their side project. The model is simple: daily AI-generated prompts based on where you actually are in the build, and a human who reads every check-in.

That second part is the one that matters.

Every check-in goes to me directly. When someone misses a day, I know. When someone's writing three-word check-ins instead of real ones, I know. The AI generates the question. I read the answer.

This is not accountability theater. It's not a streak counter or a gamification badge. It's just someone noticing.

The accountability research is consistent on this: the act of reporting to another person — not tracking it yourself, but someone else reading it — is what drives follow-through. You can build all the habit scaffolding you want. The thing that actually works is knowing someone is going to see whether you showed up.

AI gives you the plan. The enforcement layer is the part that makes you do anything with it.

The honest version of what I'm building

MVP Builder is not a magic system. It won't ship your project for you. If you stop showing up, you won't finish — same as everything else.

What it does is make it structurally harder to drift. Daily prompts calibrated to your actual progress. A human in the loop who reads what you write. Milestone reviews before you move forward. And a 30-day window, because open-ended feels manageable until it doesn't.

Three tiers based on where you are: Bronze (13 days, $67) if you haven't started yet. Silver (21 days, $117) if you've started but stalled. Gold (30 days, $179) if you're almost done and can't cross the finish line.

Cohort 2 is open now.

If you've got a project that's been sitting in the 47-files phase for longer than it should, and you want a structure that actually enforces your own intentions — applications are at mvpbuilder.io/pipeline.

It's five questions, five minutes.

The hard part comes after.

Top comments (4)

Harjot Singh • May 29

the 47-files-paralysis is the most under-discussed failure mode of giving an LLM the full spec at once. claude generates the COMPONENTS but not the wired system - so u get a pile of files no human assembled. been building moonshift to solve exactly this: orchestrator runs in phases (scaffold -> integrate -> deploy -> verify), enforces invariants between phases, outputs a single shippable saas to ur own gh + vercel instead of a folder of components. $3 per ship. first run free if u want to compare against ur claude experience.

MVPBuilder_io • May 30

The phase-enforced orchestrator framing is sharp — "scaffold → integrate → deploy → verify" with invariant checks between phases is exactly what's missing when you dump a full spec at once. That's a real gap worth building around. What I kept running into was one layer up: even after you have the deployed SaaS, the problem shifts. The dev with a full-time job ships it on a Friday, doesn't touch it for 11 days, and the momentum dies before they've shown it to a single user. Not a wiring problem anymore — more of a "what do I actually do on day 4 at 9pm when I'm exhausted" problem. Different failure mode, same graveyard. Curious what your first-run results look like. Do users who get through the deploy+verify phase actually ship the thing publicly, or does it still stall at the last step?

Harjot Singh • May 30

the day-4-9pm exhaustion is the right diagnosis, and it's where most of our retention bend sits too. what helps in practice: ship the first launch post + 3 followups WITH the saas so the founder wakes up to inbound. without that, most do stall at promotion. lots of room left to tighten the post-deploy loop. if you want to poke at it: moonshift.io, first run free no card.

MVPBuilder_io • May 30

The day-4-9pm exhaustion resonates — that's the hinge point where most projects die quietly. No announcement, no decision to quit. Just one skipped session that turns into two. You're right that the post-deploy loop is its own problem. What I've been watching is the step before: most devs don't get to the launch post because they stall in the build phase first. The "almost done" project that's been almost done for 6 months. Moonshift sounds like it targets the moment after ship — curious what the activation looks like when someone signs up at 11pm after their first LinkedIn post goes out.