Claude Code is the fastest coder I've ever worked with. It can scaffold a feature, write tests, and open a PR in minutes. But I kept running into t...
I think that plan is solid in itself and tries to enforce some very important software engineering and QA practices.
That said, the assumption that you can make Claude think something in particular or assume a software architect role -- hell, that any LLM can think at all -- is misguided. Instead you're just trying to offset statistics in your favour. Statistics-based output will still bullshit you now and then, and you still won't know whether it does, or not. (I see you trying very hard to, with all that reference output for human oversight, but still: LLM slop remains, no matter how hard you push.)
I will still try (and perhaps keep) using this as part of my ever-growing push for quality.
My point is that caution should go up, not down, with increasingly sophisticated and convincing output.
It’s not a bulletproof solution to let AI build things for you. These are simply guardrails, grounded in old-school, battle-tested software architecture and development principles that guide the AI in the right direction.
Sure, I dress it up with a bit of marketing flair, otherwise where’s the fun? But details aside, it works far better than raw prompt slinging. The goal is simple: make your life easier while still respecting the fundamentals of good software engineering.
The "process over intelligence" framing is exactly right. I've been applying the same thinking to AI outside of coding — specifically to email communication.
Most AI email tools operate in what you'd call "junior mode": they see an email, they generate a reply, they fire. Fast, enthusiastic, and occasionally catastrophic (wrong tone to a client, missing context on a sensitive thread, confidently replying to something that needed a human pause).
Your 8-phase approach maps surprisingly well to non-coding AI workflows, and email in particular has a close equivalent for nearly every phase.
I'm building a Mac app (Drafted) that follows this exact philosophy. It reads your Gmail inbox, assesses confidence on each email (High/Medium/Low), and pre-drafts replies — but crucially, it never sends anything. You always review, edit, and hit send yourself.
The parallel to your CLAUDE.md approach: just like you encode process into the prompt so Claude doesn't skip steps, we encode process into the tool so the AI doesn't skip the "should a human look at this first?" step. The answer is always yes.
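To make the "human always reviews" gate concrete, here is a minimal sketch of that pattern in Python. All names (`Draft`, `assess_confidence`, `prepare_draft`) and the keyword heuristic are hypothetical illustrations, not Drafted's actual implementation — the point is only that the AI drafts and scores, while sending lives outside the tool entirely.

```python
from dataclasses import dataclass
from enum import Enum

class Confidence(Enum):
    HIGH = "high"
    MEDIUM = "medium"
    LOW = "low"

@dataclass
class Draft:
    reply: str
    confidence: Confidence
    sent: bool = False  # the tool never flips this; only a human does

def assess_confidence(email_body: str) -> Confidence:
    # Toy heuristic standing in for a model's self-assessment:
    # anything touching money, legal issues, or conflict is flagged
    # for careful human review; short routine emails score high.
    sensitive = ("contract", "refund", "complaint", "legal", "urgent")
    if any(word in email_body.lower() for word in sensitive):
        return Confidence.LOW
    return Confidence.HIGH if len(email_body) < 500 else Confidence.MEDIUM

def prepare_draft(email_body: str, generated_reply: str) -> Draft:
    # The AI drafts and scores -- it never sends. Sending is a
    # separate, human-initiated action that this function cannot reach.
    return Draft(reply=generated_reply,
                 confidence=assess_confidence(email_body))
```

The design choice worth noting: `sent` has no setter anywhere in the pipeline, which is the code-level version of "the answer is always yes, a human looks first."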
Curious if you've thought about applying the /wizard methodology beyond code — documentation, communication, operational workflows? The "think before you act" principle feels universally applicable.
Thanks for the kind words, and Drafted sounds like a genuinely thoughtful take on AI-assisted communication. The confidence-scoring layer is smart -- a lot of tools skip straight to drafting without asking whether this is even the kind of email that should be auto-drafted at all.
Funny you asked about applying the methodology beyond code -- I shipped something yesterday that does exactly that. Battle Mage is a Slack agent powered by Claude that answers questions about a GitHub codebase when you @mention it. It reads your repo in real time, follows up in threads, and even lets you correct it so it builds a shared knowledge base over time. For me it started as a way to help new users onboard to a product without drowning a small team in repetitive questions -- same "think before you act" principle, different domain.
Repo is here if you want to take a look: github.com/vlad-ko/battle-mage
Still needs some polishing, but I plan on announcing it to the world shortly.
P.S. I've got a whole magical AI army all of a sudden 😆 it's lore I can enjoy.
I couldn't agree with your thoughts more. What resonates most in this piece is the idea that the real bottleneck isn’t model intelligence—it’s the lack of process. The article makes this clear when it notes that Claude “defaults to junior mode… not because it lacks knowledge, but because it lacks process”. That distinction matters.
LLMs can generate code at incredible speed, but speed without structure just accelerates the path to subtle regressions, race conditions, and “it worked until it didn’t” failures. What /wizard does well is exactly what senior engineers do instinctively: slow down the beginning so the end goes faster. Planning, exploring the codebase, verifying assumptions, writing mutation‑resistant tests—these are the habits that prevent the 2am incidents described in the article, like the nullable datetime crash or the missing database lock.
But even with a strong process prompt, the human role doesn’t disappear. If anything, it becomes more important. The human is still the one holding the architectural context, the product intent, the long‑term tradeoffs, and the “should we even build this?” perspective. AI can execute steps, but it can’t yet own the big picture.
That’s why I see frameworks like /wizard as less about making AI autonomous and more about making AI reliable. The human sets the direction; the process keeps the AI from cutting corners; and the combination produces work that’s both fast and trustworthy.
In other words: intelligence is useful, but process is what makes intelligence safe and smart to use.
100% agree, Larry. This is exactly why I never let Claude Code work directly on main -- always a feature branch, always a PR, always a human review gate before anything merges. The process prompt keeps AI from cutting corners mid-task; the branch discipline keeps humans in the loop at every stage. Both layers matter.
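That branch discipline is easy to automate as a pre-flight check in whatever wrapper launches the AI session. A minimal sketch, assuming you invoke `git` via `subprocess` (the function names and the `feature/my-change` example are mine, not from any existing tool):

```python
import subprocess

PROTECTED_BRANCHES = {"main", "master"}

def current_branch(repo_dir: str = ".") -> str:
    # Ask git which branch is checked out in repo_dir.
    out = subprocess.run(
        ["git", "rev-parse", "--abbrev-ref", "HEAD"],
        cwd=repo_dir, capture_output=True, text=True, check=True,
    )
    return out.stdout.strip()

def assert_safe_branch(branch: str) -> None:
    # The gate: refuse to start an AI coding session on a protected branch.
    if branch in PROTECTED_BRANCHES:
        raise RuntimeError(
            f"Refusing to work on '{branch}'. Create a feature branch "
            "first, e.g. git switch -c feature/my-change"
        )

# Typical use at the top of the wrapper script:
# assert_safe_branch(current_branch())
```

Keeping the check (`assert_safe_branch`) separate from the git call (`current_branch`) also makes the policy trivially testable without a repo.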
The think-then-plan-then-code flow is solid — worth noting that coupling the /wizard skill with spec-driven prompts could let you enforce architectural constraints before any code gets generated, not just testing conventions.
Great point -- spec-driven prompts as an upstream constraint layer is exactly the right direction. Think of it as guardrails before the guardrails. The EXPLORE phase in /wizard already nudges Claude to read existing patterns, but encoding architectural rules explicitly in a spec would make that much harder to accidentally violate. Worth experimenting with.
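One way such a spec could be encoded and enforced before any generated code touches disk: a small layering-rule checker. This is a hypothetical sketch — the rule format, layer names (`ui`, `db`), and function names are invented for illustration, not part of /wizard:

```python
# Each rule forbids one layer from importing another, e.g. the UI
# layer reaching directly into the database layer.
SPEC_RULES = [
    {"from_layer": "ui", "forbid_import": "db"},
    {"from_layer": "db", "forbid_import": "ui"},
]

def violations(module_path: str, imports: list[str]) -> list[str]:
    # Return human-readable violations for a proposed module, so the
    # check can run on AI-generated code before it is written to disk.
    layer = module_path.split("/")[0]
    problems = []
    for rule in SPEC_RULES:
        if layer == rule["from_layer"] and any(
            imp == rule["forbid_import"]
            or imp.startswith(rule["forbid_import"] + ".")
            for imp in imports
        ):
            problems.append(
                f"{module_path}: layer '{layer}' must not "
                f"import '{rule['forbid_import']}'"
            )
    return problems
```

Run against each proposed file's import list, a non-empty result blocks generation and gets fed back to the model as a correction — guardrails before the guardrails, exactly as described.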
Phase 7 is the one that resonated most. The adversarial review mindset — reviewing your own output as an attacker before shipping — applies well beyond code.
I built a Mac app that uses AI to draft email replies, and the hardest design decision was how much autonomy to give the AI. The answer: none for sending. The AI drafts, the human reviews, the human sends. Every time.
The confidence scoring system I built is basically a lightweight Phase 7 — the AI evaluates its own draft and flags how confident it is. High confidence means it nailed the tone and context. Low confidence means "read this carefully before hitting send."
Your line about "enthusiasm does not catch nullable datetime crashes" is perfect. In email, enthusiasm doesn't catch wrong names, misread tone, or confidently wrong recommendations. The adversarial review mindset applies everywhere AI touches human-facing output.
Bookmarking the /wizard skill — the 8-phase methodology is a solid framework for any AI-assisted workflow, not just coding.
Indeed. To be honest, this is largely the same approach I’ve always taken to software architecture and development, long before AI-assisted coding was even a distant possibility. The presence of AI on our side shouldn’t be an excuse to cut corners on tried and proven methodologies. If anything, it gives us a more sophisticated set of tools to follow those principles more consistently and enforce them more effectively.
Claude Code with a properly written CLAUDE.md is 10000000000000x better than this.
Before I installed this, I had clean, secure code.
After I installed this, I had to spend hours cleaning and securing the code.
You did something horribly wrong 😅