Benjian Dai

Posted on May 31

I shipped three fixes to my product in seven days. All three came from readers.

#buildinpublic #saas #indiehackers #startup

Last week I wrote about running my own idea-validation product (MonetScope) through its own validator. The verdict came back PIVOT, 68% confidence. The post is here if you missed it.

In the seven days since, four readers gave substantive feedback across IH, X, and email. Three of them produced shippable product fixes inside that same week. None of them came from a feature spec. They came from people who read the verdict, sat with it, and pointed at things I had not seen.

This is the writeup of those three ships. What broke, what changed, and the one pattern that ties them together.

Fix 1: A reader said my measurement was wrong

Two days after the post, a founder named Shota left this comment on the IH version:

The "zero direct WTP mentions" critique is the one I'd take most seriously. In my experience scraping Reddit/HN pain points, people complain about workflows in granular detail but almost never volunteer price anchors unprompted. Absence of WTP language is more a corpus artifact than a demand signal. One way to recover that signal without changing your scrape: look for adjacent commercial intent phrases ("I'd pay for", "would buy", mentions of existing paid tools they're frustrated with) and treat those as proxy WTP evidence. Pivot is a strong word for a 68% confidence score with that kind of measurement gap.

He was right about the methodology. The original Pro Validate WTP score scanned the corpus for five explicit phrases like "I'd pay for" and "shut up and take my money." Those phrases almost never surface unprompted on forums. A score that returns 0/34 quotes in those terms is mostly telling you about forum etiquette, not about willingness to pay.

I shipped the fix on day three: added an adjacent commercial intent layer with 23 proxy phrases like "currently paying", "would buy", "switched from", "I'm paying $X for". Updated the AI prompt to treat WTP as a strong negative signal only when both channels (direct + adjacent) come back low. If direct = 0 but adjacent > 0, that is corpus artifact, not absence of demand.

Then I re-ran the same MonetScope self-validation with the corrected methodology.

The verdict went from PIVOT 68% to PIVOT 75%.

That was not what Shota or I expected. The hypothesis behind the fix was that adjacent intent would recover hidden demand signal and the verdict might soften. Instead the corrected scoring confirmed that both channels were sparse in our corpus, which ruled out the corpus-artifact explanation entirely. The PIVOT got more confident, not less.

This is the result I actually want from a methodology fix. The day a fix flips PIVOT to PROCEED on the same input is the day the tool is too eager to please. A fix that strengthens a negative verdict is how you start trusting the measurement.

Fix 2: A reader said the verdict needed glue

On X, around the time of the post, a founder named Niki replied to one of the build-in-public threads about the case study:

33% activation is a useful signal. Now the question is whether the activated users reached enough value to justify paying, or just tried the feature once.

When I answered with the actual numbers (3 of 9 users had touched a paid feature, exactly once each), she came back with this:

Sometimes the leak isn't the result itself, it's whether the result makes the next step feel obvious enough.

That sentence named the bug I had been staring at without seeing.

Pro Validate produces a verdict, four key reasons, four card-level analyses, and a Validation Playbook with six to nine prioritized actions. The user reads it all in one sitting. Then they close the tab. The 33% activation rate in our data isn't an activation problem in the normal funnel sense. It's a problem of the Playbook being trapped on a web page nobody returns to.

Three days from her reply to ship. The fix is a button that emails the full report to the user's inbox: verdict, key reasons, card highlights, and the entire Playbook with priority colors and linked assumptions. Backend renders HTML (not PDF, mobile compatibility matters more than print fidelity). Frontend has a four-state button with idle/sending/sent/error transitions. Rate limit 5 per hour per user.

Why email and not the more obvious answers like push notifications or in-app reminders: email lives in an inbox the user already returns to. The web report dies when the tab closes. The cheapest fix that survives the "user closes tab and never comes back" failure mode wins, and I was not going to build a notification system to find out whether the underlying intuition was right.

The total cycle was about 65 hours from her X reply to ship and a follow-up reply on the same thread confirming what had landed. That cycle time matters more than the polish of the fix. Readers who see their feedback turn into shipped code inside three days give sharper feedback the next round.

Fix 3: A reader said my landing copy was being misread

Days before the original case study post went out, a new signup had emailed me back after I sent the standard founder check-in. She had been on the platform less than 24 hours and had completely misread what it does. Her first reply:

I actually joined the platform as a founder/seller. I'm currently exploring opportunities to sell or present my AI governance project and connect with potential buyers or interested partners.

I asked what specifically had made her think MonetScope was a marketplace. Her reply was the cleanest landing-copy audit I have ever received from a stranger:

the wording around "opportunities" and the overall presentation gave me the impression that the platform could also help founders connect with potential buyers, partners, or commercialization opportunities for their projects.

The word "opportunities" was doing the wrong work in the buyer's head. Buyers parsed it as "commercialization opportunities for my project." We meant it as "validated user pain signals." Same word, completely different semantic register.

But the real surprise was what the fix exposed. MonetScope actually has two product modes:

Validate one idea, get a verdict. Snapshot. Single moment.
Monitor an opportunity for 30 days, get daily trend deltas and weekly competitor refresh. Stream. Continuous.

Mode 2 had been running in production for months. The landing page only showed Mode 1.

The ship took most of a morning. Replaced "12,000+ opportunities" with "12,000+ validated pain signals" across the hero and Validate section. Added a new Monitor section: headline "Validate is the snapshot. Monitor is the stream." Reframed the hero badge from "AI-Curated Startup Opportunities" to "Continuous Market Signal Intelligence." Updated the Pricing compare table so 30-day continuous tracking is now a visible product line, not a footnote.

The fix was almost entirely copy. The underlying product was already there. What was missing was the buyer being able to see it.

What this taught me about build-in-public

Three readers, three completely different bug types:

Reader	Bug type	Reader profile	Where they caught it
Shota	Methodology	Builder peer running adjacent product	IH comment on the original post
Niki	UX / activation	Founder fluent in product mechanics	X reply during the case study week
The new signup	Positioning	Actual user, less than 24 hours into the product	Direct email after she misread the landing

None of these three readers could have caught the other two bugs. Shota would not have flagged the "opportunities" misread because he understood the product too well. The new signup would not have audited the WTP scoring because she does not look at the code. Niki would not have written the landing copy audit because that is not the lens she reads with.

The reason build-in-public actually works, in my experience, is not audience growth or marketing reach. It is that external readers, in aggregate, catch bugs that no internal team can, partitioned by what kind of reader each person is. You cannot manufacture this internally because you are exactly one reader type.

One last data point. A separate user (running parallel work on her own product, not these three) summarized the week back to me in a DM:

The verdict was the conversation starter, not the conclusion.

She had taken a PIVOT verdict on her own idea and spent the following eight days running a structured distribution experiment that disproved her own targeting assumption. She did not kill the idea. She corrected the underlying hypothesis using data she generated herself.

Verdicts are conversation starters only when the conversation stays open. The three fixes above only happened because three readers treated the original post as something to push back on, not something to congratulate.

What I am doing for the next four weeks

The PIVOT verdict still stands at 75%. The three ships moved positioning, methodology, and UX. They did not change the underlying market truth about WTP and category density. Those are the next thing.

What is continuing:

WTP interviews queued for the founders who touched the paid tier. Up to fifteen. Starting next week, will run through mid-June.
A/B test of "validated pain signals" versus "opportunities" framing on landing variants. About two weeks of data needed.
The eight active builder-peer threads from this cycle. Most of them have not yet given me what they are going to give me.

What is intentionally not changing:

No new feature roadmap until the WTP interviews finish.
No paid acquisition spend until the activation to retention story is clearer in real data, not in framework.
No declaration that anything is fixed.

I will be back here in three to four weeks with what the WTP interviews actually surfaced. If you have been on either side of a concierge pilot for an enterprise AI product, I would love to hear about it before then. The replies on the first post taught me more than the post itself did.

If you want to try the validator (or push back on the verdict logic the way Shota did), it is at monetscope.com/validate/pro. Same honest disclosure as last time: the AI verdict feature is paid, the basic idea validator is free, and pushback in the comments still ships.