Discussion on: AI Agents Are Shipping Features Without You. Now What?

View post

The line that landed for me was about the agent being wrong in ways that are coherent and hard to catch. That's the specific failure mode that keeps me up. Not the obvious errors—those you spot immediately and fix. It's the plausible, internally consistent, slightly-off solution that slides through review because nothing looks broken.

What's quietly terrifying is that reviewing agent output might actually be harder than writing the code yourself. When you write it, you carry the mental model of why each decision was made. You know where the sharp edges are because you put them there. Reviewing someone else's work—even an agent's—means reconstructing intent from artifact. You're reverse-engineering the reasoning from the output. And if the output is coherent but wrong, the mismatch between "looks right" and "is right" is almost invisible.

Your point about junior developers missing the repetition that builds judgment is the long-term version of the same problem. If you never write the insecure SQL query yourself, never see it get exploited, never feel that specific sting, do you develop the instinct to spot it in a code review? Or do you just learn to trust that the agent "usually" handles it?

The OINP bot is a good counterweight. Small scope, clear success criteria, low stakes if it gets the parsing slightly wrong. That's the sweet spot. What I'm wondering is whether you've found yourself writing more tests for agent-generated code than you would for your own. Feels like that might be the unspoken tradeoff—delegating implementation but doubling down on verification.

Jaskaran Singh • Apr 22

The reverse-engineering-intent-from-artifact framing is exactly right and I hadn't put words to it that cleanly. When you write it yourself, the mental model and the code exist together. When you review agent output, you only have the code. You're inferring the model, and if the model was slightly off to begin with, you're reconstructing a faulty map from terrain that looks completely normal.
On the tests question: yes, noticeably. I write more, and I write different tests. When I write the code myself I tend to know the edge cases because I thought through them while writing. With agent output I don't have that context, so I end up writing tests that are almost adversarial. Trying to break it rather than confirm it works. It's slower but I think it's the right instinct.
The SQL exploit point is the one I keep coming back to. There's a version of the next generation of developers who are extremely good at prompting and reviewing but have never felt the specific dread of finding an injection vulnerability in their own code. Whether that's fine or a real gap, I genuinely don't know. My gut says it matters but I can't fully defend that yet.