Алексей Гормен

LLMs Don't Need More Reasoning. They Need Better Failure Detection.

Everyone is trying to fix the same problem the same way.

Chain of Thought. Agents. Multi-step pipelines. Reasoning layers on top of reasoning layers.

The assumption is: LLMs fail because they don't think enough. So the fix is to make them think more.

I think we're optimizing the wrong thing.


What LLMs actually do well

Pattern matching is already surprisingly good.

LLMs recognize structure, generate coherent output, and solve most routine tasks — fast, cheap, at scale. In a huge number of real cases, pattern matching is enough.

The failure isn't that they can't reason. The failure is that they don't know when they're wrong.

A model with missing information still gives an answer. It gives it confidently. There's no internal "this might be off" signal. No hesitation. No flag.

That's the root of hallucination — not lack of reasoning, but lack of failure detection.


Humans work the same way — and it works

Kahneman described this decades ago. System 1: fast, intuitive, pattern-based. System 2: slow, deliberate, costly.

Humans run on System 1 almost all the time. We match patterns from experience, respond automatically, navigate most of life without "thinking." This isn't a bug. It's efficient.

But here's what humans have that LLMs don't: a signal that something is off.

A feeling of unease. A sense that the pieces don't fit. The moment where something catches — and System 2 switches on.

Not because thinking is always better. But because the brain learned to detect when fast pattern matching is failing. When to stop and look again.

LLMs don't have that signal. They generate confidently whether the pattern fits or not. Add more reasoning layers and you don't fix this — you get longer outputs, more structured errors, and higher costs.

In some cases it gets worse. Errors compound at each step. You end up with more convincing confabulation, not more accurate answers.


The simpler architecture

Instead of:

```
pattern → reasoning → reasoning → reasoning → output
```

Try:

```
pattern → error trigger → (optional) correction → output
```

Where:

  • pattern = the fast default path
  • error trigger = detection of uncertainty, contradiction, or missing data
  • correction = only fires when needed

The error trigger doesn't have to be complex. Even simple heuristics work:

  • Low confidence on the generated output
  • Contradiction between parts of the response
  • Missing data that the question requires
  • Unexpected structure in the input

These signals already exist in various forms — calibration research, uncertainty quantification, self-consistency checks. The question is whether they're treated as first-class architectural components or afterthoughts.


Why adding reasoning doesn't solve this

Reasoning layers assume the model knows it needs to reason. But the failure mode is precisely that the model doesn't know it's failing. It generates fluently either way.

More reasoning steps just give the model more opportunities to compound an error confidently. The output looks more structured and reasoned — and is equally wrong, or sometimes more wrong, because each step built on a flawed premise.

This is why CoT sometimes makes things worse on tasks where the base pattern is already broken. The model reasons its way to a more elaborate mistake.


What effective systems actually look like

95% fast path. 5% slow correction.

Not over-engineering every case. Handling exceptions — not everything.

This is how good human experts work too. A doctor doesn't laboriously reason through every routine case. They pattern-match quickly, and slow down only when something doesn't fit — an unusual symptom, a result that contradicts the expected picture. The slowdown is triggered by a signal, not applied universally.

The goal isn't to make every response more reasoned. It's to make the system aware of when it's outside the reliable range of its pattern matching.


The mental model shift

Instead of:

"LLMs need more reasoning to be reliable."

Think:

"LLMs need reliability signals — a way to detect when their pattern matching is failing."

Reasoning is one tool. Not the default solution.

The industry keeps adding layers because it's intuitive — more thinking seems like it should produce better answers. But the actual bottleneck isn't reasoning capacity. It's the absence of a failure detection mechanism.

Fast systems win. Reliable systems scale. The goal is both — not more layers.


What this means in practice

If you're building on top of LLMs:

Don't default to adding reasoning steps when outputs are unreliable. Ask first: does the model have a way to detect that it's in uncertain territory?

Lightweight checks — cross-validation, confidence thresholds, consistency checks between outputs — often outperform expensive reasoning chains. They're faster, cheaper, and they target the actual failure mode.
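One such lightweight check is self-consistency: sample the same question a few times and escalate only when the answers disagree. A sketch, assuming a `sample` callable that returns one answer per call (the agreement threshold of 0.6 is an arbitrary example):

```python
from collections import Counter
from typing import Callable

def consistent_answer(question: str,
                      sample: Callable[[str], str],
                      n: int = 5,
                      min_agreement: float = 0.6) -> tuple[str, bool]:
    """Return the majority answer and whether agreement clears the bar."""
    answers = [sample(question) for _ in range(n)]
    best, count = Counter(answers).most_common(1)[0]
    reliable = count / n >= min_agreement
    return best, reliable   # reliable == False → treat as "the model is guessing"
```

Disagreement across samples is exactly the kind of failure signal the model can't produce on its own.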

The question isn't "how do I make it think more?" It's "how do I make it know when it's guessing?"
