
Aral Roca

Originally published at kitmul.com

The Shift from Determinism to Probabilism Is Bigger Than Analog to Digital

The other day I had a bug in production. Something was rendering wrong in the UI and I couldn't figure out where it was coming from. I gave Claude the URL, it opened Chrome, inspected the HTML in the browser, cross-referenced what it saw in the DOM with my source code, and found the exact line where the error was. Not the file; the line. It navigated between the rendered output and the codebase, matched what was broken on screen to the component that produced it, and pointed me to the fix.

That moment stuck with me. Not because AI saved me time (it does that daily), but because of what it revealed about the nature of the answer. The model didn't know where the bug was. It inferred it. It observed the rendered HTML, estimated which parts of the code could produce that output, and surfaced the most probable origin. That probabilistic inference across two different representations of the same system (browser and codebase) was more effective than my deterministic debugging would have been.

We're living through something bigger than most people realize. The shift from deterministic to probabilistic computing isn't just a technical upgrade. It's a change in how knowledge gets created.

The first paradigm shift: analog to digital

Vintage audio equipment with VU meters and patch cables; the analog world where signals degraded with every copy

The move from analog to digital was the defining technology transition of the late 20th century. It converted continuous signals into discrete data. Suddenly you could copy information without degradation. Transmit it globally. Store it efficiently. The internet, distributed systems, modern software: all of it descends from that single insight, that continuous signals can be represented as sequences of ones and zeros.

But there was something that transition left untouched: the process of creation itself.

Digital software is deterministic. Given the same input, it produces the same output. Every line of code, every system, every product had to be explicitly designed, written, and maintained by a human. The computer executed instructions. It didn't generate anything it hadn't been told to generate. A SQL formatter formats SQL because someone wrote exact rules for how SQL should be formatted. A password generator produces random strings because someone implemented CSPRNG algorithms that define precisely how randomness gets produced.
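
In code terms, that determinism is the whole contract. Here's a minimal sketch, using a SHA-256 checksum purely as an illustration of same input, same output:

```python
import hashlib

def checksum(data: bytes) -> str:
    # Deterministic: the same input bytes always produce the same digest,
    # because every step of the algorithm is explicitly specified.
    return hashlib.sha256(data).hexdigest()

# Run this a million times, on any machine, and the result never changes.
assert checksum(b"SELECT * FROM users") == checksum(b"SELECT * FROM users")
```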

Deterministic systems are predictable, testable, and reliable. They're also fundamentally limited: they can only do what someone has already imagined and coded.

The second paradigm shift: deterministic to probabilistic

Area chart showing global information generated from 1986 to 2025, split across three paradigms: analog (teal), deterministic digital (blue), and probabilistic/AI (amber). The analog-to-digital crossover happened in 2002; human-to-AI content crossover in 2024. Logarithmic scale from 10 EB to 100 ZB.

With large language models and deep learning, we entered a new phase. Systems that don't execute rigid instructions but generate results based on probability distributions.

The difference is structural:

  • We no longer describe exactly what to do. We train models to learn how to do it.
  • We don't generate information manually. We infer it.
  • We produce answers, content, and decisions that were never explicitly programmed.

Think about what an AI content detector does. It doesn't have a list of "AI-written sentences" to match against. It computes statistical properties of text (Zipf's law conformity, punctuation entropy, sentence length distributions) and estimates a probability that the text was machine-generated. The detector itself is a probabilistic system analyzing the output of another probabilistic system. That's a sentence that would have been meaningless ten years ago.
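
To make that concrete, here's a rough sketch of the kind of statistical features such a detector might compute. This isn't any real detector's code, just two illustrative signals:

```python
import math
import re
from collections import Counter

def punctuation_entropy(text: str) -> float:
    # Shannon entropy of the punctuation distribution: one weak signal
    # among many, never a verdict on its own.
    marks = Counter(re.findall(r"[.,;:!?]", text))
    total = sum(marks.values())
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log2(n / total) for n in marks.values())

def sentence_length_variance(text: str) -> float:
    # Spread of sentence lengths in words; a detector combines features
    # like this into a probability, not a yes/no answer.
    lengths = [len(s.split()) for s in re.split(r"[.!?]+", text) if s.strip()]
    if not lengths:
        return 0.0
    mean = sum(lengths) / len(lengths)
    return sum((n - mean) ** 2 for n in lengths) / len(lengths)
```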

Or consider automatic subtitle generation. OpenAI's Whisper model doesn't follow if-then rules to transcribe speech. It processes audio spectrograms and predicts the most probable sequence of tokens that corresponds to what was said. It gets it right most of the time. Not all of the time. That "most of the time" is the defining characteristic of probabilistic systems.
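
For reference, running Whisper through its open-source Python package takes a few lines (the audio filename here is just a placeholder):

```python
# pip install openai-whisper
import whisper

model = whisper.load_model("base")          # smaller checkpoints are faster but less accurate
result = model.transcribe("interview.mp3")  # predicts the most probable token sequence
print(result["text"])
```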

This shift has a direct impact on the most valuable resource that exists: time. AI reduces the effort required to create, analyze, and predict by orders of magnitude.

Knowledge generation without precedent

The key difference is that probabilistic systems can work with the unknown. From learned patterns, they can:

  • Generate text, images, or code that has never existed before.
  • Predict future outcomes from incomplete data.
  • Find relationships that were never explicitly defined.

This breaks a historical constraint: we no longer need to write every possible case. The system can generalize.

Consider the Monte Carlo forecaster. Classical project management asked teams to estimate how long tasks would take and then added the numbers up. Monte Carlo simulation does something smarter: it runs thousands of scenarios using historical data and gives you a probability distribution of delivery dates. "There's an 85% chance you'll finish by March 15" is more useful than "the estimate is March 10." But here's a nuance that matters: Monte Carlo is still conventional, explicitly written code. Statistical procedures executed exactly as written, reproducible given the same seed. There's no inference; there's simulation. It's probabilistic thinking implemented on deterministic infrastructure. Today an LLM could make the same prediction without any of that code: you pass it the team's historical data and it gives you a reasonable estimate. But "reasonable" isn't "reliable." Until models reach 99.99% accuracy, hand-coded statistical simulations remain the safe bet. Monte Carlo is exactly the kind of tool that marks the transition: probabilistic thinking that still needs deterministic crutches.
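
A toy version of that forecaster fits in a few lines. The throughput history below is invented; the point is the shape of the technique, resampling past weeks instead of asking anyone for an estimate:

```python
import random

history = [3, 5, 2, 6, 4, 3, 5, 4]   # items finished per week (illustrative data)
backlog = 40                         # items left to deliver
simulations = 10_000

outcomes = []
for _ in range(simulations):
    remaining, weeks = backlog, 0
    while remaining > 0:
        remaining -= random.choice(history)  # resample a historical week at random
        weeks += 1
    outcomes.append(weeks)

outcomes.sort()
print(f"85% of simulated futures finish within {outcomes[int(0.85 * simulations)]} weeks")
```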

The same principle applies everywhere. A background remover running a neural network in the browser doesn't have rules about what counts as "background." It has learned probability distributions over millions of segmented images and applies those distributions to your photo. A prompt generator doesn't store pre-written prompts; it structures natural language patterns that probabilistically produce better model outputs.

Even tools that seem purely deterministic are being reshaped. HTML to Markdown conversion is deterministic; the same HTML always produces the same Markdown. But the reason that tool exists is probabilistic: people need clean Markdown because feeding raw HTML to an LLM wastes 60-80% of tokens on structural noise. A deterministic tool serving a probabilistic ecosystem.
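
As a rough illustration of the savings, a library like markdownify strips the structural noise while keeping the content. The HTML snippet is made up, and exact token savings depend on the tokenizer:

```python
# pip install markdownify
from markdownify import markdownify as md

html = '<div class="post"><h1>Title</h1><p>Hello <strong>world</strong>, welcome.</p></div>'
markdown = md(html)

print(markdown)                        # headings and emphasis survive; the <div> scaffolding doesn't
print(len(html), "->", len(markdown))  # fewer characters, and far fewer tokens for an LLM
```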

The current limitations: why it's not perfect yet

Despite the potential, current AI has real constraints:

Inference time. Generating responses means processing enormous quantities of tokens. A complex reasoning chain in a frontier model can take 30-60 seconds. That's fast compared to human analysis, but slow compared to a database query. The latency gap between "compute a hash" (nanoseconds) and "reason about a bug" (tens of seconds) is around ten orders of magnitude.

Probabilistic errors. Models don't "know" in the classical sense. They estimate. As of April 2026, GPT-5.5, Claude Opus 4.7, and Gemini 3.1 Pro score between 89% and 92% on MMLU-Pro, the harder benchmark that replaced the original MMLU. Each generation climbs a few points, but the numbers are still statistical. A graph traversal animator will always find the shortest path because BFS is deterministic. An LLM asked to find the shortest path will probably find it, but it might hallucinate an edge that doesn't exist.
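
For contrast, here's the deterministic side of that comparison: a plain BFS that always returns a shortest path on an unweighted graph, with no probability anywhere (the graph is made up):

```python
from collections import deque

def bfs_shortest_path(graph: dict, start: str, goal: str):
    # Breadth-first search: explores level by level, so the first time it
    # reaches the goal, the path is guaranteed to be a shortest one.
    queue, visited = deque([[start]]), {start}
    while queue:
        path = queue.popleft()
        if path[-1] == goal:
            return path
        for neighbor in graph.get(path[-1], []):
            if neighbor not in visited:
                visited.add(neighbor)
                queue.append(path + [neighbor])
    return None

graph = {"A": ["B", "C"], "B": ["D"], "C": ["D"], "D": ["E"]}
print(bfs_shortest_path(graph, "A", "E"))  # ['A', 'B', 'D', 'E'], every single run
```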

Classical infrastructure. These models run on hardware designed for deterministic computation: CPUs, GPUs, TPUs. NVIDIA's H100 is optimized for parallel matrix multiplication, which is what transformers need, but the underlying architecture is still classical. We're solving probabilistic problems with deterministic machines.

The trajectory: approaching 100% accuracy

The trend line is clear. Each new model generation improves on benchmarks, reduces error rates, and expands generalization capacity. Google's Gemini, Anthropic's Claude, and OpenAI's GPT families are converging toward accuracy levels that make the distinction between "correct" and "highly probable" practically meaningless for many tasks.

When models reach 99.99% accuracy on routine cognitive tasks:

  • Trust in AI systems will match or exceed trust in human judgment.
  • Most intellectual tasks that follow learnable patterns will be delegated entirely.
  • The marginal cost of generating knowledge will approach zero.

We're not there yet. But the distance is shrinking with every model release.

The bottleneck: classical compute vs. quantum compute

IBM Quantum System One installed at the Fraunhofer Institute; the world's first circuit-based commercial quantum computer, inside its airtight glass enclosure

Here's an idea worth sitting with: we're solving a fundamentally probabilistic problem using deterministic tools.

GPUs and TPUs parallelize massive calculations, but they operate under classical principles. This creates real constraints:

  • High energy consumption. Training GPT-5 class models required tens of thousands of NVIDIA H100 GPUs for months, at costs exceeding $100 million.
  • Expensive scaling. More parameters means more hardware, more cooling, more electricity.
  • Significant latency on large models.

The theoretical alternative is quantum computing.

Companies like IBM, Google, and D-Wave Systems are exploring QPUs (Quantum Processing Units) that work directly with probabilistic states through superposition and entanglement.

In theory, this would allow:

  • Solving certain calculations exponentially faster.
  • Modeling probabilistic systems natively instead of simulating them on deterministic hardware.
  • Drastically reducing the computational cost of AI inference.

If you want to see what quantum circuits actually look like, the Quantum Circuit Simulator lets you build and run circuits in the browser. Two lines of code create a Bell state: a maximally entangled pair of qubits where measuring one instantly determines the other. That kind of native probabilistic behavior is exactly what current AI infrastructure lacks.
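
If you'd rather stay in code, roughly the same two-gate circuit in Qiskit (not the simulator above, just a common open-source library) shows the behavior:

```python
# pip install qiskit
from qiskit import QuantumCircuit
from qiskit.quantum_info import Statevector

qc = QuantumCircuit(2)
qc.h(0)      # put qubit 0 into an equal superposition
qc.cx(0, 1)  # entangle qubit 1 with qubit 0: this is the Bell state

state = Statevector.from_instruction(qc)
print(state.probabilities_dict())  # ~0.5 for '00' and ~0.5 for '11'; '01' and '10' never occur
```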

The hard problem: quantum error correction

Quantum computing isn't ready for this role yet. The main obstacle is Quantum Error Correction.

Quantum systems are extremely sensitive to noise and interference. Every interaction with the environment can collapse a qubit's superposition, corrupting the computation. Current quantum processors have error rates that make them impractical for the sustained, reliable computation that AI inference requires.

For a QPU to be viable at scale, three things need to happen:

  1. Error rates must drop dramatically. Current physical qubits have error rates around 0.1-1%. Useful quantum computation needs rates below 0.0001%.
  2. Stable qubit counts must scale. IBM's current roadmap targets 100,000 qubits by 2033. That's ambitious, but the engineering challenges at each step are enormous.
  3. Fault-tolerant architectures must mature. Surface codes and other error correction schemes work in principle but require thousands of physical qubits per logical qubit. At that overhead, even a 100,000-qubit machine would yield only on the order of a hundred logical qubits. The overhead is still prohibitive.

What's at stake isn't just speed. It's energy.

Data centers consumed around 415 TWh in 2024, about 1.5% of global electricity. The IEA estimates they'll exceed 1,000 TWh by 2026, with AI as the main growth driver. Training a frontier model consumes the electricity equivalent of thousands of households for a year. A single inference query on a long prompt can burn over 33 Wh, roughly ten times what a Google search uses. And this scales. More models, more agents, more robotics with embedded AI: every layer adds energy demand.

The day quantum error correction gets solved, that equation changes radically. Current QPUs consume around 25 kW, most of it on cryogenic cooling rather than computation. But quantum computing works with the probabilistic problem natively instead of simulating it with trillions of matrix multiplications. Quantum compression algorithms already demonstrate 84% energy efficiency gains on specific AI tasks. And partial error correction is enabling quantum models to maintain high accuracy with thousands of qubits instead of millions.

When quantum error correction matures, we won't just have faster AI. We'll have AI that consumes orders of magnitude less energy per inference. That's what turns quantum computing from a lab curiosity into viable infrastructure for the billions of agents the future requires.

Until that happens, we'll continue running probabilistic AI on deterministic hardware. Which, honestly, works remarkably well for how fundamentally mismatched the paradigms are.

What this means practically

This isn't abstract philosophy. The shift from determinism to probabilism changes how you build, how you work, and how you think about tools.

A binary search tree lab teaches deterministic algorithms. Insert a node, traverse the tree, get the same result every time. That kind of certainty is still valuable. Databases still need B-trees. Routing still needs Dijkstra. Blue noise generators still need deterministic sampling algorithms to produce well-distributed random points.
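
That certainty looks like this: a textbook binary search tree insert (a generic version, not the lab's own code) where the same input order always yields the same tree and the same traversal:

```python
class Node:
    def __init__(self, key: int):
        self.key, self.left, self.right = key, None, None

def insert(root: "Node | None", key: int) -> Node:
    # Same keys in the same order -> same tree shape, every run.
    if root is None:
        return Node(key)
    if key < root.key:
        root.left = insert(root.left, key)
    else:
        root.right = insert(root.right, key)
    return root

def in_order(root: "Node | None") -> list:
    return in_order(root.left) + [root.key] + in_order(root.right) if root else []

root = None
for key in [8, 3, 10, 1, 6]:
    root = insert(root, key)
print(in_order(root))  # [1, 3, 6, 8, 10], deterministically
```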

But the layer above those deterministic primitives is increasingly probabilistic. The database query is deterministic; the AI agent that decides which query to run is probabilistic. The algorithm is deterministic; the model that selects which algorithm fits the problem is probabilistic. The text is deterministic once written; the system that extracts text from images using OCR neural networks is probabilistic.

We're building a stack where deterministic systems execute and probabilistic systems decide. That's new. And it's only going to accelerate.

The next step isn't better AI; it's a composable compute stack

Physics and mathematics equations on a black background; the formal layer that unifies deterministic, probabilistic, and quantum computation

What comes next doesn't replace one paradigm with another. It composes them.

Think about how modern infrastructure already works. CPUs execute deterministic logic: database transactions, cryptographic verification, your operating system's kernel. GPUs and TPUs solve probabilistic problems: they train models, run inference, process distributions across millions of parameters. Each layer does what the other can't. Nobody suggests replacing CPUs with GPUs. You combine them.

QPUs complete the third layer. They solve a class of problems that classical hardware simulates poorly: combinatorial optimization, high-dimensional distribution sampling, search across exponential spaces. AI inference doesn't just get faster; it becomes viable at scales that are currently intractable.

The stack looks like this: deterministic systems execute and verify. Probabilistic systems propose and generate. Quantum systems optimize what neither of the other two can touch.

But there's a piece almost nobody is talking about yet.

Models talking to machines talking to models

So far we think of AI as something that receives a prompt and returns text. That's like thinking the internet is email. The next layer is communication between probabilistic models, and between those models and the deterministic hardware they control.

A language model analyzes sensor data and decides it needs more information from a specific area. It communicates that decision to a vision model controlling a drone. The drone moves, collects data, processes it through another specialized model, and returns the results to the first one. No human intervened in the cycle. No human decided that area was interesting. The system inferred it.

That's not automation. Automation executes what a human designed. This is different: probabilistic systems deciding what data to collect, how to collect it, and what to do with what they find. Robots with AI models that choose to explore information sources that wouldn't have occurred to us.

Think about a marine drone with chemical sensors and a model trained on oceanic biodiversity. It doesn't follow a preprogrammed route. It detects an anomaly in water composition, infers it could indicate an unknown microbial community, adjusts its trajectory, and takes samples. It finds something no marine biologist would have looked for because none would have predicted it would be there. Another model analyzes the samples, identifies compounds with pharmaceutical potential, and asks the drone to return to the same zone with different sensors.

That's genuinely new knowledge. Not extracted from existing data or inferred from human text. Generated by a system that decided to go look for it.

When prediction takes nanoseconds

The current limitations are real. Inference takes seconds. Accuracy hovers around 86-95% depending on the benchmark. But those are the limitations of the first generation of a paradigm that's barely getting started. The trajectory points toward models with 99.99% accuracy and nanosecond response times.

When that happens, the world reorganizes in ways that are hard to imagine from where we stand now.

A self-driving car that takes 30 milliseconds to decide is a car that brakes late. One that decides in nanoseconds reacts before the obstacle finishes appearing. A network of medical models with 99.99% accuracy doesn't assist the doctor; it diagnoses with a reliability no human can match. A supply chain where every node has a predictive model communicating with all the others doesn't need quarterly planning; it reoptimizes in real time, every millisecond.

But the really important thing is what happens when probabilistic inference becomes so fast and accurate that it's functionally indistinguishable from deterministic certainty. The distinction between "computing" and "inferring" disappears. Your operating system doesn't need to distinguish between an arithmetic operation and a prediction. The compiler doesn't need to know if the result comes from formal logic or a 400B parameter model. It's all computation; part rule-based, part distribution-based, part quantum-optimized. Seamlessly integrated.

AI stops being a tool you open and becomes an infrastructure layer you don't even notice. Like electricity. Like TCP/IP. It's everywhere and you don't think about it.

Knowledge beyond the human

Robots with AI models don't just automate what we used to do. They perceive what we can't.

An ultrasonic sensor coupled with a materials model detects microfractures in a wind turbine that no visual inspection would find. A portable spectrometer with a chemical model identifies contaminants at concentrations human protocols don't measure. A hydrophone array with an acoustic model classifies marine species by sound patterns no biologist has cataloged yet.

These aren't incremental efficiency improvements. They're new sources of knowledge. Data that existed in the physical world but was invisible to us because we didn't have the right sensors coupled with the right intelligence to interpret them.

And here's where the loop closes. Those robots don't just collect data; the models inside them decide what data is worth collecting. But the cycle goes further than that. An AI analyzing ocean temperature anomalies determines that the existing sensor grid is too coarse; it needs readings at depths and frequencies no current instrument covers. So it designs a new sensor specification. A manufacturing robot builds it. A deployment robot installs it on a fleet of underwater drones. Those drones collect data that no previous system could capture, and that data trains a better model, which identifies the next gap in sensing capability, and the cycle begins again.

No human decided what to measure. No human designed the sensor. No human chose where to deploy it. The entire chain, from identifying a knowledge gap to filling it with new physical hardware, was driven by probabilistic inference.

When millions of such systems operate simultaneously, each one designing sensors for the others, deploying instruments that didn't exist a cycle ago, and feeding the results back into shared models, humanity gains access to a layer of reality it literally couldn't perceive before. It's not an improvement on what we already knew. It's access to what we didn't know we didn't know.

The full stack

Thirty years ago the question was "can you code?". Ten years ago it was "can you use APIs?". Today it's "can you direct models?". Tomorrow it'll be irrelevant; models will direct each other.

Deterministic systems execute and verify. Probabilistic systems propose, generate, and decide. Quantum systems optimize the intractable. Sensors and robots extend all of this into the physical world. And communication between models closes the loop: systems that discover what they don't know, decide how to find out, and act to get it. Without human intervention. Without anyone telling them where to look.

If the analog-to-digital shift redefined how we store information, and the deterministic-to-probabilistic shift is redefining how we generate knowledge, the full integration of models, hardware, sensors, robots, and quantum computing redefines the limits of what's possible to perceive and understand.

We won't be using better tools. We'll be surrounded by an intelligence that sees what we don't see, seeks what we don't seek, and finds what we didn't know existed.
