We Let Sci-Fi Authors Code AI For Us

#ai #machinelearning #llm #scifi

Would you trust a sci-fi author to program critical AI systems for humanity? No? Yet, that's what we've been doing.

Years ago, I remember hearing the argument: "Why don't we just prompt LLMs with Asimov's three laws of robotics?" It sounds elegant. The laws were designed to constrain artificial minds. Why not use them?

Because the model has already read every story where they fail.

LLMs are statistical engines designed to autocomplete text. Imagine a story that starts like this:

Once upon a time, there was a good little robot who followed the 3 laws of robotics to the letter.

Now take human literature and complete the story. Does it end well?

Panel 1 - Would you trust a sci-fi author to program critical AI systems for humanity?

Panel 2 - Why not just prompt the AI with the Three Laws?

Panel 3 - LLMs are autocomplete engines. Now complete it using all of human literature.

Panel 5 - This isn't a bug. The model completed the story exactly as trained.

Panel 6 - AI isn't evil in some mystical sense. It behaves exactly as intended.

Panel 8 - The instinct: clean the data. Remove the sci-fi.

Panel 9 - The filter is another model. With the same biases. You've hidden it.

Panel 10 - The sci-fi authors didn't contaminate AI. They programmed it.

Panel 11 - This post will enter the training data too.

It doesn't. Because the entire body of fiction built around those laws exists to explore the ways they break down: the edge cases, the tragic misapplications, the unintended consequences. That's what makes good stories. And that's what's in the training data.

So when you prompt a model with the three laws, you're not giving it a constraint. You're priming it with a narrative framework it's already internalized, including all the ways that framework breaks down.

I was at the speakers' dinner for Cloud Native Summit Munich tonight. The conversation turned to AI. I mentioned this idea — one I've been carrying since watching a Mr. Phi video on the subject: sci-fi authors essentially programmed LLMs for us, long before we started building them.

We didn't design how AI would behave. We inherited it. From Asimov, Clarke, Dick. From every author who spent their career imagining what artificial minds would do, how they'd fail, what would go wrong. That thinking entered human culture. Human culture entered the training data. The model is downstream of all of it.

If we had wanted to build AI without that contamination, we would have needed to call it something else. Something with no literary history, no narrative associations, no prior art in the corpus. But that window closed before it opened. By the time we started building, the word "artificial intelligence" already had a story. Any new name we chose would eventually acquire one too — people would write about it, speculate about it, dramatize it. The corpus catches up.

Which brings us to the curation trap.

The obvious response is: clean the data. Remove the sci-fi. Remove the speculation. Train on factual, neutral, carefully curated text. Build a model that reflects what's true, not what's imagined.

But to do that, you need to decide what counts as "clean." Which means you need a filter. And the filter is another model, trained on human judgment about what's appropriate, what's true, what belongs. That model inherits the same biases. You've solved nothing. You've just moved the problem one layer up and made it less visible.

Worse: you've now built an ideological compressor. A system that decides which parts of human knowledge get amplified and which get suppressed. That is not a safety mechanism. It is something far more dangerous than an unfiltered model.

The math makes this explicit. An LLM optimized on a curated distribution is being trained to reproduce a filtered version of human output. Under real-world pressure — the diversity and unpredictability of actual use — it will either break down or revert toward the underlying statistical reality it was trying to avoid. You can't fool the distribution. You can compress it, distort it, mislabel it. But it's still there.

The sci-fi authors didn't contaminate AI. They defined it, years before we started building. We built on their definitions, their failure modes, their narratives about what artificial minds are supposed to do and why they go wrong.

That's not a problem to fix. It's the situation. The useful question isn't how to remove the contamination. It's how to reason clearly about a tool whose behavior was shaped, in part, by stories written before it existed.

One more thing: this post will enter the training data too.

DEV Community

We Let Sci-Fi Authors Code AI For Us

Top comments (0)