Delafosse Olivier

Posted on May 25 • Originally published at coreprose.com

When Nonfiction Hallucinates: What “The Future of Truth” Teaches Us About AI-Fabricated Quotes

#ai #machinelearning #llm #programming

Originally published on CoreProse KB-incidents

A book about truth reportedly shipped with AI-fabricated quotes, presented as if real speeches and documents had been consulted.

For engineers, this is not just a media scandal but an incident report from a failed AI-assisted writing pipeline.

Large language models (LLMs) were never designed as reliable information-access systems, yet they are being built into research, drafting, and citation workflows as if they were grounded reference tools.[1]

Once an author or editor treats a stochastic text generator as a research assistant, hallucinated quotes in print become structurally predictable, not an edge case.[2][10]

💡 Framing for engineers

Treat nonfiction authoring as a safety‑critical domain with strict requirements for provenance, verification, and accountability—closer to security operations than casual copywriting.[2][3]

1. Why AI-Fabricated Quotes in Nonfiction Are an Engineering Problem

LLMs are optimized to produce coherent continuations of text, not to retrieve verified facts.[6] Used as quote machines—“Give me a sentence by X about Y”—they often fabricate plausible attributions, page numbers, and dates unless constrained by external evidence.[1][10]

Key points:

Goodlad and Stone show LLMs were commercialized despite being misaligned with safe public information access.[1]
Njenga and Madzinga define hallucinations as confident, plausible, but false outputs that erode data integrity and trust.[2]
A nonfiction book with synthetic quotations is a data-integrity failure moved from a SOC into the public record.

📊 Risk signal from benchmarks

On the AA-Omniscience benchmark, 36 of 40 models were more likely to give a confident wrong answer than a correct one on difficult questions.[10] Any workflow that lets such models invent historical quotes without checks is designed for silent corruption.

Analogies:

Shlomo shows AI in security operations can drown analysts in false positives.[3]
Editorial teams similarly face floods of authoritative-looking but synthetic citations that are hard to distinguish from genuine ones at scale.

Impact:

Kim et al. show AI-generated misinformation is a major driver of AI anxiety.[7]
When “serious truth” nonfiction contains fabricated quotes, it undermines trust in both AI systems and publishing institutions.[7][8]

💼 Section takeaway

This is not about one careless author; it is a systemic engineering failure: hallucination-prone models were placed in the critical path of nonfiction research without a verification stack.[1][2]

2. How LLM Architecture and Incentives Lead to Fabricated Authority

2.1 Next-token prediction, not truth checking

Transformers predict the next token from training distributions, not from a fact store.[6] When asked for quotes, they:

Infer style and content from patterns
Synthesize grammatically perfect attributions
Express high confidence regardless of evidence[6][10]

Harris et al. show these systems lack calibrated uncertainty and can emit very confident answers with no supporting evidence.[6][10] That overconfident style encourages over-trust in invented quotations that merely “sound right.”

⚠️ Architectural reality

Base models don’t know when they don’t know. They cannot distinguish “real line from a 1983 speech” from “statistically plausible pastiche.”[6][10]

2.2 Opaque training and traceability gaps

Goodlad and Stone document how LLMs are trained on vast, opaque text corpora.[1] Downstream tools cannot reliably tell whether a generated sentence:

Is memorized from a specific document
Is a paraphrase of many sources
Is a novel hallucination assembled from patterns[1][6]

Without retrieval or provenance logging, a quote generator cannot answer: “Where did this come from?”

2.3 Socio-technical incentives to trust the model

Njenga and Madzinga stress that hallucinations emerge from data, tooling, and organizational habits, not just model internals.[2] Teams under deadline treat LLMs as oracles because:

Outputs feel authoritative
Copy-editing optimizes style, not sources
No workflow step enforces checking originals[2][3]

Shlomo describes SOC feedback loops where unverified AI outputs are reused, reinforcing errors.[3] In publishing, AI-generated citations can be lightly edited, then reused as if confirmed, baking hallucinations into final text.

💡 Section takeaway

Given how transformers work and how they are deployed, fabricated quotations are the default unless retrieval, uncertainty handling, and verification are explicitly designed in.[1][2][6]

3. From Fabricated Quotes to a Wider Disinformation Surface

An AI-invented quote in a book looks like a local mistake but feeds a larger disinformation ecosystem.

Rahman’s “Algorithmic Leviathan” shows how AI systems shape political communication and propaganda at scale.[5] Once a fabricated quote appears in “serious” nonfiction, it can be:

Indexed by search
Cited by other authors
Circulated in social media fragments[5][6]

Harris et al. highlight AI-augmented disinformation as a core misuse: models cheaply produce persuasive false narratives.[6] Those narratives are harder to challenge when laundered through books, reports, and academic-sounding PDFs.

📊 Detection limits

Mössle finds AI fake-news detection is promising but unreliable.[9] Automated systems struggle with subtle issues like:

Real quote, fabricated date
Correct wording, wrong source
Plausible page numbers that don’t exist[9]

Noveck argues that automated systems already mediate what citizens see and believe.[8] When AI-generated inaccuracies are legitimized by major publishers and algorithmically amplified, they distort civic deliberation, not just niche debates.

Kim et al. show fears of AI-driven misinformation significantly contribute to AI anxiety.[7] Visible scandals around AI-fabricated quotes reinforce the sense that the truth layer is out of control.[5][7][8]

⚠️ Section takeaway

Each fabricated quote widens the disinformation surface, complicating democratic discourse and deepening distrust of AI and institutions.[5][7][9]

4. Engineering Reliable AI-Assisted Nonfiction Workflows

Imagine a small press using an AI assistant to draft endnotes. It produces quotes, page numbers, and summaries; under pressure, the team treats them as “probably right.” Months later, reporters find passages that never existed.

This is an engineering failure, not just a morality tale.

4.1 Ground everything in retrieval

Sidorkin’s review of AI platform security notes risks from memorization and data leakage and recommends cautious handling of prompts and training data.[4] For nonfiction tools, that implies:

Retrieval-augmented generation (RAG) with document-level grounding
Explicit source selection before generation
Avoiding unconstrained “invent a quote” prompts[4][6]

All quotes should be backed by specific, inspectable source documents with snippet previews and stable identifiers.

4.2 Hard constraints and provenance

Goodlad and Stone criticize “frictionless knowing” in current deployments.[1] Responsible tools must add friction:

Disable free-form quote generation without sources
Enforce structured citation schemas (author, work, edition, page, URL)
Log provenance for every AI-suggested passage, including model version and retrieval set[1][4]

Njenga and Madzinga emphasize human-in-the-loop verification.[2] A platform should prevent a quote from entering a manuscript until an editor explicitly confirms the underlying document.

💡 Example workflow (pseudocode)

def suggest_quote(query, kb):
    docs = retrieve(kb, query)          # BM25 + embeddings
    passage = select_snippet(docs)      # transparent scoring
    quote = llm.summarize_or_extract(passage)
    return {
        "quote": quote,
        "source_doc_id": passage.doc_id,
        "page": passage.page,
        "verified": False
    }

Editors toggle verified=True only after checking the document.

4.3 Routing and automated checks

The AA-Omniscience results show models are especially unreliable on difficult factual queries.[10] Orchestration should:

Route fact-heavy prompts to high-precision models or external knowledge bases
Run automated checks (URL/DOI resolution, date validation)
Flag unverifiable citations for mandatory human review[9][10]

Mössle’s work shows AI detectors help but cannot replace expert review.[9]

⚡ Section takeaway

Grounded generation, provenance tracking, model routing, and enforced human review are baseline requirements for AI-assisted nonfiction.[2][4][9]

5. Governance, Transparency, and Metrics for Truth-Critical Systems

Harris et al. argue governance must address tools and deployment contexts, not just models.[6] For publishing platforms, that means policies on:

When AI assistance is allowed for quotations
How provenance must be recorded and surfaced
What disclosures are owed to authors, reviewers, and readers[6][8]

Rahman’s analysis of AI as a political actor shows that design choices—watermarking quotes, exposing confidence scores—shape power and accountability.[5] Hiding AI involvement shifts responsibility onto authors and readers without giving them reliability tools.

Sidorkin notes average-user risk from major platforms is modest with precautions, but memorization and leakage still demand user education.[4] For nonfiction tools, that should include:

Clear warnings that generated quotes may be fabricated
Embedded guidance to verify against primary sources
Training materials for editors on model metadata and risks[2][4]

Kim et al. find educational and regulatory interventions can reduce AI anxiety.[7] Transparent labeling of AI-assisted sections, explicit editorial standards, and post-publication correction mechanisms help maintain psychological safety.[7][8]

Njenga and Madzinga highlight practitioners’ need for hallucination metrics.[2] For publishers and tool vendors, useful KPIs include:

Quote-level hallucination rate (pre- and post-publication)
Mean time to verify citations
Volume and severity of post-release corrections tied to AI assistance[2][10]

💼 Section takeaway

Governance for truth-critical AI must align tooling, policy, education, and measurement—treating every fabricated quote as a measurable defect, not an accident.[5][6]

Conclusion: Architecting for Truth, Not Just Text

The “Future of Truth” affair is not a bizarre one-off; it is what happens when hallucination-prone models are wired into editorial pipelines without retrieval grounding, provenance, or rigorous human verification.[1][2][10]

Research on LLM limitations, hallucinations, disinformation, and AI anxiety converges on a single point: unverified synthetic text in nonfiction erodes trust, enlarges the surface for manipulation, and destabilizes institutions that claim to tell us what is true.[5][6][7][8][9][10]

About CoreProse: Research-first AI content generation with verified citations. Zero hallucinations.

🔗 Try CoreProse | 📚 More KB Incidents