Most people do not think about file formats. We focus on the words, the contracts, the medical records, and the transactions. The format is invisible, like plumbing or electricity, until it fails. And when it fails in healthcare, legal, or financial systems, the consequences are severe.
Right now, we are doing something highly illogical: we take structured database rows, serialize them into static visual coordinates (a PDF), and then pay neural networks and large language models billions of dollars to scrape those visual coordinates back into database rows.
We are literally paying artificial intelligence to undo the failures of our own file formats [1].
There is a better way.
It is free, open source, and already working.
Its name is the Universal Document Standard (UDS) 1.0.
Instead of treating a document as a picture of text, UDS treats it as a self-describing, cryptographically signed, block-structured database that renders itself natively in any web browser. A picture shows the surface, but a document should show the structure.
THE DESIGN: DATA BLOCKS AND SECURITY
To understand why UDS works, you have to look at how it handles data. A PDF describes where characters sit on a page [1, 2]. A UDS file describes what those characters mean by dividing the document into a tree of distinct semantic nodes called Blocks.
This diagram shows the core architecture of UDS. Instead of a single flat image file, the document is made up of modular database blocks stacked together. Each block is cryptographically linked to the one before it. If someone edits a single word in one block, the security chain breaks instantly, making unauthorized alterations impossible.
Each block contains three things:
- The Cryptographic Header: A unique block ID, a local timestamp, the author## THE DESIGN: DATA BLOCKS AND SECURITY
To understand why UDS works, you have to look at how it handles data. A PDF describes where characters sit on a page [1, 2]. A UDS file describes what those characters mean by dividing the document into a tree of distinct semantic nodes called Blocks.
This diagram shows the core architecture of UDS. Instead of a single flat image file, the document is made up of modular database blocks stacked together. Each block is cryptographically linked to the one before it. If someone edits a single word in one block, the security chain breaks instantly, making unauthorized alterations impossible.
Each block contains three things:
- The Cryptographic Header: A unique block ID, a local timestamp, the author's public signing key, and a hash link to the parent block.
- The Semantic Metadata: Key-value tags mapped directly to standard public ontologies (like SNOMED-CT for medicine, JEL codes for economics, or legal taxonomies) from a public schema registry.
- The Payload Content: The raw data, whether it is text, table cells, or base64 media files.
Because these blocks are linked sequentially via their hashes, they form a Directed Acyclic Graph (DAG). If you alter a single character, metadata tag, or permission boundary in any block, the cryptographic chain breaks. The document instantly rejects itself as invalid. By embedding structured semantics directly into the file schema, we eliminate the verification costs and friction that plague traditional digital documents [5].
FIVE PRACTICAL CAPABILITIES
Here are five capabilities that have been nearly impossible with PDFs, and how UDS makes them happen.
Recall a document after sending it. The Problem: You email a contract or a patient note, find an error, and must chase down every recipient, hoping they delete the old copy. The UDS Solution: A sealed UDS document carries a live reference to its issuer's state registry [6]. When you revoke or amend a document, the local viewer checks this signature. The file remains readable (UDS is offline-first and respects local storage) but it dynamically flags its own revocation state across all active copies. The files report their own history.
Cryptographic expiration. The Problem: An expired license, prescription, or waiver is presented, and you must rely on a human check of the date text to reject it. The UDS Solution: Expiry is not text on a screen; it is encoded in the metadata block header. After the validity period passes, the cryptographic verification fails to authenticate. The document itself refuses to present as valid.
Proof of integrity. The Problem: In a court of law, verifying if a PDF has been back-dated or altered requires forensic expert testimony and database audits. The UDS Solution: UDS uses a local trust mesh. The author signs the root hash of the block DAG using an Ed25519 signature. Because this signature is embedded inside the file container, any browser can recalculate the hash chain locally. Either the math matches the signature, or the document is rejected. There is no middle ground.
Multi-language support in a single file. The Problem: Translating a document means generating five separate files, leading to translation drift and version fragmentation. The UDS Solution: Because UDS is block-structured, a single document carries parallel language streams inside the same file. Utilizing OMNI (Open Medical Nomenclature Integration) binding protocols, multiple translations coexist inside one document [7]. When a Spanish-speaking doctor opens a UDS file, the browser renders the Spanish blocks. When an English-speaking billing specialist opens the exact same file, they see the English blocks. One file, one truth, multiple voices.
Zero malware risk. The Problem: PDF is a persistent virus delivery vector because it allows arbitrary executable scripts and external network calls [3]. The UDS Solution: UDS uses a strict, zero-telemetry schema. It forbids executable code, active script tags, and unsealed external resource calls. A UDS file is a static data object rendered by a trusted engine. It cannot carry malware.
INSIDE THE UDS JSON PAYLOAD
For developers, the beauty of UDS is that it requires no complex PDF rendering libraries or network parsers. It is standard JSON wrapped in a self-rendering HTML page (.uds.html).
Here is what the raw UDS database payload looks like:
{
"uds_version": "1.0",
"document_title": "Invoice 4812",
"root_hash": "a1f9e2b...",
"signature": "sig_ed25519_...",
"public_key": "pub_key_...",
"blocks": [
{
"block_id": "block_1",
"timestamp": "2026-06-21T19:35:43Z",
"parent_hash": "0x00000...",
"metadata": {
"type": "invoice_total",
"value_usd": 1500.00
},
"content": "Invoice total: $1500.00"
}
]
}
When a developer's app reads this file, it parses the JSON payload directly in milliseconds. When any human opens the file, the embedded script automatically generates a clean, responsive web view.
BUILD WITH UDS: THE OPEN-SOURCE CLI TOOL
To make integration into developer pipelines as fast as possible, we have released the open-source uds-cli command line tool. Developers can use it right now to convert, sign, merge, and validate files locally:
# Install the UDS Command Line Interface
npm install -g uds-cli
# Seal a draft into a sealed .uds.html file
uds seal agreement.udr --output lease.uds.html
# Cryptographically sign a document locally using a private key
uds sign lease.uds.html --keyfile private.key
# Verify a document's hash chain and signature offline
uds verify lease.uds.html
The CLI tool is fully open-source. You can view the code, contribute to the schema register, and download the SDK repository directly on GitHub at https://github.com/universal-document/uds-cli.
GET STARTED NOW: CONVERT YOUR DOCUMENTS TO UDS
If you want to try UDS today, you do not need to install anything. We have built a free web portal where anyone can instantly convert their existing PDF, Word, or text files into secure, self-rendering UDS documents.
Simply visit: https://utilities.hive.baby/
At this portal, you can drag and drop your files to convert them, split or merge UDS documents, run client-side OCR, and redact sensitive information securely, entirely in your web browser.
You can also use:
- UD Creator (https://creator.hive.baby/) to author, structure, and export custom schemas visually.
- UD Validator (https://validator.hive.baby/) to drag-and-drop any UDS file to verify its cryptographic hash chain offline with absolute privacy.
The digital coordinate era of the PDF is reaching its end. By shifting to block-semantic trust architectures, we protect legal integrity and build a secure baseline for digital sovereignty. The transition is a structural inevitability.
Use it. Build on it. It belongs to you.
About the author: Sonny Saggar MD is a physician and AI evaluator with a background in internal medicine and a graduate degree from Oxford. He works at the intersection of clinical systems, AI governance, and document infrastructure.
The author has pleaded guilty to a federal offense and does not seek to relitigate his conviction or minimize his responsibility. Every argument in this article concerns the legal standard itself, not the facts of any individuasoftware
l case. It is written with respect for the courts, for prosecutors, and for the rule of law. Authored during pre-release BOP custody consistent with 28 CFR Part 540, Subpart B, BOP PS 5265.14, and the First Amendment rights affirmed in Procunier v. Martinez.
References & Citations
[1] Josh Dzieza, "Why is AI so bad at reading PDFs?", The Verge, February 2026.
[2] "The war against PDFs is heating up," The Economist, February 2026.
[3] PDF as enterprise security threat vector: TechRadar Enterprise and Security coverage, 2025-2026.
[4] Adobe Acrobat Studio launch: Wired and technology press, August 2025.
[5] Coase, R. H. (1937), "The Nature of the Firm," Economica; Williamson, O. E. (1985), The Economic Institutions of Capitalism.
[6] Swedish Civil Contingencies Agency (MSB) & Estonian Information System Authority (RIA), National Trust Frameworks for Electronic Identification, 2024.
[7] Saggar, S. S. (2026), "Open Medical Nomenclature Integration (OMNI) Binding Protocol Specification (v1.0.4)," Universal Document Foundation.
Top comments (0)