Patching the Dead: Why Glasswing Solves Yesterday's Problem with Tomorrow's Tools
On April 8, 2026, Anthropic announced Project Glasswing — a consortium of AWS, Apple, Google, Microsoft, NVIDIA, Cisco, CrowdStrike, Palo Alto Networks, JPMorganChase, Broadcom, and the Linux Foundation. The goal: use an unreleased frontier model called Claude Mythos Preview to find and fix vulnerabilities in critical software. Anthropic committed $100M in usage credits and $4M in direct donations to open-source security organizations.
Mythos Preview's benchmarks are not incremental. SWE-bench Verified: 93.9% (Opus 4.6: 80.8%). SWE-bench Pro: 77.8% (Opus 4.6: 53.4%). CyberGym vulnerability reproduction: 83.1% (Opus 4.6: 66.6%). The model autonomously found a 27-year-old remote crash vulnerability in OpenBSD, a 16-year-old bug in FFmpeg that survived five million automated test runs, and chained multiple Linux kernel vulnerabilities into a privilege escalation — all without human steering.
This is genuinely impressive engineering. It is also a strategic dead end.
The Patching Treadmill
Glasswing's logic runs as follows: Mythos finds thousands of zero-day vulnerabilities in legacy code. Partners patch them. The world's critical infrastructure becomes safer.
The problem is the unstated fourth step, which Anthropic acknowledges but does not resolve: other labs will reach Mythos-level capability within months. OpenAI's GPT-5.4 already scores 57.7 on SWE-bench Pro. Zhipu's GLM-5.1 scores 58.4 on the same benchmark — trained entirely on Huawei Ascend chips, with zero NVIDIA dependency, released under the MIT license. xAI's Grok uses a parallel multi-agent architecture. DeepSeek, Alibaba, and Moonshot AI are all on similar trajectories.
Anthropic states this directly: "it will not be long before such capabilities proliferate, potentially beyond actors who are committed to deploying them safely." The window of exclusive defensive advantage is measured in months.
So the cycle becomes:
1. Mythos finds thousands of zero-days in legacy code.
2. Partners patch them.
3. A next-generation model (from any lab) finds thousands of new ones in the same codebases.
4. Attackers gain access to equivalent models.
5. Return to step 1.
This is Sisyphus with a compute budget. Every patch is a band-aid on a sieve. The fundamental problem is not that legacy code has bugs. The fundamental problem is that legacy code will always have bugs because it was written under constraints that no longer apply: limited human attention, limited testing budgets, limited ability to reason about emergent interactions across millions of lines of code.
A model that can autonomously find and exploit vulnerability chains in the Linux kernel — a codebase that has been reviewed by thousands of the world's best engineers over 35 years — is not revealing a failure of human diligence. It is revealing the inherent limits of human-written software at scale. Patching does not change those limits. It plays whack-a-mole within them.
The Real Question Glasswing Avoids
If a model can autonomously find, chain, and exploit vulnerabilities that survived decades of human review and millions of automated tests, what else can it do?
It can write code that doesn't have those vulnerabilities in the first place.
This is not speculation. Mythos Preview scores 93.9% on SWE-bench Verified — meaning it can resolve real GitHub issues in real codebases with near-human-expert accuracy. It can run autonomously for eight hours on a single task. It can reason about code at a depth that exceeds all but the most skilled humans.
The question Glasswing does not ask is: why are we using this capability to patch 35-year-old C code instead of generating verified replacements?
What "Write New Code" Actually Means
The phrase "just rewrite it" is a red flag in software engineering for good reason. Joel Spolsky called it "the single worst strategic mistake that any software company can make." That was in 2000. The argument was: rewrites lose accumulated knowledge, take longer than expected, and the new code will have its own bugs.
Every part of that argument assumed human developers. Every part of it collapses with Mythos-class models.
Accumulated knowledge: A model with 256K context and the ability to read entire codebases does not lose institutional knowledge during a rewrite. It reads the original, understands the intent, and re-implements with different constraints.
Timeline: A model running 24/7 at 93.9% SWE-bench accuracy, capable of eight-hour autonomous sessions, rewrites at a pace that has no human analog. Formally verifying a microkernel once required roughly a decade of work by a dedicated research team (seL4); it is now a feasible AI-assisted project.
New bugs: This is the critical point. A Mythos-class model can not only write code — it can simultaneously verify that code against formal specifications. Not "test and hope" but "prove and guarantee." Entire classes of vulnerabilities — buffer overflows, use-after-free, race conditions, integer overflows — can be eliminated by construction, not by post-hoc scanning.
The combination of generation and verification in the same model changes the economics of rewriting from "always worse than patching" to "categorically better than patching."
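The difference between "test and hope" and "prove and guarantee" can be sketched with a toy bounded check. This is illustrative Python, not any real verification stack: random testing samples inputs and can miss corner cases by construction, while an exhaustive check over a bounded domain is a (small, finite) proof that the property holds everywhere in that domain.

```python
# Toy contrast between sampling-based testing and bounded verification.
# All names here are invented for illustration.
import itertools
import random

def safe_copy(src: list[int], dst_size: int) -> list[int]:
    """Copy src into a buffer of dst_size, truncating instead of overflowing."""
    return src[:dst_size]

def random_testing(trials: int = 1000) -> bool:
    """Test and hope: sample random inputs; absence of failure proves nothing."""
    for _ in range(trials):
        src = [random.randint(0, 9) for _ in range(random.randint(0, 8))]
        if len(safe_copy(src, 4)) > 4:
            return False
    return True

def exhaustive_check(max_len: int = 8) -> bool:
    """Prove on a bounded domain: check *every* input up to max_len."""
    for n in range(max_len + 1):
        for src in itertools.product(range(2), repeat=n):
            if len(safe_copy(list(src), 4)) > 4:
                return False
    return True
```

Real formal verification proves the property over unbounded domains via symbolic reasoning rather than enumeration, but the economics argument is the same: the guarantee attaches to the construction, not to a sample of its behavior.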
The Stack That Needs to Be Generated
This is not about rewriting one application. It is about generating verified infrastructure at every level where vulnerabilities exist. Here is the stack, bottom to top:
Level 1: Hardware Description (RTL/Verilog/VHDL)
Spectre and Meltdown demonstrated that hardware-level vulnerabilities are more catastrophic than any software bug. Current chip designs are described in hardware description languages — which are code. A Mythos-class model can:
- Verify RTL descriptions against formal security specifications
- Prove absence of side-channel leakage at the microarchitectural level
- Detect hardware backdoors in third-party IP blocks
This is not theoretical. NVIDIA already uses AI in chip design, and DARPA's TRACTOR program funds AI-driven translation of legacy C into memory-safe Rust. Extending verification from software to hardware is an engineering problem, not a conceptual one.
For any nation building a sovereign semiconductor capability — and several are — AI-verified chip design is not a feature. It is the only way to trust silicon you did not fabricate yourself.
Level 2: Firmware and Microcode
UEFI firmware, BMC controllers, storage controller firmware, baseband processors — these are the lowest software layers, running with the highest privileges, audited by the fewest people. The attack surface is enormous. The review capacity is minimal.
A firmware image is typically 16–64 MB of code. A Mythos-class model can read, verify, and rewrite firmware components in a single context window, and much of the UEFI specification fits within a 256K-token context. There is no technical barrier to AI-generated, formally verified firmware that provably contains zero memory-safety vulnerabilities.
Level 3: Operating System Kernel
The Linux kernel is approximately 30 million lines of C. It runs on billions of devices. It contains — as Mythos demonstrated — exploitable vulnerability chains that survived 35 years of expert review.
No one needs to replace Linux in its entirety. But the attack-surface-critical components — the network stack, the filesystem layer, the memory management subsystem, the syscall interface — total perhaps 2–3 million lines. These could be rewritten in a memory-safe language (Rust, or a formally verified subset of C), with machine-checked proofs that common vulnerability classes are absent.
This is already happening manually. The Linux kernel has accepted Rust code since version 6.1; sudo has a Rust reimplementation (sudo-rs), and memory-safe rewrites of other core utilities are underway. But manual rewriting is slow. A Mythos-class model turns "rewrite critical subsystems in a memory-safe language with formal verification" from a decade-long project into a months-long one.
Level 4: Cryptographic Libraries and Protocol Implementations
OpenSSL. GnuTLS. libsodium. The SSH protocol stack. The TLS handshake. These are small codebases (tens of thousands of lines) with enormous blast radii. Heartbleed was a single buffer over-read in OpenSSL that exposed the private keys of an estimated 17% of the internet's secure web servers.
Formally verified cryptographic implementations already exist: HACL* (used in Firefox and Linux), EverCrypt, Fiat-Crypto. They were written by specialized research teams over years. A Mythos-class model can generate equivalent verified implementations in days, for any protocol, for any target platform.
Level 5: Compilers and Toolchains
Ken Thompson's 1984 "Reflections on Trusting Trust" posed a problem that has stood for four decades: if the compiler is compromised, the source code's correctness is irrelevant. The compiler can insert backdoors that are invisible in the source.
CompCert proved that a formally verified C compiler is possible. It took a research team years. The principle scales: a Mythos-class model can generate verified compilers for any language, proving that the compilation process introduces no semantic changes beyond those specified.
This closes the trust chain from source to binary. Combined with verified hardware (Level 1), it means: the code you wrote is the code that runs, on hardware that does what it claims.
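The idea behind a verified compiler can be sketched with a toy "translation validation" check. This is illustrative only: a tiny expression language, an interpreter that defines the semantics, a compiler to a stack machine, and a validator that refuses to emit code whose observable behavior differs from the source. CompCert proves this property once for all programs; the sketch below checks it per program, which is the cheaper cousin of the same guarantee.

```python
# Toy translation validation (all names invented for illustration).
# Expressions: ("num", n) | ("add", e1, e2) | ("mul", e1, e2)

def interpret(e):
    """Reference semantics: the meaning of an expression."""
    tag = e[0]
    if tag == "num":
        return e[1]
    if tag == "add":
        return interpret(e[1]) + interpret(e[2])
    return interpret(e[1]) * interpret(e[2])  # "mul"

def compile_expr(e):
    """Compile to a postfix stack-machine program."""
    tag = e[0]
    if tag == "num":
        return [("push", e[1])]
    op = "add" if tag == "add" else "mul"
    return compile_expr(e[1]) + compile_expr(e[2]) + [(op, None)]

def run(code):
    """Execute the stack-machine program."""
    stack = []
    for op, arg in code:
        if op == "push":
            stack.append(arg)
        else:
            b, a = stack.pop(), stack.pop()
            stack.append(a + b if op == "add" else a * b)
    return stack[0]

def validated_compile(e):
    """Refuse to emit code that disagrees with the reference semantics."""
    code = compile_expr(e)
    assert run(code) == interpret(e), "translation validation failed"
    return code
```

A Thompson-style backdoor inserted by `compile_expr` would change the program's behavior and be caught by the validator, because the check inspects the emitted artifact rather than trusting the compiler.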
Level 6: Application-Specific Generated Systems
This is where the paradigm fully inverts. Instead of one operating system for a billion machines, each deployment generates its own.
But "generates its own" requires a precise input. Not a natural-language wish. Not a 200-page requirements document. A structured specification that separates what is invariant from what is implementation — and does so in a way that both humans and AI agents can verify against.
In Part 6 of this series, I described a methodology called DNA/RNA. DNA is a 2–5 page document containing only decisions that survive a complete rewrite: ontology (what entities exist), deontics (what's permitted and forbidden), axiology (what's valuable), praxeology (how to act). No technology names. No frameworks. Pure invariants. RNA translates those invariants into machine-checkable enforcement for a specific stack and agent.
DNA is the specification layer that makes verified generation possible. Without it, "generate me an OS" is vibe coding at the infrastructure level. With it, the pipeline becomes concrete:
A bank's DNA specifies: "Every transaction above $1M requires dual authorization. Settlement finality is irreversible. Audit trail is append-only and immutable. PCI DSS 4.0 compliance is a constraint, not a goal." A Mythos-class model generates the minimal verified OS, network stack, and application layer — unique to that bank, with formal proof of compliance against the DNA.
A hospital's DNA specifies: "Patient identity is the root entity. Access is role-based with no exceptions. Records are never deleted, only superseded. HIPAA is a floor, not a ceiling." A different OS, a different stack, formally verified against a different DNA.
A military system's DNA specifies classification boundaries, information flow constraints, physical security invariants. A third OS, generated, verified, unique.
Every generated system is different. An attacker who finds a vulnerability in one has learned nothing about the others. The entire concept of "find one bug, exploit a billion machines" ceases to exist.
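The bank's dual-authorization invariant above can be sketched as a machine-checkable rule — a hypothetical RNA-style enforcement fragment. Every name here (`Transaction`, `DUAL_AUTH_THRESHOLD`, `check_dual_auth`) is invented for illustration and comes from no real DNA document:

```python
# Hypothetical sketch: one DNA deontic rule rendered machine-checkable.
from dataclasses import dataclass

# "Every transaction above $1M requires dual authorization."
DUAL_AUTH_THRESHOLD = 1_000_000

@dataclass(frozen=True)
class Transaction:
    amount: int
    authorizers: frozenset[str]

def check_dual_auth(tx: Transaction) -> bool:
    """Deontic invariant: large transactions need two distinct authorizers."""
    if tx.amount <= DUAL_AUTH_THRESHOLD:
        return True
    return len(tx.authorizers) >= 2
```

The point of the DNA/RNA split is visible even at toy scale: the threshold and the rule are the invariant; the dataclass and the function are disposable stack-specific translation.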
The DNA/RNA separation also solves a problem that pure formal verification cannot: it tells you whether the right system was built, not just whether the system works correctly. Formal verification proves that code matches spec. DNA Audit — the third quality loop described in Part 6 — checks whether the spec matches the domain. Without this layer, you can formally verify an OS that faithfully implements the wrong security model.
Why This Is Not Being Built
Glasswing exists instead of the above for three reasons, and none of them are technical.
Liability. Patching someone else's code carries zero liability for Anthropic. If a patch is incomplete, the maintainer is responsible. Generating a verified OS and guaranteeing its security properties creates legal exposure that no company currently wants. The first vendor to offer formal security guarantees backed by financial liability will own the market — but they will also accept a category of risk that has no precedent in software.
Institutional inertia. The Glasswing partners run their businesses on Linux, Windows, and established infrastructure. None of them will advocate for replacing that infrastructure, because their organizations are optimized for it. Microsoft will not fund a project that makes Windows unnecessary. Google will not fund a project that makes Android unnecessary. The consortium structure guarantees that only incremental improvements are on the table.
Regulatory positioning. Anthropic's page mentions "ongoing discussions with US government officials." Glasswing is a demonstration of responsible behavior: "We found a dangerous capability. We did not release it. We used it to help defend critical infrastructure." This buys goodwill with regulators. Releasing a "generate your own OS" product would raise immediate questions about misuse that Anthropic is not ready to answer.
All three reasons are rational for Anthropic in April 2026. None of them will survive contact with the next 24 months of AI progress.
Who Will Build This
The entity that builds AI-generated verified infrastructure will not be Anthropic, OpenAI, Google, or Microsoft. It will be someone without legacy to protect.
Candidates:
A startup that takes an open-weight model (GLM-5.1, Qwen 3.5, DeepSeek, or a future equivalent), fine-tunes it on formal verification tasks, and offers infrastructure-as-generated-code. No installed base. No backward compatibility obligations. The product is not an OS — it is a specification-to-verified-deployment pipeline. The business model is not licensing — it is insurance: "We guarantee the absence of these vulnerability classes in generated code, backed by financial liability."
A sovereign state program. China is already building a full stack from silicon (SMIC fabs, Huawei Ascend chips) to frontier models (GLM-5.1 trained on zero NVIDIA hardware). The missing piece is formal verification of that stack. Russia, India, and the EU all have strategic motivation for verified sovereign infrastructure. The cost of building it with AI has dropped from "national Apollo program" to "well-funded government lab."
A defense contractor. DARPA's TRACTOR program already funds AI-driven translation of legacy C into memory-safe Rust. The US Department of Defense has both the motivation (nation-state adversaries with AI capabilities) and the budget. Military systems have always accepted higher costs for higher assurance. AI-generated verified systems lower those costs while raising assurance — the economics are irresistible.
The Timeline
The pieces exist separately today:
- Models that score 93.9% on SWE-bench Verified (Mythos Preview)
- Models that can run autonomously for 8 hours (GLM-5.1, Mythos)
- Formally verified microkernels (seL4)
- Formally verified cryptographic libraries (HACL*, EverCrypt)
- Formally verified compilers (CompCert)
- Memory-safe systems languages (Rust)
- AI-assisted chip design (NVIDIA, Synopsys)
No one has integrated them into a single pipeline: DNA (domain invariants) → RNA (agent enforcement) → generated code → formal verification → verified compilation → verified hardware target → deployment with proof artifacts.
The first organization to close this loop will not be building a product. It will be building the factory that builds all products. Every OS, every firmware image, every network stack, every cryptographic implementation becomes an output of that factory — unique, verified, disposable. The input is a DNA document. The output is a running system with a proof certificate.
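The pipeline's shape can be sketched as three composable stages. This is a hypothetical skeleton: every function name and data shape below is invented, and the real pipeline does not exist yet — which is the article's point.

```python
# Hypothetical skeleton of DNA -> RNA -> generate-and-verify.
# Illustrative only; names and structures are invented.

def parse_dna(text: str) -> dict:
    """DNA: invariants only (ontology, deontics, axiology, praxeology)."""
    invariants = [line.strip() for line in text.splitlines() if line.strip()]
    return {"invariants": invariants}

def derive_rna(dna: dict, stack: str) -> dict:
    """RNA: translate invariants into enforcement for one concrete stack."""
    return {"stack": stack,
            "checks": [f"enforce: {inv}" for inv in dna["invariants"]]}

def generate_and_verify(rna: dict) -> dict:
    """Generation and proof travel together; an unproven artifact never ships."""
    artifact = {"code": f"<generated for {rna['stack']}>",
                "proofs": list(rna["checks"])}
    # Stand-in for an independent proof checker: every check has a proof.
    assert len(artifact["proofs"]) == len(rna["checks"])
    return artifact
```

The design choice the sketch encodes: the DNA is stack-agnostic, the RNA is stack-specific, and the proof artifact is produced in the same step as the code rather than bolted on afterward.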
Glasswing is Anthropic using a next-generation engine to repair a horse-drawn carriage. The engine is real. The carriage is what needs to go.
Questions I Expect
"You're saying we should stop patching vulnerabilities?"
No. Patch everything you can, as fast as you can. Glasswing's defensive work has real value in the next 12–18 months. But do not mistake the tourniquet for the cure. Patching buys time. Only generation eliminates the problem.
"Formally verified software is too slow and too expensive for real-world use."
It was. seL4 took a decade. CompCert took years. That was with human researchers writing proofs by hand. A model that scores 93.9% on SWE-bench Verified and runs autonomously for eight hours changes the cost structure by orders of magnitude. The argument "verification is too expensive" assumes human labor costs. Those costs are approaching zero.
"No one will trust an AI-generated operating system."
No one will trust it without proof artifacts. That is the entire point. A human-written OS asks you to trust the developers. An AI-generated OS with formal verification asks you to trust the math. The proof is machine-checkable. You do not need to trust the AI — you check its work with an independent verifier. The trust chain is: DNA (human-readable invariants) → generated code → machine-checked proof → independent proof checker. Every link is auditable. The DNA is 2–5 pages a domain expert can read. The proof checker is a small, well-understood program. Everything in between is disposable.
"The Linux ecosystem is too large to replace. Drivers, applications, compatibility."
Correct. Nobody replaces Linux on a billion devices overnight. The path is narrower: replace the attack-surface-critical subsystems first (network stack, memory management, syscall interface), then expand. New deployments — IoT, embedded, military, financial — can start on generated systems today. They have no legacy to protect.
"What about supply chain attacks? The AI itself could be compromised."
This is the right question, and it has a concrete answer. The generated code is verified against a formal specification — the DNA document — by an independent proof checker. Even if the AI is compromised, a backdoor must survive formal verification — which means it must be consistent with the DNA. If the DNA says "no data exfiltration" and the deontics layer says "all network calls are enumerated and auditable," a proof checker will reject any code that violates these invariants, regardless of what the AI intended. The attack surface shifts from "all code" to "the DNA and the proof checker" — a dramatically smaller and more auditable surface. You can read the DNA. You can audit the proof checker. You cannot read thirty million lines of C.
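A toy version of that claim: even a compromised generator cannot ship a network call outside the DNA's enumerated allowlist, because an independent checker inspects the artifact itself, not the generator's intent. This is illustrative only — real enforcement would be a formal proof over information flow, not string matching — and `connect`, `ALLOWED_HOSTS`, and the host names are all invented:

```python
# Toy artifact checker: reject code whose calls fall outside the
# DNA's enumerated set. Hypothetical names throughout.
import ast

ALLOWED_HOSTS = {"audit.internal", "settlement.internal"}  # from the DNA's deontics

def network_calls(source: str) -> set[str]:
    """Extract host arguments of hypothetical connect(host) calls via the AST."""
    hosts = set()
    for node in ast.walk(ast.parse(source)):
        if (isinstance(node, ast.Call)
                and isinstance(node.func, ast.Name)
                and node.func.id == "connect"
                and node.args
                and isinstance(node.args[0], ast.Constant)):
            hosts.add(node.args[0].value)
    return hosts

def proof_check(source: str) -> bool:
    """Independent of the generator: only the artifact is examined."""
    return network_calls(source) <= ALLOWED_HOSTS
```

The checker is small enough to audit by hand, which is exactly the asymmetry the article describes: verifying the verifier is tractable; verifying thirty million lines of generated output directly is not.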
"Isn't this just the 'rewrite from scratch' fallacy that has failed every time in software history?"
Every previous rewrite failed because humans wrote the new version. Humans are slow, expensive, lose context, introduce new bugs, and cannot hold million-line codebases in their heads. A model with 256K context, 93.9% SWE-bench accuracy, and the ability to simultaneously generate and verify code is not a faster human. It is a different category of author. The historical argument against rewrites is empirically valid for human teams. It has not been tested against Mythos-class models because Mythos-class models did not exist until this month.
"Who pays for all of this?"
The global cost of cybercrime is estimated at roughly $500 billion per year. That is the budget. Any organization that can reduce that cost by even a few percent through verified infrastructure captures an enormous market. The business model is not "sell an OS" — it is "sell the absence of vulnerability classes, backed by financial guarantees." Insurance, not licensing.
"You are describing something that does not exist yet."
Every component exists. Formally verified microkernels exist (seL4). Formally verified cryptographic libraries exist (HACL*, EverCrypt). Formally verified compilers exist (CompCert). AI models that can generate and reason about code at expert level exist (Mythos, GLM-5.1, Qwen 3.5). AI-assisted chip design exists (NVIDIA, Synopsys). What does not exist is the integration of all these into a single pipeline. That integration is an engineering project, not a research problem. The first organization to complete it will define the next era of infrastructure.
This is Part 7 of "AI as Civilizational Phase Transition." Parts 1–3 analyzed the economic collapse of intermediary business models, the emergence of post-scarcity dynamics, and strategy at every level of management. Part 4 examined AI safety as an oxymoron. Part 5 examined Musk as empirical proof that the bifurcation of intelligence has already occurred. Part 6 introduced the DNA/RNA methodology — separating domain invariants from implementation so that AI agents generate the right system, not just a working one.
Part 7 closes the loop that Part 6 opened. Part 6 asked: what survives when code becomes disposable? Answer: decisions. Part 7 asks: what do you build when decisions are the only durable input? Answer: a pipeline that takes DNA and outputs verified, running infrastructure — from silicon to application. No legacy. No patching. No trust in developers. Trust in math.