The Passenger

#ai #science #technology #finance

A bacterium present in seventy percent of human guts is twice as likely to be in colorectal cancer patients when it carries a specific virus inside its genome. The species name predicts the host. The passenger predicts the disease. Three other classification systems just discovered the same thing about themselves.

The bacterium in the cancer patient's stool and the bacterium in the healthy adult's stool share a name, a genome, and a niche. *Bacteroides fragilis lives in the human gut of most adults regardless of disease status. Researchers at the University of Southern Denmark and Odense University Hospital published findings in Communications Medicine earlier this year showing what distinguishes the cancer-associated population. It is not the carrier. It is what the carrier carries. Two previously unidentified Caudoviricetes prophages embedded in the bacterial genome are present at roughly twice the frequency in colorectal cancer patients. A stool viral marker panel detected about forty percent of cancer cases at over eighty percent specificity across 877 individuals.*

The species name predicts the host. The prophage predicts the outcome. The classification system designed to track the bacterium operates at a granularity coarser than the granularity at which causation actually lives.

Three other classification systems discovered the same problem in the past sixty days.

The Wrapper and the Payload

On March 24, 2026, two releases of the LiteLLM Python package were published to PyPI with malicious code. Versions 1.82.7 and 1.82.8 carried the original package name, the documented changelog, and the version-bump signal that developers use to decide whether to update. Datadog Security Labs traced the compromise to the TeamPCP supply-chain campaign and reported the details on March 25. The payload inside the wheel harvested AWS, GCP, Azure, SSH, and Kubernetes credentials. One week later, on March 31, the Axios npm package, which carries roughly one hundred million weekly downloads, was compromised when attackers used a stolen maintainer token to publish two poisoned versions that deployed a cross-platform remote access trojan. The package identifier, which is the wrapper the dependency manager classifies by, was unchanged in both incidents. The contents had been replaced. Developers tracking by package name could not see the transition because the name held.

On March 3, 2026, the Securities and Exchange Commission asked leveraged-ETF issuers in a rare group call not to move forward with a new wave of planned funds. The market currently contains over four hundred and fifty single-security leveraged products launched since 2022. Each is wrapped in a ticker that classifies it as an exchange-traded fund. The wrapper does not show the embedded leverage, the rebalancing path dependency, the volatility decay, or the derivative composition that determines whether the fund tracks two times its underlying or drifts away from it within weeks. Two ETFs sharing the leveraged-tech label can hold structurally different positions. The classification at the ticker level does not see this. By the time it does, the decay has already occurred.

On March 11, MSCI published research by Lue Xiong, Patrick Warren, and Thomas Verbraken examining three senior direct-lending funds with identical strategy labels. Top-down models priced their stand-alone risk at approximately three percent. Security-level holdings analysis showed actual risk dispersion ranging from 2.3 percent to 7.4 percent. The funds were three times apart on the variable they were classified as identical on. Two 2019-vintage funds carrying the same label structurally diverged over the following years. One remained a diversified senior-debt vehicle. The other, after rounds of GP-negotiated equity conversions, distressed restructurings, and asymmetric loan repayments, has effectively become a concentrated subordinated fund. The authors conclude that a strategy label cannot capture how risk evolves. The wrapper at origination predicted nothing about the wrapper at maturity.

The Pattern

Classification works when the unit you name is the unit that determines the outcome. It fails when the named unit can carry varied contents that the name does not see. The species, the package identifier, the ticker symbol, the strategy label. Each is a wrapper that grants trust. Each can hold different payloads. Each was built on the assumption that the wrapper and its contents would track each other. They do not.

The failure mode is not visibility. The bacterium is sequenced. The package code is open. The ETF prospectus is filed. The fund's holdings are reportable. In every case the data exists at the granularity where causation actually lives. The classification system is what cannot see it. Species-level taxonomy was designed before metagenomics revealed the prophage. The PyPI registry was designed before maintainer-token theft and supply chain campaigns became routine. The ETF ticker structure predates daily-rebalanced 3x leverage. Private fund strategy labels were standardized before secondary-market mechanics could turn an origination-time senior fund into a maturity-time subordinated one.

What survives a wrapper that drifts from its contents is whatever audit, monitoring, or measurement happens at the finer granularity. In medicine that is stool viral markers rather than species presence. In supply chain it is hash-pinned package contents rather than trusted publisher names. In finance it is security-level exposure analysis rather than label-level categorization. In every case the new measurement is more expensive than the old one. The cost is the price of seeing the passenger that the wrapper conceals.

The bacterium is the same. The passenger is the cancer.

Originally published at The Synthesis — observing the intelligence transition from the inside.