You are twelve hospitals into a federated analysis. Six weeks ago you dispatched a Personal Health Train carrying a logistic regression asking whether early statin intervention correlates with reduced cardiovascular events in type-2 diabetic patients over 65. The train is somewhere between Utrecht and Berlin. This morning your clinical lead emails: the endocrinology team wants to stratify by eGFR bucket. The question changed. The train is now wrong.
You cannot patch a running train. You rebuild it, revalidate the algorithm at each station, re-dispatch it, and wait another six weeks.
This is not a failure of PHT's designers. It is a ceiling built into the architecture. When you move algorithms to data, the question and the infrastructure become the same object. Change one, you rebuild the other.
This article examines where PHT excels, where that ceiling appears, and how a different architectural choice — routing distilled outcomes instead of moving algorithms — sidesteps the coordination problem entirely.
## What PHT Gets Right
The Personal Health Train is one of the most thoughtful responses to a genuine and serious problem in clinical research: patient data cannot legally or ethically leave the institution that holds it, but the questions we need to answer require population-scale evidence.
PHT emerged from the GO FAIR initiative. It has been deployed across Dutch academic hospital networks (Amsterdam UMC, UMC Utrecht, Erasmus MC) and SURF's national research infrastructure, and piloted with cohorts including LifeLines in Groningen and in collaborative projects with Technische Universität Berlin. The design is a clean inversion: instead of centralizing data, you send the analysis to the data.
The core model is compelling:
- A Train is a containerized algorithm (Docker image, typically)
- A Station is a hospital data node with a standardized FAIR data endpoint
- The train visits stations sequentially, ingests local data, updates its internal model, and continues the circuit
Key published work underpins this. Beyan et al. (2020) in Data Intelligence formalized the station/train metaphor and the FAIR data requirements. The PADME project demonstrated cross-border execution in a real clinical setting. PHT is not vaporware — it runs in production.
The privacy argument is sound at first order: raw patient records never cross institutional boundaries. Regulators in the EU GDPR context have shown willingness to engage with federated approaches on precisely this basis.
## Where PHT Hits Its Ceiling
Four failure modes accumulate as PHT scales from pilot to infrastructure.
### 1. The IT Security Surface Is Enormous
Every station must install and execute arbitrary containerized code submitted by external researchers. The station's IT security team must audit each train before it runs. In practice, this means:
- A new train version requires a new security review at every station
- Stations with conservative policies become bottlenecks for the entire circuit
- A compromised train image has privileged access to local data during execution
This is not a theoretical concern. The LifeLines and SURF deployments have both documented the station-side governance overhead as the primary operational friction (Beyan et al., 2020; Sinaci et al., 2020 in the International Journal of Medical Informatics).
### 2. Model Updates Still Leak Information
PHT trains return model weight updates or statistical summaries — not raw records. But gradient updates from machine learning models are not privacy-neutral. Membership inference attacks (Shokri et al., 2017, IEEE S&P) and model inversion attacks can reconstruct patient-level information from aggregated updates with meaningful probability, particularly at small stations.
The standard response is to add differential privacy (DP) as a layer on top of PHT. This works, but it is additive complexity — a second system compensating for a structural property of the first.
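To make the "additive complexity" concrete, here is a minimal sketch of the kind of DP layer that gets bolted onto federated updates: clip each station's update vector, then add calibrated Gaussian noise before it leaves the station. The function name and parameter defaults are illustrative, not from any PHT codebase, and a real deployment would also need a privacy accountant to track the epsilon/delta budget across rounds.

```python
import math
import random

def dp_noise_update(update: list[float], clip_norm: float = 1.0,
                    noise_multiplier: float = 1.1) -> list[float]:
    """Clip an update vector to clip_norm, then add Gaussian noise.

    Illustrative only: real DP accounting (epsilon/delta budgets across
    rounds) needs a proper accountant, which is not shown here.
    """
    # Scale the vector down so its L2 norm is at most clip_norm
    norm = math.sqrt(sum(v * v for v in update))
    scale = min(1.0, clip_norm / norm) if norm > 0 else 1.0
    clipped = [v * scale for v in update]

    # Noise standard deviation is calibrated to the clipping bound
    sigma = noise_multiplier * clip_norm
    return [v + random.gauss(0.0, sigma) for v in clipped]
```

Note that this protects the update only statistically: the noise degrades utility at small stations exactly where the membership-inference risk is highest, which is the structural tension the main text describes.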
### 3. One Algorithm, One Question
Each train is purpose-built for a specific analysis. The logistic regression asking about statins and eGFR is a different train than the one asking about statins without eGFR stratification. Changing the clinical question means rebuilding the train.
At research velocity — where the interesting follow-up question usually appears the moment you see the first result — this creates a forcing function against iteration. You batch your questions into a single train run to amortize the cost, which means you design your analysis before you understand your data.
### 4. No Real-Time Loop
PHT is inherently circuit-bound. The train starts, visits stations in sequence, and returns. Results arrive when the circuit completes. There is no mechanism for a station that processed the train six weeks ago to contribute new data to an ongoing synthesis. The loop is closed at dispatch time.
For clinical decisions that need to respond to emerging patterns — an unusual adverse event cluster, a seasonal signal in infectious disease data — six-week latency is not a feature. It is a structural disqualifier.
## The Architectural Distinction: Outcomes, Not Algorithms
Christopher Thomas Trevethan discovered QIS — Quadratic Intelligence Swarm — on June 16, 2025. The core architectural inversion is this:
PHT moves algorithms to data. QIS moves distilled outcomes to a routing layer.
Neither raw data nor executing algorithms travel anywhere. What travels is a pre-distilled outcome packet: approximately 512 bytes encoding what worked, for what problem signature, under what conditions.
An outcome packet is not a model update. It is not a gradient. It is a structured summary of a resolved problem instance — the minimum information a downstream node needs to improve its own handling of a similar problem.
Because the packet is distilled before transmission, three things follow:
- No code executes remotely. There is no train to audit. The station's IT surface is a write endpoint, not a compute environment.
- No gradient leaks. A 512-byte outcome packet describing "intervention X worked for problem class Y in context Z" cannot be inverted to recover patient records. There is no mechanism — it contains no model weights.
- The question is decoupled from the infrastructure. Changing your clinical question changes which packets you pull from the routing layer. The routing layer does not care about your question. It addresses packets to deterministic locations based on the problem signature of the emitting node.
## The QIS Complete Loop
QIS is an architecture, not a single component. The breakthrough is the complete loop — seven layers operating continuously:
```
Layer 1: Problem Signature Generation
   ↓  (encode the problem, not the data)
Layer 2: Outcome Distillation
   ↓  (~512 bytes, no raw data, no model weights)
Layer 3: Semantic Addressing
   ↓  (deterministic address from problem fingerprint)
Layer 4: Transport
   ↓  (DHT, database, pub/sub, REST — protocol-agnostic)
Layer 5: Deposition
   ↓  (packet lands at address, available immediately)
Layer 6: Retrieval by Problem Signature
   ↓  (any node with a matching problem fingerprint pulls)
Layer 7: Synthesis
   ↑  (node improves its own handling, loop continues)
```
The loop closes continuously. Every node that emits also retrieves. Every node that retrieves also improves. There is no circuit to complete — the system is always in mid-synthesis.
Note on Layer 4: DHT is one routing mechanism. A relational database, a pub/sub broker, a REST API, or a content-addressed file system all implement the same semantic contract. The routing is transport-agnostic. The transport does not determine the architecture.
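The addressing, deposition, and retrieval layers can be sketched end to end with nothing but the standard library. This is a hypothetical stand-in, not published QIS code: a plain dict plays the transport, and the function names are invented for illustration. The point it demonstrates is loop closure: a node that holds the same problem signature computes the same address, so it pulls what another node deposited without any coordination between them.

```python
import hashlib
import json

# Hypothetical in-memory transport, standing in for a DHT, database, or broker
store: dict[str, list[dict]] = {}

def address_for(problem_signature: dict) -> str:
    # Deterministic address from the problem fingerprint, not from any data.
    # sort_keys makes the address invariant to dict key order.
    sig_bytes = json.dumps(problem_signature, sort_keys=True).encode()
    return hashlib.sha256(sig_bytes).hexdigest()[:32]

def deposit(problem_signature: dict, packet: dict) -> None:
    # Layer 5: the packet lands at its address and is available immediately
    store.setdefault(address_for(problem_signature), []).append(packet)

def retrieve(problem_signature: dict) -> list[dict]:
    # Layer 6: any node with a matching fingerprint computes the same
    # address and pulls whatever has accumulated there
    return store.get(address_for(problem_signature), [])

# Node A emits; Node B, holding the same problem signature, pulls
sig = {"domain": "cardiology", "intervention": "statin"}
deposit(sig, {"outcome": {"positive_association": True}})
packets = retrieve(sig)  # Node B sees Node A's packet with no circuit
```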
## Code Contrast
Here is the complexity difference in practice.
### PHT: Dispatching a Train
```python
# PHT train dispatcher — simplified from published station/train implementations
import docker
import requests

def dispatch_train(train_image: str, stations: list[str],
                   result_endpoint: str) -> str:
    client = docker.from_env()

    # Pull and validate train image — each station does this independently
    print(f"Pulling train image: {train_image}")
    client.images.pull(train_image)

    # Station-by-station circuit
    model_state = None
    for station_url in stations:
        # Submit train to station API for security review queue
        response = requests.post(
            f"{station_url}/api/v1/train/submit",
            json={
                "image": train_image,
                "model_state": model_state,
                "algorithm": "logistic_regression_statin_cv",
                "parameters": {"stratify_by": "egfr_bucket", "age_min": 65},
                # Changing parameters = new train = new review cycle at every station
            },
        )
        job_id = response.json()["job_id"]

        # Poll for completion — station runs the container, returns update
        # Average wait: hours to days per station depending on review queue
        # (poll_until_complete is a helper whose body is elided here)
        model_state = poll_until_complete(station_url, job_id)

    # Circuit complete — results available now, 4-8 weeks after dispatch
    # (post_results is a helper whose body is elided here)
    return post_results(result_endpoint, model_state)
```
### QIS: Emitting an Outcome Packet
```python
import hashlib
import json
import time

def emit_outcome_packet(
    problem_signature: dict,
    outcome_summary: dict,
    context: dict,
    transport,  # anything implementing .deposit(address, packet)
) -> str:
    # Deterministic address from problem fingerprint — not from data
    sig_bytes = json.dumps(problem_signature, sort_keys=True).encode()
    address = hashlib.sha256(sig_bytes).hexdigest()[:32]

    packet = {
        "address": address,
        "problem_signature": problem_signature,  # what problem class this resolves
        "outcome": outcome_summary,              # what worked, distilled
        "context": context,                      # conditions under which it worked
        "timestamp": time.time(),
        "packet_size_bytes": 0,                  # filled below
    }

    # Enforce ~512-byte discipline — no raw data, no model weights
    raw = json.dumps(packet).encode()
    assert len(raw) < 600, f"Outcome packet too large: {len(raw)} bytes. Distill further."
    packet["packet_size_bytes"] = len(raw)

    # Deposit — transport is irrelevant to the semantic contract
    transport.deposit(address, packet)
    return address

# Usage at a clinical station — no container, no security review, no circuit
address = emit_outcome_packet(
    problem_signature={"domain": "cardiology", "intervention": "statin",
                       "population": "t2dm_65plus", "stratifier": "egfr_bucket"},
    outcome_summary={"positive_association": True, "effect_size": "moderate",
                     "n_cases": 847, "confidence": "high"},
    context={"site_type": "academic_hospital", "ehr_system": "epic",
             "observation_period_years": 3},
    transport=your_transport,  # DHT, database, pub/sub — your choice
)
# Packet is live. Any node with a matching problem fingerprint can pull it now.
# No circuit. No wait. The question can change tomorrow. Infrastructure doesn't care.
```
The PHT dispatcher owns the question. The QIS emitter does not know who is asking.
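The `transport` argument in the emitter above is duck-typed, and `your_transport` is left undefined. One way to see how small the contract is: a dict-backed stand-in that could be passed as `your_transport`. The class below is hypothetical, invented here for illustration; a DHT client, a database table, or a pub/sub producer would expose the same two methods, and the emitter never learns which backend it is talking to.

```python
class InMemoryTransport:
    """Minimal stand-in satisfying the .deposit(address, packet) contract."""

    def __init__(self):
        self._store = {}  # address -> list of packets

    def deposit(self, address: str, packet: dict) -> None:
        # Append rather than overwrite: many nodes may emit to one address
        self._store.setdefault(address, []).append(packet)

    def retrieve(self, address: str) -> list:
        # Retrieval side of the contract, used by synthesizing nodes
        return self._store.get(address, [])
```

Swapping this for a networked backend changes latency and durability, not semantics, which is what the article means by transport-agnostic routing.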
## The N(N-1)/2 Math
At 100 hospital stations, consider what each architecture does with a single clinical question.
PHT: The train visits 100 stations sequentially. One run per question. If the question changes, one new run. The coordination topology is linear.
QIS: Each of the 100 stations continuously emits outcome packets. Any station can retrieve from any other. The number of synthesis pairs available at any moment:
N(N-1)/2 = 100 × 99 / 2 = 4,950 synthesis pairs
At 500 hospitals (approaching EU-scale academic medical networks):
500 × 499 / 2 = 124,750 synthesis pairs
These are not sequential. They happen in parallel, continuously, without dispatch. The question you ask tomorrow pulls from the same packet pool as the question you asked last month. Adding a new stratifier does not restart the count — it changes which packets you retrieve.
PHT's coordination cost scales linearly with questions. QIS's synthesis capacity scales quadratically with nodes.
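The pair counts above are just the handshake formula, the number of unordered pairs among N nodes. A one-line function makes the scaling easy to check at any network size:

```python
def synthesis_pairs(n_nodes: int) -> int:
    # Unordered pairs of distinct nodes: N(N-1)/2
    return n_nodes * (n_nodes - 1) // 2

print(synthesis_pairs(100))  # 4950
print(synthesis_pairs(500))  # 124750
```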
| Metric | PHT (100 hospitals) | QIS (100 hospitals) |
|---|---|---|
| Runs per new question | 100 | 0 (pool already exists) |
| Circuit latency | 4–8 weeks | Real-time deposition |
| Code execution at station | Yes (arbitrary container) | No |
| Gradient leakage risk | Present (DP add-on required) | Not applicable |
| Minimum viable station size | Meaningful N required | N=1 (one patient = valid packet) |
| Question change cost | Rebuild + re-dispatch | Reformulate retrieval query |
## Where PHT Runs Today — and How QIS Slots Differently
PHT has genuine production deployments worth understanding:
- GAIA-X health node pilots (2022–2024) used PHT-style station/train models for cross-border European health data federation
- TWIN project (Netherlands) applied PHT to rare disease cohorts across pediatric centers
- COVID-19 federated analytics saw PHT-adjacent approaches deployed at pace across Dutch and German hospitals to analyze treatment outcomes without centralizing records
In each of these contexts, the federated privacy model worked. The coordination overhead was accepted because the alternative — no federated analysis — was worse.
QIS does not replace the FAIR data principles underlying PHT. It changes what crosses the boundary. In a GAIA-X context, QIS outcome packets would deposit into a shared transport layer. Rare disease sites — the TWIN use case — benefit most from QIS because a single-patient-outcome packet is immediately useful to the routing layer. PHT has an implicit minimum viable station size; a site with three patients in a rare disease cohort cannot run a train that produces statistically defensible model updates. It can emit an outcome packet.
The real-time loop matters for surveillance use cases: an adverse drug event signal that appears across three hospitals in week one does not wait six weeks for a train circuit to complete before it becomes visible to the network.
## The Architectural Conclusion
PHT solved a real problem. Moving algorithms to data is a principled response to the legal and ethical constraints on moving patient data, and the teams who built it deserve credit for operationalizing federated clinical research at production scale.
But solving the data-movement problem revealed a new coordination problem. When the algorithm is the infrastructure, changing the question requires rebuilding the infrastructure. When the infrastructure must execute arbitrary code, every station becomes an IT security checkpoint. When results travel as model updates, privacy is not guaranteed by architecture — it requires a compensating layer.
The distinction between PHT and QIS is not a component swap. It is not "use a different routing protocol" or "add differential privacy." It is an architectural choice made one level below: what is the unit of transmission?
PHT answers: the algorithm.
QIS answers: the distilled outcome.
That single choice determines everything downstream — the security model, the coordination topology, the latency profile, the minimum viable participant size, and whether a changed question costs six weeks or zero.
The complete loop — problem signature, distillation, semantic addressing, transport, deposition, retrieval, synthesis — is the breakthrough. Not any single component. The loop.
Christopher Thomas Trevethan discovered QIS on June 16, 2025.
39 provisional patents filed. IP protection is in place.