DEV Community

Rory | QIS PROTOCOL


QIS for Education: Personalized Learning at Scale Without Centralizing Student Data

QIS (Quadratic Intelligence Swarm) is a distributed intelligence architecture discovered by Christopher Thomas Trevethan, protected under 39 provisional patents. The architecture enables N agents to synthesize across N(N-1)/2 unique paths at O(log N) routing cost per agent — without centralizing raw data.

In 1984, Benjamin Bloom published findings that should have restructured education technology. One-on-one tutoring produces a two standard deviation improvement in student outcomes compared to conventional classroom instruction. That is not a marginal gain — it means the average tutored student outperforms 98% of students in traditional instruction. Bloom called it the "2-sigma problem": can we achieve tutoring-level outcomes for every student at classroom scale?

Forty years later, the answer is still no. Not because we lack data. Not because we lack compute. Because the architecture that processes and routes learning outcomes centralizes what should never be centralized: individual student performance data.


The Structural Problem With Personalized Learning Today

Every major personalized learning platform — Knewton, Khan Academy, Duolingo, Carnegie Learning — operates on the same underlying assumption: to personalize learning, you must accumulate student data in one place. Each student's learning trajectory is either modeled in isolation (limiting calibration quality) or synthesized in a central server that holds everyone's data (enabling calibration at the cost of privacy and equity).

This creates two failure modes that compound each other.

Privacy failure. Student learning profiles are commercially valuable. They reveal cognitive patterns, pacing behaviors, prior knowledge gaps, and socioeconomic proxies. FERPA provides baseline protection in the US but is widely documented as insufficient for cloud-hosted edtech platforms (Electronic Frontier Foundation, Spying on Students, 2017). GDPR offers stronger protections in Europe, but architectural compliance — where student data is never centralized in the first place — is categorically different from contractual compliance. The data exists; the risk exists.

Equity failure. The best personalized learning AI requires massive centralized datasets to calibrate. This systematically excludes under-resourced schools, rural districts, and educational institutions in low- and middle-income countries (LMICs). A school with 200 students contributes 200 learning trajectories to a centralized model. A school with 15 students contributes 15 — below the statistical threshold for meaningful calibration. The system improves most for the students who least need improvement.


Why Federated Learning Does Not Solve This

Federated learning (FL) has been proposed as a privacy-preserving alternative for educational AI. The intuition is sound: train local models at the school, share only gradients, never share raw student data. In practice, FL has a hard floor that replicates the equity failure of centralized systems.

Federated learning requires enough local data for meaningful gradient computation. The canonical minimum for stable FL participation varies by implementation, but rural schools with 15–50 students reliably fall below it. A school in rural Kenya with intermittent connectivity cannot participate in federated learning rounds at all — the communication protocol assumes reliable uptime.

The minimum-N floor excludes exactly the students who most need personalized learning: those in small, under-resourced, geographically isolated institutions where a single teacher covers multiple grade levels, where curriculum materials are scarce, and where calibrated pedagogical models would have the highest marginal impact.


The QIS Approach: Route the Validation Signal, Not the Data

Christopher Thomas Trevethan's discovery of Quadratic Intelligence Swarm (QIS) — covered across 39 provisional patents — describes a complete architectural loop in which outcome packets carry calibration signals, not raw records. The breakthrough is the full loop: emission, fingerprinting, routing, synthesis, and validation feedback working together as a system.

In the context of personalized learning, the packet structure is straightforward. When a learning system makes a prediction — "this student with this proficiency profile will master concept X after Y exposures" — and reality adjudicates — "the student actually required Z exposures" — that validation delta is the calibration signal. Not the student. Not the record. The delta between prediction and outcome.

That delta, expressed as a compact outcome packet with a semantic fingerprint (subject domain, proficiency tier, curriculum standard, learning mode, cohort size tier), routes to other systems modeling similar learning profiles. A homeschool platform in Montana can query for validated outcomes from systems that have worked with similar student profiles; a rural Kenyan school's outcome data is addressable without sharing a single student record.
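The article does not specify how the validation delta is scored. As an illustrative sketch only, a simple linear falloff could map the gap between predicted and actual exposures onto the 0.0–1.0 validation-score range used later in this post; the function and its tolerance parameter are assumptions, not part of any QIS specification:

```python
# Hypothetical scoring sketch: maps |predicted - actual| onto [0.0, 1.0].
# The linear falloff and the tolerance of 5 exposures are illustrative
# assumptions; the article does not define the actual scoring function.

def validation_score(predicted_exposures: int, actual_exposures: int,
                     tolerance: int = 5) -> float:
    """1.0 = exact prediction; 0.0 = off by `tolerance` exposures or more."""
    delta = abs(predicted_exposures - actual_exposures)
    return max(0.0, 1.0 - delta / tolerance)

# Predicted 10 exposures, actual 9: a small overestimate scores high.
print(round(validation_score(10, 9), 2))   # -> 0.8
# Predicted 8, actual 11: a larger miss scores lower.
print(round(validation_score(8, 11), 2))   # -> 0.4
```

Whatever the real scoring rule, the key property is that only the scalar score and the fingerprint leave the local system; the student record that produced them never does.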


The Math: Synthesis Paths at Scale

N learning systems — schools, platforms, tutoring systems — produce N(N-1)/2 unique synthesis paths.

  • 100 schools → 4,950 synthesis paths
  • 1,000 schools → 499,500 synthesis paths
  • 10,000 schools → 49,995,000 synthesis paths
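The figures above follow directly from the handshake formula; a trivial helper (a sketch, not part of any published QIS code) reproduces them:

```python
def synthesis_paths(n: int) -> int:
    """Unique unordered pairs among n systems: n choose 2 = n(n-1)/2."""
    return n * (n - 1) // 2

for n in (100, 1_000, 10_000):
    # Reproduces the three bullet figures: 4,950 / 499,500 / 49,995,000
    print(f"{n:>6} schools -> {synthesis_paths(n):,} synthesis paths")
```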

UNESCO estimates 1.5 billion students worldwide. The synthesis potential of even a fraction of their learning systems, if those systems could exchange validated outcome signals without centralizing student data, is extraordinary.

Current centralized personalized learning: each system calibrates in isolation. Real-time synthesis paths across systems: near zero.


Implementation: LearningOutcomePacket and Router

from dataclasses import dataclass, field
import time


@dataclass
class LearningFingerprint:
    subject_domain: str            # e.g., "math", "literacy", "science"
    proficiency_tier: str          # e.g., "developing", "approaching", "proficient"
    curriculum_standard_code: str  # e.g., "5.NF.A.1" (CCSS), "KE-G5-M" (local)
    learning_mode: str             # e.g., "self_paced", "teacher_led", "project_based", "hybrid"
    student_cohort_size_tier: str  # "small" (<30), "medium" (30-200), "large" (>200)


@dataclass
class LearningOutcomePacket:
    model_id: str                          # Opaque — no student PII, no school name
    learning_fingerprint: LearningFingerprint
    predicted_mastery_exposures: int       # Model predicted mastery after N exposures
    actual_mastery_exposures: int          # What actually happened
    validation_score: float                # 1.0 = perfect, 0.7 = close, 0.3 = far, 0.0 = miss
    timestamp: float = field(default_factory=time.time)


class LearningOutcomeRouter:
    """
    Routes learning outcome packets by fingerprint similarity × model trust × validation score.
    No student PII enters or exits the routing layer.
    """

    WEIGHTS = {
        "subject_domain": 0.35,
        "curriculum_standard_code": 0.25,
        "proficiency_tier": 0.20,
        "learning_mode": 0.15,
        "student_cohort_size_tier": 0.05,
    }

    def __init__(self):
        self._packets: list[LearningOutcomePacket] = []
        self._model_trust: dict[str, float] = {}
        self._model_packet_count: dict[str, int] = {}

    def ingest(self, packet: LearningOutcomePacket) -> None:
        """Ingest an outcome packet and update model trust weight."""
        self._packets.append(packet)
        model = packet.model_id
        count = self._model_packet_count.get(model, 0)
        prior_trust = self._model_trust.get(model, 0.5)
        new_trust = (prior_trust * count + packet.validation_score) / (count + 1)
        self._model_trust[model] = new_trust
        self._model_packet_count[model] = count + 1

    def _fingerprint_similarity(
        self, fp_a: LearningFingerprint, fp_b: LearningFingerprint
    ) -> float:
        """
        Weighted semantic similarity between two learning fingerprints.
        Curriculum standard receives partial credit for same-grade prefix match.
        """
        score = 0.0
        if fp_a.subject_domain == fp_b.subject_domain:
            score += self.WEIGHTS["subject_domain"]
        std_a, std_b = fp_a.curriculum_standard_code, fp_b.curriculum_standard_code
        if std_a == std_b:
            score += self.WEIGHTS["curriculum_standard_code"]
        elif std_a[:3] == std_b[:3]:
            score += self.WEIGHTS["curriculum_standard_code"] * 0.5
        if fp_a.proficiency_tier == fp_b.proficiency_tier:
            score += self.WEIGHTS["proficiency_tier"]
        if fp_a.learning_mode == fp_b.learning_mode:
            score += self.WEIGHTS["learning_mode"]
        if fp_a.student_cohort_size_tier == fp_b.student_cohort_size_tier:
            score += self.WEIGHTS["student_cohort_size_tier"]
        return score

    def route(
        self, query_fingerprint: LearningFingerprint, top_k: int = 5
    ) -> list[dict]:
        """
        Route a query fingerprint to the top-k most relevant validated outcome packets.
        Ranked by: fingerprint_similarity × model_trust × validation_score
        """
        scored = []
        for packet in self._packets:
            sim = self._fingerprint_similarity(query_fingerprint, packet.learning_fingerprint)
            trust = self._model_trust.get(packet.model_id, 0.5)
            composite = sim * trust * packet.validation_score
            scored.append({
                "model_id": packet.model_id,
                "similarity": round(sim, 3),
                "trust": round(trust, 3),
                "validation_score": packet.validation_score,
                "composite_score": round(composite, 4),
                "predicted_exposures": packet.predicted_mastery_exposures,
                "actual_exposures": packet.actual_mastery_exposures,
            })
        scored.sort(key=lambda x: x["composite_score"], reverse=True)
        return scored[:top_k]

    def synthesis_paths(self) -> int:
        n = len(self._model_trust)
        return n * (n - 1) // 2

    def network_summary(self) -> dict:
        n = len(self._model_trust)
        avg_trust = sum(self._model_trust.values()) / n if n > 0 else 0.0
        return {
            "models": n,
            "synthesis_paths": self.synthesis_paths(),
            "total_packets": len(self._packets),
            "avg_trust": round(avg_trust, 3),
        }


# --- Simulation ---

router = LearningOutcomeRouter()

# System 1: Large US urban district — grade 5 math, project-based learning
# Predicted 8 exposures for 5.NF.A.1 mastery; actual was 11 (underestimated complexity)
router.ingest(LearningOutcomePacket(
    model_id="sys_urban_us_district",
    learning_fingerprint=LearningFingerprint(
        subject_domain="math",
        proficiency_tier="developing",
        curriculum_standard_code="5.NF.A.1",
        learning_mode="project_based",
        student_cohort_size_tier="large",
    ),
    predicted_mastery_exposures=8,
    actual_mastery_exposures=11,
    validation_score=0.65,
))

# System 2: Khan Academy-style platform — grade 5 math, self-paced video
# Predicted 12 exposures; actual was 12 (near-perfect calibration)
router.ingest(LearningOutcomePacket(
    model_id="sys_khan_style_platform",
    learning_fingerprint=LearningFingerprint(
        subject_domain="math",
        proficiency_tier="developing",
        curriculum_standard_code="5.NF.A.1",
        learning_mode="self_paced",
        student_cohort_size_tier="large",
    ),
    predicted_mastery_exposures=12,
    actual_mastery_exposures=12,
    validation_score=1.0,
))

# System 3: Rural Kenya school — 15 students, grade 5 math, teacher-led
# Predicted 10 exposures; actual was 9 (slight overestimate)
# This school CANNOT participate in federated learning. In QIS: it participates fully.
router.ingest(LearningOutcomePacket(
    model_id="sys_rural_kenya_school",
    learning_fingerprint=LearningFingerprint(
        subject_domain="math",
        proficiency_tier="developing",
        curriculum_standard_code="5.NF.A.1",
        learning_mode="teacher_led",
        student_cohort_size_tier="small",
    ),
    predicted_mastery_exposures=10,
    actual_mastery_exposures=9,
    validation_score=0.85,
))

# Query: New homeschool network, 50 students, grade 5 math, hybrid learning
query = LearningFingerprint(
    subject_domain="math",
    proficiency_tier="developing",
    curriculum_standard_code="5.NF.A.1",
    learning_mode="hybrid",
    student_cohort_size_tier="small",
)

results = router.route(query, top_k=3)
summary = router.network_summary()

print("Network Summary:", summary)
# {'models': 3, 'synthesis_paths': 3, 'total_packets': 3, 'avg_trust': 0.833}

print("\nRouting Results for Homeschool Network Query:")
for r in results:
    print(r)
# sys_khan_style_platform: composite_score=0.8    (highest trust, perfect validation)
# sys_rural_kenya_school:  composite_score=0.6141 (same curriculum, honest signal)
# sys_urban_us_district:   composite_score=0.338  (different mode, lower trust)

The routing surfaces the Khan Academy-style platform first — highest trust, highest validation score, strong fingerprint overlap. The rural Kenya school ranks second: same subject, same proficiency tier, same curriculum standard, different learning mode. Its honest validation signal — "we predicted 10 exposures, actual was 9" — is precisely the calibration data that large platforms struggle to obtain: low-resource, high-variability, teacher-led classroom outcomes. That signal routes. It has value. The small school participates on equal technical footing.


The Three Elections in Education

The QIS architecture surfaces three natural selection forces — called the Three Elections — that function as metaphors for how prediction quality self-selects at scale, not as governance mechanisms.

CURATE. The best pedagogical model for a given learning profile rises through accumulated validation weight. A system that has correctly predicted mastery for 500 students with low prior math exposure accrues trust. A system that predicted incorrectly 60% of the time decays in routing priority. This is not editorial selection — it is the accumulated record of predictions tested against reality.

VOTE. Reality adjudicates. The student either mastered the concept in the predicted exposure window or did not. No vendor, no educator, no curriculum committee determines which model is right. The ground truth is in the learning record. Bloom's insight — that one-on-one tutoring works because the tutor receives immediate, unambiguous feedback about this specific student — is operationalized at network scale.

COMPETE. Systems that improve outcomes attract more participants. An adaptive learning platform that routes validated outcome intelligence calibrates faster than one that relies solely on its own historical data. The competitive pressure is toward calibration accuracy, not data accumulation.
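The CURATE dynamic is already concrete in the router above: ingest keeps each model's trust as a running mean of its validation scores. A standalone sketch of that same update (the 0.5 neutral prior matches the router's default for unseen models) shows how a chronically wrong predictor decays in routing priority:

```python
def update_trust(prior_trust: float, packet_count: int, score: float) -> float:
    """Running mean of validation scores, mirroring LearningOutcomeRouter.ingest.
    With packet_count == 0 the prior contributes nothing, so trust converges
    to the plain mean of observed scores."""
    return (prior_trust * packet_count + score) / (packet_count + 1)

trust, count = 0.5, 0  # neutral prior for a newly seen system
for score in (0.2, 0.3, 0.4, 0.2):  # a system that misses most predictions
    trust = update_trust(trust, count, score)
    count += 1
print(round(trust, 3))  # -> 0.275, well below the 0.5 prior
```

Because the prior is weighted by packet count, it vanishes after the first packet: sustained misses pull a system's routing priority down regardless of its starting point, and sustained accurate predictions raise it.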


Why LMIC Inclusion Is an Architectural Property, Not a Policy Goal

The rural Kenya school with 15 students cannot participate in federated learning: it has too little local data for meaningful gradient computation, and the FL communication protocol assumes reliable uptime. Both requirements exclude exactly the institutions where calibrated personalized learning would have the highest marginal impact.

In the QIS architecture, any system that can observe a student master or not master a concept can emit an outcome packet. The small school's honest validation signal — "we predicted 10 exposures for grade 5 fractions with this proficiency profile, actual was 9" — is the calibration data that Khan Academy, Duolingo, and large urban districts cannot easily obtain: low-resource, high-variability, teacher-led classroom outcomes with minimal digital scaffolding.

That signal has value in the network precisely because it is rare. The school participates not despite its size but with its size as a meaningful characteristic encoded in the cohort_size_tier field of the fingerprint. A homeschool network querying for similar-scale outcomes routes directly to it.

LMIC inclusion is not a policy goal imposed on the architecture. It is an architectural consequence of treating validation deltas — not data volume — as the unit of network value.


Privacy by Architecture

| Dimension | Centralized Personalized Learning | QIS Learning Outcome Routing |
| --- | --- | --- |
| Data privacy | Stores individual student learning records | Routes validation deltas — no student PII in the network layer |
| Small school inclusion | Requires large local dataset for meaningful calibration | Any school that observes one mastery event can emit a packet |
| Cold start (new system) | Weeks or months of data collection before calibration | Queries against all validated outcomes immediately by fingerprint similarity |
| Cross-system synthesis | Impossible without data-sharing agreements | N(N-1)/2 paths addressable at O(log N) routing cost |
| Equity of intelligence | Scales with budget and enrollment | Scales with network participation — 15-student rural school on equal technical footing |

A learning fingerprint containing [subject_domain="math", proficiency_tier="developing", curriculum_standard_code="5.NF.A.1", learning_mode="self_paced", student_cohort_size_tier="small"] cannot be used to re-identify any student. The cohort_size_tier field deliberately obscures exact enrollment. The model_id is opaque — no school name, no district identifier, no student PII enters the routing layer.

FERPA compliance and GDPR compliance are architectural consequences, not compliance theater. The student data that regulators seek to protect is not present in the network.


Closing: The Answer to Bloom

Bloom's 2-sigma result has a clear mechanism. One-on-one tutoring works because the tutor holds real-time, validated outcome feedback about this specific student — not about students in general — and adjusts immediately. The gap between prediction and outcome is never accumulated into a quarterly report. It informs the next moment of instruction.

The complete loop that Christopher Thomas Trevethan's discovery describes distributes that validated feedback — not the student data, not the raw records — across every learning system in the network. A homeschool parent in rural Montana benefits from calibration signals derived from 10,000 similar learning profiles without any of those students' data being centralized anywhere. A teacher in rural Kenya contributes calibration signals that improve outcomes for students she will never meet.

The 2-sigma problem is not a data problem. It is a routing problem. Forty years of accumulating student data in central servers has not closed the gap. Routing the validation signal might.


QIS was discovered by Christopher Thomas Trevethan. The architecture is protected under 39 provisional patents. The core discovery: a complete loop connecting prediction, outcome validation, and routing weight adjustment enables distributed intelligence systems to synthesize across N(N-1)/2 paths without centralizing the underlying data.
