<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Henry Yau</title>
    <description>The latest articles on DEV Community by Henry Yau (@henry_yau_2266).</description>
    <link>https://dev.to/henry_yau_2266</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3992781%2Fefd48338-4e5e-4832-b8e8-a51638b20c8e.png</url>
      <title>DEV Community: Henry Yau</title>
      <link>https://dev.to/henry_yau_2266</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/henry_yau_2266"/>
    <language>en</language>
    <item>
      <title>Building a CompTIA practice-test engine that gives students source-verified questions</title>
      <dc:creator>Henry Yau</dc:creator>
      <pubDate>Sat, 20 Jun 2026 04:22:12 +0000</pubDate>
      <link>https://dev.to/henry_yau_2266/building-a-comptia-practice-test-engine-that-gives-students-source-verified-questions-332</link>
      <guid>https://dev.to/henry_yau_2266/building-a-comptia-practice-test-engine-that-gives-students-source-verified-questions-332</guid>
      <description>&lt;p&gt;If you've ever studied for a CompTIA exam with free practice questions online, you already know the problem: most "practice tests" are recycled exam dumps. The same leaked pool, copy-pasted across a dozen sites, with a worrying number of "correct" answers that are subtly wrong and no indication of where any of it came from.&lt;/p&gt;

&lt;p&gt;For a student, that's worse than useless. You can memorize a wrong answer with total confidence and walk into the exam having actively trained the mistake.&lt;/p&gt;

&lt;p&gt;I wanted to build a practice-test engine that made the opposite promise to students: every answer you see can be traced back to a primary source. Not "I checked most of them." Not "a reviewer will probably catch the bad ones." A hard guarantee, enforced by code. This post is how I turned that promise into an actual pipeline — and what it cost.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The guarantee, stated as a rule&lt;/strong&gt;&lt;br&gt;
The whole engine is built around one deliberately unreasonable constraint:&lt;br&gt;
If a question's correct answer can't be traced to a primary source, the pipeline isn't allowed to publish it.&lt;/p&gt;

&lt;p&gt;That single rule is what separates a study tool a student can trust from a dump that quietly teaches them wrong. Everything below is the machinery that makes the rule true by default instead of by good intentions.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Correctness shouldn't be a vibe&lt;/strong&gt;&lt;br&gt;
The usual way AI-generated quiz content gets made is: prompt a model, get 50 questions, skim them, ship. The correctness of any given question is a judgment someone makes (or forgets to make) at review time. That doesn't scale, and it isn't reproducible — the same question might pass on Monday and fail on Tuesday depending on who's looking.&lt;/p&gt;

&lt;p&gt;I wanted correctness to be a property of the engine, not a per-question opinion. The same way you don't "review" whether your tests pass — they go green or they don't — a question either carries proof or it gets killed automatically.&lt;br&gt;
That proof is what I call a source receipt.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The receipt&lt;/strong&gt;&lt;br&gt;
Every question that survives generation has to carry a verbatim excerpt from the primary source material that justifies its correct answer. Not a vibe, not "the model said so" — an actual quoted span, stored on the question object, that a human or a script can check.&lt;br&gt;
A question ends up looking roughly like this:&lt;br&gt;
&lt;code&gt;json{&lt;br&gt;
  "id": "netplus-1-0012",&lt;br&gt;
  "objective": "1.2",&lt;br&gt;
  "stem": "Which transport protocol establishes a session before data transfer?",&lt;br&gt;
  "answer": "TCP",&lt;br&gt;
  "evidence": {&lt;br&gt;
    "excerpt": "...connection-oriented transport establishes a session prior to exchange, in contrast to connectionless transport...",&lt;br&gt;
    "source": "official exam objectives, domain 1.2"&lt;br&gt;
  }&lt;br&gt;
}&lt;/code&gt;&lt;br&gt;
That evidence block is the whole game. It's a durable, stored receipt — it travels with the question forever, so a student (or I) can audit correctness months later, not just at the moment of generation.&lt;br&gt;
There's a hard rule layered on top that matters both pedagogically and legally: approximate and explain, never reproduce. The receipt grounds the question in the real objective, but the question and explanation are written fresh. I'm not republishing a copyrighted exam pool — the engine generates original questions that are provably aligned to public objectives. That distinction is the entire reason a student can use this without studying leaked material.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The gates&lt;/strong&gt;&lt;br&gt;
Generation is the easy part. The interesting engineering is everything that tries to stop a bad question from ever reaching a student.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;The adversarial verifier. After a question is drafted, a second pass plays prosecutor instead of author. Its only job is to attack: does the stored excerpt actually support the marked answer? Is there a more correct option? Is the excerpt being stretched to cover a claim it doesn't make? The generator wants to ship; the verifier wants to reject. Correctness lives in the gap between them.&lt;/li&gt;
&lt;li&gt;check-mocks.mjs. A CI script that validates structural integrity across the whole bank — every question has a receipt, every receipt is non-empty, every answer maps to a real option, no orphaned references. It runs in CI and fails the build if anything's off.&lt;/li&gt;
&lt;li&gt;The blueprint-sum gate. CompTIA publishes domain weightings (e.g. domain 1 is X% of the exam). A gate checks the generated distribution actually matches the official blueprint, so a student's practice set mirrors the real exam shape instead of over-indexing on whatever was easy to generate.&lt;/li&gt;
&lt;li&gt;The NO-GO gate — the part I'm most proud of. When verification confidence drops below threshold, the question is cut. I deliberately tuned this gate to be trigger-happy: it currently runs at roughly a 24% false-cut rate, meaning about a quarter of the questions it kills were probably fine.
That sounds like a bug. It's the most important design decision in the system.
Shipping a confidently-wrong answer to a student costs them real money and a failed exam. Dropping a good question costs me nothing but a little generation budget. The failure modes are wildly asymmetric, so I tuned the gate toward the cheap failure. For a student-facing tool, over-cutting is a feature.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;code&gt;text$ node check-mocks.mjs&lt;br&gt;
  scanned:   187 candidates&lt;br&gt;
  verified:  142&lt;br&gt;
  NO-GO:     45   (confidence &amp;lt; threshold)&lt;br&gt;
  → 142 shipped, receipts attached&lt;/code&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The deliberately boring stack&lt;/strong&gt;&lt;br&gt;
None of this needs a heavy framework, and reaching for one would've been a mistake. The site is a vanilla JS quiz engine on Cloudflare Pages — static, fast, no build step to babysit, free to host. A weekly CI job re-runs the gates and flags staleness when objectives change. The discipline is in the pipeline, not the runtime.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;It's live, and free for students&lt;/strong&gt;&lt;br&gt;
The engine currently feeds six CompTIA tracks — Network+, Security+, A+ Core 1 &amp;amp; 2, CySA+, and PenTest+ — all free, no signup, no paywall. You can try it here: &lt;br&gt;
&lt;a href="https://certpracticelab.com/" rel="noopener noreferrer"&gt;certpracticelab&lt;/a&gt;&lt;br&gt;
&lt;a href="https://certpracticelab.com/a-plus-core1" rel="noopener noreferrer"&gt;Core1&lt;/a&gt;&lt;br&gt;
&lt;a href="https://certpracticelab.com/a-plus-core2" rel="noopener noreferrer"&gt;Core2&lt;/a&gt;&lt;br&gt;
&lt;a href="https://certpracticelab.com/network-plus" rel="noopener noreferrer"&gt;Network+&lt;/a&gt;&lt;br&gt;
&lt;a href="https://certpracticelab.com/security-plus" rel="noopener noreferrer"&gt;Security+&lt;/a&gt;&lt;br&gt;
&lt;a href="https://certpracticelab.com/cysa-plus" rel="noopener noreferrer"&gt;CySA+&lt;/a&gt;&lt;br&gt;
&lt;a href="https://certpracticelab.com/pentest-plus" rel="noopener noreferrer"&gt;PenTest+&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Every answer a student sees carries a receipt behind it. That's the promise, and it's enforced by code rather than by my good intentions.&lt;/p&gt;

</description>
      <category>shadowdev</category>
      <category>ai</category>
      <category>javascript</category>
      <category>webdev</category>
    </item>
  </channel>
</rss>
