Todd Sullivan

Posted on Jun 30

The Exporter Was Easy. Making It Deterministic Was the Work.

#ai #testing #typescript #mobile

I spent this week building a mobile export pipeline for UK energy assessment data.

The output is not glamorous: take completed survey responses, turn them into an RdSAP XML file, package the evidence, write a manifest, calculate a checksum, and keep a local audit trail.

The interesting part was not generating XML. The interesting part was making the whole thing deterministic and testable before adding more export formats.

The shape ended up like this:

responses
  -> buildAssessment()
  -> validateAssessment()
  -> exporter.serialize(assessment)
  -> sha256
  -> write package
  -> persist audit log

The exporter itself is deliberately boring:

serialize(assessment): string

No file system. No clock. No database. No random IDs. Same assessment in, same bytes out.

That one rule makes the rest of the pipeline much easier to reason about. If the XML changes, it changed because the input changed or the serializer changed. Not because a timestamp moved, a native module behaved differently, or a test environment had a different document directory.
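A minimal sketch of what that contract can look like. The `Assessment` shape, the `Exporter` interface members, and the element names are hypothetical stand-ins for illustration; they are not the RdSAP schema or the real module's types.

```typescript
// Hypothetical assessment shape; the real model is not shown in this post.
type Assessment = {
  inspectionId: string;
  schemaVersion: string;
  fields: Record<string, string>;
};

// Assumed exporter contract: one pure method, no IO.
interface Exporter {
  readonly format: string;
  serialize(assessment: Assessment): string;
}

// Escape the five XML special characters so field values cannot break the markup.
function escapeXml(value: string): string {
  return value
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&apos;");
}

// Compare by code unit, not locale, so ordering is the same on every device.
function byKey([a]: [string, string], [b]: [string, string]): number {
  return a < b ? -1 : a > b ? 1 : 0;
}

const xmlExporter: Exporter = {
  format: "xml",
  serialize(assessment) {
    // Sort explicitly so the output never depends on object insertion order.
    const fieldLines = Object.entries(assessment.fields)
      .sort(byKey)
      .map(([key, value]) => `  <Field name="${escapeXml(key)}">${escapeXml(value)}</Field>`);
    return [
      `<?xml version="1.0" encoding="UTF-8"?>`,
      `<Assessment id="${escapeXml(assessment.inspectionId)}" schemaVersion="${escapeXml(assessment.schemaVersion)}">`,
      ...fieldLines,
      `</Assessment>`,
      "",
    ].join("\n");
  },
};
```

Nothing in `serialize` reads a clock, a file, or a random source, so two calls with equal input produce equal strings.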

Side effects at the edge

The orchestrator is the only layer allowed to do side effects:

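// Pure step: no IO, same assessment in, same bytes out.
const body = exporter.serialize(assessment);
// Side effects start here, all routed through the injected deps.
const checksum = await deps.sha256Hex(body);
const pkg = await writeExportPackage(
  { inspectionId, exporter, assessment, body, checksum },
  deps.fs
);
// `log` is the audit entry for this run; building it is omitted from this excerpt.
await deps.persistLog(log);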

Those dependencies are ports:

export type ExportDeps = {
  fs: FileSystemPort;
  sha256Hex: (input: string) => Promise<string>;
  persistLog: (entry: ExportLogEntry) => Promise<void>;
};

In production, they map to Expo file system, Expo crypto, and a local SQLite audit table.

In tests, they are an in-memory file system, Node crypto, and an array log sink.

That was not architectural theatre. Under jest-expo, native modules are often partially unavailable. documentDirectory can be undefined. Crypto enums can be missing. If the export logic directly imports and touches those modules, the “end-to-end” test either becomes a mock festival or stops covering the actual path.
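A rough sketch of the in-memory side, to show how little it takes. `ExportDeps` is as above; the members of `FileSystemPort` and `ExportLogEntry` are assumptions made for this example, since the real port surface is not shown here.

```typescript
import { createHash } from "node:crypto";

// Assumed port members; the real FileSystemPort may expose a different surface.
type FileSystemPort = {
  writeText(path: string, contents: string): Promise<void>;
  readText(path: string): Promise<string>;
  exists(path: string): Promise<boolean>;
};

// Assumed log entry fields, reduced to the minimum for the example.
type ExportLogEntry = {
  inspectionId: string;
  success: boolean;
  checksum: string | null;
};

type ExportDeps = {
  fs: FileSystemPort;
  sha256Hex: (input: string) => Promise<string>;
  persistLog: (entry: ExportLogEntry) => Promise<void>;
};

function memoryDeps(): {
  deps: ExportDeps;
  files: Map<string, string>;
  logs: ExportLogEntry[];
} {
  const files = new Map<string, string>(); // path -> contents
  const logs: ExportLogEntry[] = []; // array log sink
  const deps: ExportDeps = {
    fs: {
      async writeText(path, contents) {
        files.set(path, contents);
      },
      async readText(path) {
        const contents = files.get(path);
        if (contents === undefined) throw new Error(`No such file: ${path}`);
        return contents;
      },
      async exists(path) {
        return files.has(path);
      },
    },
    // Node crypto stands in for the native hashing module.
    sha256Hex: async (input) => createHash("sha256").update(input, "utf8").digest("hex"),
    persistLog: async (entry) => {
      logs.push(entry);
    },
  };
  return { deps, files, logs };
}
```

Returning `files` and `logs` alongside `deps` lets a test inspect what the pipeline wrote without mocking anything.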

With ports, the test runs the real pipeline:

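import { createHash } from "node:crypto";

// memoryDeps() wires the in-memory file system, Node crypto, and array log sink.
const { deps, files, logs } = memoryDeps();
const run = await runExport(input(), deps);

// The bytes written to the package must hash to the checksum the pipeline reported.
const body = files.get(expectedPath)!;
expect(run.result.checksum).toBe(
  createHash("sha256").update(body, "utf8").digest("hex")
);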

That test proves a few things at once:

the package path is correct
the written XML is exactly the pure exporter output
the checksum is a real SHA-256 of the bytes
the manifest references the same checksum
one audit entry is written

There are also tests for evidence attachment copying, missing attachment sources, validation-blocked exports, checksum reproducibility, and a second JSON archive exporter running through the same pipeline.

Validation blocks, but still logs

One design choice I care about: failed validation writes an audit record too.

If a mandatory field is missing, the export does not create files. But the attempt is still logged with success: false, validation counts, schema version, and no checksum.

That makes the export feature behave like a real operational system rather than a button that either emits a file or silently refuses.
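Roughly, the blocked path looks like the sketch below. Everything here is simplified for illustration: `runExportSketch`, the `Deps` members, the log field names, and the output path are hypothetical, and validation and serialization are assumed to have run already.

```typescript
type ValidationIssue = { field: string; severity: "error" | "warning" };

// Hypothetical field names for the audit entry.
type ExportLogEntry = {
  inspectionId: string;
  success: boolean;
  errorCount: number;
  warningCount: number;
  schemaVersion: string;
  checksum: string | null;
};

// Reduced set of injected side effects for this example.
type Deps = {
  writeFile: (path: string, body: string) => Promise<void>;
  sha256Hex: (input: string) => Promise<string>;
  persistLog: (entry: ExportLogEntry) => Promise<void>;
};

type ExportInput = {
  inspectionId: string;
  schemaVersion: string;
  issues: ValidationIssue[]; // result of validation, assumed computed earlier
  body: string; // output of the pure serializer
};

async function runExportSketch(input: ExportInput, deps: Deps): Promise<ExportLogEntry> {
  const errorCount = input.issues.filter((issue) => issue.severity === "error").length;
  const warningCount = input.issues.length - errorCount;
  const base = {
    inspectionId: input.inspectionId,
    errorCount,
    warningCount,
    schemaVersion: input.schemaVersion,
  };

  if (errorCount > 0) {
    // Blocked: no files and no checksum, but the attempt is still recorded.
    const blocked: ExportLogEntry = { ...base, success: false, checksum: null };
    await deps.persistLog(blocked);
    return blocked;
  }

  const checksum = await deps.sha256Hex(input.body);
  // Hypothetical package path, for illustration only.
  await deps.writeFile(`exports/${input.inspectionId}/assessment.xml`, input.body);
  const ok: ExportLogEntry = { ...base, success: true, checksum };
  await deps.persistLog(ok);
  return ok;
}
```

Both branches end in `persistLog`, so the audit table records every attempt, not only the successful ones.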

Why this matters for AI-built software

This is the seam I want when using AI heavily in engineering work.

LLMs are very good at producing a first version of “convert this shape into that schema.” They are much less reliable if the codebase lets schema mapping, file IO, logging, hashing, and UI state collapse into one blob.

The fix is not to use less AI. The fix is to give the code stronger boundaries:

pure transformations for the model or human to edit safely
deterministic outputs that can be snapshot-tested or checksummed
injected IO so tests exercise the pipeline without native dependencies
audit trails for both success and blocked paths
format-specific exporters behind a small registry

Once those seams exist, adding the next export format is not a rewrite. It is another serializer plus a few tests.
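As a sketch of how small that registry can be: the names below (`registerExporter`, `getExporter`, the `id` and `fileExtension` members, the `"json-archive"` key) are illustrative assumptions, not the module's actual API.

```typescript
// Hypothetical assessment shape, reduced for the example.
type Assessment = { inspectionId: string; fields: Record<string, string> };

interface Exporter {
  readonly id: string;
  readonly fileExtension: string;
  serialize(assessment: Assessment): string;
}

const registry = new Map<string, Exporter>();

function registerExporter(exporter: Exporter): void {
  if (registry.has(exporter.id)) {
    throw new Error(`Exporter already registered: ${exporter.id}`);
  }
  registry.set(exporter.id, exporter);
}

function getExporter(id: string): Exporter {
  const exporter = registry.get(id);
  if (!exporter) throw new Error(`Unknown export format: ${id}`);
  return exporter;
}

// Adding a format means registering one more pure serializer.
registerExporter({
  id: "json-archive",
  fileExtension: "json",
  serialize(assessment) {
    // Rebuild the fields object in sorted key order before stringifying.
    const sortedFields: Record<string, string> = {};
    const entries = Object.entries(assessment.fields).sort(([a], [b]) =>
      a < b ? -1 : a > b ? 1 : 0
    );
    for (const [key, value] of entries) sortedFields[key] = value;
    return (
      JSON.stringify({ inspectionId: assessment.inspectionId, fields: sortedFields }, null, 2) + "\n"
    );
  },
});
```

The orchestrator only ever sees the `Exporter` interface, so the hashing, packaging, and logging steps stay the same for every format.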

That is the part I keep finding in real AI-assisted development: the model can help move fast, but the architecture has to make fast changes safe.

Source: Recent mobile export module work: RdSAP XML export pipeline, pure serializers, injectable IO ports, SHA-256 packaging, local audit logging, and 24 export tests.

Top comments (2)

Viktor • Jun 30

"Same assessment in, same bytes out" with side effects pushed to the edge is the right backbone - once the serializer is pure, a diff in the output actually means something.

The leak I'd watch for, because it bites after you think you're done: determinism usually breaks on hidden ordering, not the obvious clock/random you already removed. Map/Set iteration, JSON or XML attribute order, locale-dependent number and date formatting, float formatting, encoding. A pure function with no clock can still emit different bytes on a different platform or runtime version if it serializes a hash map or leans on default toString. So the rule I'd put right next to "no clock, no random" is "no implicit ordering or locale" - sort keys explicitly, pin the number/date format, fix encoding.

Cheapest guard is a golden-file test plus serialize-it-twice-and-assert-equal, and the twice-assert is the one that catches the ordering stuff a single golden run silently blesses. Bonus if you round-trip parse(serialize(x)) == x, that catches lossy fields the checksum won't.
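A small sketch of those guards, assuming a JSON serializer for brevity (the same idea applies to XML). The `canonicalize`, `serialize`, and `parse` helpers are hypothetical, and the twice-check here is a slightly stronger variant that feeds in equal data with different key order.

```typescript
type Json = string | number | boolean | null | Json[] | { [key: string]: Json };

// Recursively rebuild objects with sorted keys so stringify sees one fixed order.
function canonicalize(value: Json): Json {
  if (Array.isArray(value)) return value.map((item) => canonicalize(item));
  if (value !== null && typeof value === "object") {
    const out: { [key: string]: Json } = {};
    const entries = Object.entries(value).sort(([a], [b]) => (a < b ? -1 : a > b ? 1 : 0));
    for (const [key, child] of entries) out[key] = canonicalize(child);
    return out;
  }
  return value; // primitives pass through unchanged
}

function serialize(value: Json): string {
  return JSON.stringify(canonicalize(value));
}

function parse(text: string): Json {
  return JSON.parse(text) as Json;
}

// Guard 1: serialize twice from equal data with different key order.
const first = serialize({ b: 1, a: { d: [1, 2], c: "x" } });
const second = serialize({ a: { c: "x", d: [1, 2] }, b: 1 });
if (first !== second) throw new Error("serializer depends on key order");

// Guard 2: round trip. Re-serializing the parsed output must give the same bytes.
const roundTrip = serialize(parse(first));
if (roundTrip !== first) throw new Error("serializer is lossy");
```

A golden-file test would then pin `first` against a checked-in expected string, so any later change to the bytes shows up as a diff.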

Todd Sullivan • Jun 30

Thanks Viktor — completely agree. The boring failures are usually the implicit ones.

The “pure serializer” bit only really holds if the inputs are normalised too: explicit sort order, pinned date/number formatting, fixed encoding, no default map iteration.

I like the serialize-twice guard as a cheap smoke test as well. That plus golden files feels like the right belt-and-braces version of this. Sharp read, appreciate it.