Gary Doman/TizWildin

Posted on May 16

ARC-Hermes: Cryptographic Receipts and Replayable Memory for Hermes Agent

#ai #opensource #hermesagentchallenge #tizwildin

Hermes Agent Challenge Submission

This is a submission for the Hermes Agent Challenge.

What I Built

I built the architecture for ARC-Hermes: a Hermes Agent application layer that turns open agentic workflows into verifiable, binary-first project operations.

The idea is simple:

Hermes Agent = planner / tool user / multi-step worker
ARC = binary memory, receipts, replay, lineage, and proof

Most agent systems can complete a task and return a response. ARC-Hermes is focused on what happens after the task:

What files changed?
What artifacts were created?
What commands were run?
Can the output be restored exactly?
Can another machine verify the session?
Can the result become future model-training lineage?
Can a developer audit what the agent actually did?

ARC-Hermes treats agent work as a verifiable development session instead of a disposable chat transcript.

The Application Layer

ARC-Hermes connects Hermes Agent-style planning and tool use to my wider ARC ecosystem:

ARC-Core — event/authority spine for registering actions and receipts
ARC-Neuron LLMBuilder — local-first model building, benchmark receipts, dataset lineage, and candidate promotion
ARC-StreamMemory — visual/session memory capture direction
OmniBinary Runtime — binary mirror/storage discipline
Arc-RAR — portable archive, rollback, and replay bundle direction
ARC-Turbo-OS — lightweight runtime/worker operating layer direction
ARC-TurboMine — compute/proof/scoring lane direction
Seeded-Universe Recreation Engine / SURE — deterministic seed/reconstruction math reference
arc-language-module — lexical/language spine for source-traceable meaning
arc-lucifer-cleanroom-runtime — local cleanroom/runtime execution context
arc-cognition-core — cognition/planning/reasoning control direction
Proto-Synth Grid Engine — visual operator shell / neural-synth style project graph
gh-ai-operator — GitHub repo discovery, workflow automation, and project-networking operator direction
ARC-Fusion — FFmpeg-backed media memory/proof runtime
ARC-Emulator — deterministic replay and binary proof framework for emulator sessions

The goal is not just to make an agent do a task. The goal is to make the task inspectable, portable, reproducible, and useful across a real developer ecosystem.

Demo

The current demo is an architectural and repo-oriented workflow.

A developer asks Hermes Agent to audit a repo, create a release artifact, and produce a verified result.

Conceptual terminal flow:

arc-hermes run "audit this repo, create a release package, and produce a verified artifact"
arc-hermes pack ./dist/release.zip
arc-hermes receipt ./dist/release.manifest.json
arc-hermes verify ./dist/release.receipt.json
arc-hermes export-replay ./dist/session.arcpack

The important part is the receipt chain:

user request
  -> Hermes Agent plan
  -> tool execution
  -> produced files
  -> binary objects
  -> SHA-256 payload hashes
  -> chunk hashes
  -> Merkle roots
  -> manifests
  -> receipts
  -> replay/export bundle

Practical Applications

1. Verified GitHub Operator

Hermes Agent can plan repo discovery, issue drafting, README improvement, dependency cleanup, and release-package creation.

ARC records:

which repos were inspected
which files were changed
which artifacts were generated
what command plan was used
what receipt proves the output

This connects directly to my gh-ai-operator and ARC-Core direction.

2. Local-First AI Memory

Instead of storing agent memory only as text logs, ARC-Hermes stores durable objects as binary payloads first.

source file -> binary object -> hash -> manifest -> receipt

That creates a local second-brain system where project history can be restored, verified, searched, and used as future model lineage.

3. Media Proof and Automation

With ARC-Fusion, Hermes Agent could plan media operations while ARC-Fusion executes FFmpeg-backed jobs and receipts the output.

Applications:

verified video/audio export pipelines
frame extraction for StreamMemory
dataset generation from media
release-package verification
reproducible media transformations

4. Emulator Session Proof

With ARC-Emulator, the same pattern applies to game/emulation sessions:

ROM/disc manifesting
save-state proof
input timeline receipts
replay verification
StreamMemory session capture

This gives agentic systems a way to reason over gameplay sessions or test runs without losing provenance.

5. Dataset and Model Lineage

ARC-Neuron LLMBuilder can consume verified source objects instead of disconnected loose files.

A dataset row can trace back to:

dataset row -> source file hash -> project receipt -> repo/session manifest -> original binary object

That makes AI training more auditable.

6. Deterministic Reconstruction Research

SURE is the research side of the ecosystem: storing exact objects when needed, but also storing deterministic recipes when a generated structure can be recreated from seed + parameters + expected hash.

This is useful for:

procedural worlds
simulation states
generated visual graphs
reproducible test environments
synthetic media layouts
long-running project timelines

I treat SURE as a math reference layer for deterministic reconstruction, not as a replacement for normal storage.

Code

Main ecosystem links:

GitHub profile: https://github.com/GareBear99
ARC-Neuron LLMBuilder: https://github.com/GareBear99/ARC-Neuron-LLMBuilder
ARC-Core: https://github.com/GareBear99/ARC-Core
gh-ai-operator: https://github.com/GareBear99/gh-ai-operator
Rift Ascent demo: https://garebear99.github.io/RiftAscent/

Related project directions in this ARC-Hermes stack:

ARC-Apache — binary-first cryptographic memory substrate
ARC-Fusion — ARC-native media proof runtime using FFmpeg as a backend
ARC-Emulator — binary-first deterministic replay framework, starting with N64, then GameCube, PS2, and PSP

My Tech Stack

Python CLI tooling
FastAPI-style route boundaries for ARC-Core
SQLite for local indexes and receipts
SHA-256 payload hashing
chunked binary object storage
Merkle roots for large payload verification
JSON schemas for manifests and receipts
local-first project archives
FFmpeg-backed media processing through ARC-Fusion
deterministic replay/save-state concepts through ARC-Emulator
GitHub automation through gh-ai-operator
language/data lineage through ARC-Neuron LLMBuilder and arc-language-module

The design principle is:

Use text for humans.
Use binary objects for truth.
Use receipts for authority.
Use Hermes Agent for planning and tool orchestration.

How I Used Hermes Agent

Hermes Agent is the right fit because the project is about multi-step agentic work, not one-shot prompting.

In ARC-Hermes, Hermes Agent would be responsible for:

breaking a developer request into a tool plan
deciding which repo/module should handle the work
calling controlled tools
inspecting results
repairing failed steps
summarizing the outcome
handing durable outputs to ARC for proof and storage

ARC then handles the trust side:

binary packing
payload hashes
chunk hashes
Merkle roots
manifests
receipts
replay bundles
future ARC-Core authority registration

The goal is not to make Hermes Agent remember everything inside a hidden agent state. The goal is to let Hermes Agent operate in a system where every important output becomes externally verifiable.

That changes the agent pattern from:

agent says it did something

to:

agent did something, and the result has a hash, manifest, receipt, and replay path

Why This Matters

Agentic AI is moving fast, but developers still need trust.

For serious work, I want agent systems that can answer:

What exactly did you change?
Can I verify that artifact?
Can I roll it back?
Can I replay the session?
Can I move the memory to another machine?
Can I use the result as model-training lineage later?

ARC-Hermes is my answer to that: Hermes Agent for planning and action, ARC for proof and continuity.

What I Would Build Next

The next milestone is a working adapter:

Hermes Agent task
  -> ARC-Hermes tool envelope
  -> repo/media/runtime operation
  -> binary manifest
  -> receipt
  -> ARC-Core registration
  -> replay/export bundle

The first concrete use case would be a verified GitHub operator:

Find related repos.
Review project fit.
Draft useful issues or PRs.
Package the results.
Receipt every artifact.
Export a session bundle.

That would make agentic development more useful, more auditable, and more portable.

Final Thought

To me, the future of agents is not just smarter chat.

It is agents that can work across real repos, real files, real media, real datasets, and real project memory while leaving behind proof that developers can inspect.

Hermes Agent provides the open agentic workflow side.

ARC provides the binary-first proof and memory side.

Together, that points toward a practical local-first agent runtime for developers who care about reproducibility, provenance, and control.

DEV Community