DEV Community

Gary Doman/TizWildin
Gary Doman/TizWildin

Posted on

ARC-Hermes: Cryptographic Receipts and Replayable Memory for Hermes Agent

Hermes Agent Challenge Submission

This is a submission for the Hermes Agent Challenge.

What I Built

I built the architecture for ARC-Hermes: a Hermes Agent application layer that turns open agentic workflows into verifiable, binary-first project operations.

The idea is simple:

Hermes Agent = planner / tool user / multi-step worker
ARC = binary memory, receipts, replay, lineage, and proof
Enter fullscreen mode Exit fullscreen mode

Most agent systems can complete a task and return a response. ARC-Hermes is focused on what happens after the task:

  • What files changed?
  • What artifacts were created?
  • What commands were run?
  • Can the output be restored exactly?
  • Can another machine verify the session?
  • Can the result become future model-training lineage?
  • Can a developer audit what the agent actually did?

ARC-Hermes treats agent work as a verifiable development session instead of a disposable chat transcript.

The Application Layer

ARC-Hermes connects Hermes Agent-style planning and tool use to my wider ARC ecosystem:

  • ARC-Core — event/authority spine for registering actions and receipts
  • ARC-Neuron LLMBuilder — local-first model building, benchmark receipts, dataset lineage, and candidate promotion
  • ARC-StreamMemory — visual/session memory capture direction
  • OmniBinary Runtime — binary mirror/storage discipline
  • Arc-RAR — portable archive, rollback, and replay bundle direction
  • ARC-Turbo-OS — lightweight runtime/worker operating layer direction
  • ARC-TurboMine — compute/proof/scoring lane direction
  • Seeded-Universe Recreation Engine / SURE — deterministic seed/reconstruction math reference
  • arc-language-module — lexical/language spine for source-traceable meaning
  • arc-lucifer-cleanroom-runtime — local cleanroom/runtime execution context
  • arc-cognition-core — cognition/planning/reasoning control direction
  • Proto-Synth Grid Engine — visual operator shell / neural-synth style project graph
  • gh-ai-operator — GitHub repo discovery, workflow automation, and project-networking operator direction
  • ARC-Fusion — FFmpeg-backed media memory/proof runtime
  • ARC-Emulator — deterministic replay and binary proof framework for emulator sessions

The goal is not just to make an agent do a task. The goal is to make the task inspectable, portable, reproducible, and useful across a real developer ecosystem.

Demo

The current demo is an architectural and repo-oriented workflow.

A developer asks Hermes Agent to audit a repo, create a release artifact, and produce a verified result.

Conceptual terminal flow:

arc-hermes run "audit this repo, create a release package, and produce a verified artifact"
arc-hermes pack ./dist/release.zip
arc-hermes receipt ./dist/release.manifest.json
arc-hermes verify ./dist/release.receipt.json
arc-hermes export-replay ./dist/session.arcpack
Enter fullscreen mode Exit fullscreen mode

The important part is the receipt chain:

user request
  -> Hermes Agent plan
  -> tool execution
  -> produced files
  -> binary objects
  -> SHA-256 payload hashes
  -> chunk hashes
  -> Merkle roots
  -> manifests
  -> receipts
  -> replay/export bundle
Enter fullscreen mode Exit fullscreen mode

Practical Applications

1. Verified GitHub Operator

Hermes Agent can plan repo discovery, issue drafting, README improvement, dependency cleanup, and release-package creation.

ARC records:

  • which repos were inspected
  • which files were changed
  • which artifacts were generated
  • what command plan was used
  • what receipt proves the output

This connects directly to my gh-ai-operator and ARC-Core direction.

2. Local-First AI Memory

Instead of storing agent memory only as text logs, ARC-Hermes stores durable objects as binary payloads first.

source file -> binary object -> hash -> manifest -> receipt
Enter fullscreen mode Exit fullscreen mode

That creates a local second-brain system where project history can be restored, verified, searched, and used as future model lineage.

3. Media Proof and Automation

With ARC-Fusion, Hermes Agent could plan media operations while ARC-Fusion executes FFmpeg-backed jobs and receipts the output.

Applications:

  • verified video/audio export pipelines
  • frame extraction for StreamMemory
  • dataset generation from media
  • release-package verification
  • reproducible media transformations

4. Emulator Session Proof

With ARC-Emulator, the same pattern applies to game/emulation sessions:

  • ROM/disc manifesting
  • save-state proof
  • input timeline receipts
  • replay verification
  • StreamMemory session capture

This gives agentic systems a way to reason over gameplay sessions or test runs without losing provenance.

5. Dataset and Model Lineage

ARC-Neuron LLMBuilder can consume verified source objects instead of disconnected loose files.

A dataset row can trace back to:

dataset row -> source file hash -> project receipt -> repo/session manifest -> original binary object
Enter fullscreen mode Exit fullscreen mode

That makes AI training more auditable.

6. Deterministic Reconstruction Research

SURE is the research side of the ecosystem: storing exact objects when needed, but also storing deterministic recipes when a generated structure can be recreated from seed + parameters + expected hash.

This is useful for:

  • procedural worlds
  • simulation states
  • generated visual graphs
  • reproducible test environments
  • synthetic media layouts
  • long-running project timelines

I treat SURE as a math reference layer for deterministic reconstruction, not as a replacement for normal storage.

Code

Main ecosystem links:

Related project directions in this ARC-Hermes stack:

  • ARC-Apache — binary-first cryptographic memory substrate
  • ARC-Fusion — ARC-native media proof runtime using FFmpeg as a backend
  • ARC-Emulator — binary-first deterministic replay framework, starting with N64, then GameCube, PS2, and PSP

My Tech Stack

  • Python CLI tooling
  • FastAPI-style route boundaries for ARC-Core
  • SQLite for local indexes and receipts
  • SHA-256 payload hashing
  • chunked binary object storage
  • Merkle roots for large payload verification
  • JSON schemas for manifests and receipts
  • local-first project archives
  • FFmpeg-backed media processing through ARC-Fusion
  • deterministic replay/save-state concepts through ARC-Emulator
  • GitHub automation through gh-ai-operator
  • language/data lineage through ARC-Neuron LLMBuilder and arc-language-module

The design principle is:

Use text for humans.
Use binary objects for truth.
Use receipts for authority.
Use Hermes Agent for planning and tool orchestration.
Enter fullscreen mode Exit fullscreen mode

How I Used Hermes Agent

Hermes Agent is the right fit because the project is about multi-step agentic work, not one-shot prompting.

In ARC-Hermes, Hermes Agent would be responsible for:

  • breaking a developer request into a tool plan
  • deciding which repo/module should handle the work
  • calling controlled tools
  • inspecting results
  • repairing failed steps
  • summarizing the outcome
  • handing durable outputs to ARC for proof and storage

ARC then handles the trust side:

  • binary packing
  • payload hashes
  • chunk hashes
  • Merkle roots
  • manifests
  • receipts
  • replay bundles
  • future ARC-Core authority registration

The goal is not to make Hermes Agent remember everything inside a hidden agent state. The goal is to let Hermes Agent operate in a system where every important output becomes externally verifiable.

That changes the agent pattern from:

agent says it did something
Enter fullscreen mode Exit fullscreen mode

to:

agent did something, and the result has a hash, manifest, receipt, and replay path
Enter fullscreen mode Exit fullscreen mode

Why This Matters

Agentic AI is moving fast, but developers still need trust.

For serious work, I want agent systems that can answer:

  • What exactly did you change?
  • Can I verify that artifact?
  • Can I roll it back?
  • Can I replay the session?
  • Can I move the memory to another machine?
  • Can I use the result as model-training lineage later?

ARC-Hermes is my answer to that: Hermes Agent for planning and action, ARC for proof and continuity.

What I Would Build Next

The next milestone is a working adapter:

Hermes Agent task
  -> ARC-Hermes tool envelope
  -> repo/media/runtime operation
  -> binary manifest
  -> receipt
  -> ARC-Core registration
  -> replay/export bundle
Enter fullscreen mode Exit fullscreen mode

The first concrete use case would be a verified GitHub operator:

Find related repos.
Review project fit.
Draft useful issues or PRs.
Package the results.
Receipt every artifact.
Export a session bundle.
Enter fullscreen mode Exit fullscreen mode

That would make agentic development more useful, more auditable, and more portable.

Final Thought

To me, the future of agents is not just smarter chat.

It is agents that can work across real repos, real files, real media, real datasets, and real project memory while leaving behind proof that developers can inspect.

Hermes Agent provides the open agentic workflow side.

ARC provides the binary-first proof and memory side.

Together, that points toward a practical local-first agent runtime for developers who care about reproducibility, provenance, and control.

Top comments (0)