Levash0v

Posted on May 30

I Turned Hermes Agent into a Verifiable Agent Operating System

#hermesagentchallenge #devchallenge #agents #ai

Hermes Agent Challenge Submission: Build With Hermes Agent

What I Built

I did not build another chatbot.

I built a memory hygiene system around Hermes Agent: a workflow that tells the agent what to remember, what to turn into a skill, what to write into the repo, what to track in a task system, and what to leave behind.

The core idea is simple:

Agent memory is not one bucket.

Long-running agent work breaks when chat history, global memory, project state, reusable procedures, task ownership, and public side effects are treated as the same thing. They have different lifetimes. Putting all of them into “memory” creates drift.

So I built a small repo-local harness and operating discipline around Hermes Agent.

Hermes Agent is the local agent runtime I use for tool-calling work: terminal commands, file edits, browser/search workflows, persistent memory, reusable skills, scheduled jobs, and gateway integrations.

Multica is the external task layer I use for active work ownership and routing. In this setup, it replaced local Hermes Kanban as the source of truth for current tasks.

The system separates agent work into durable layers:

Layer	Responsibility
Hermes memory	Stable facts only
Hermes skills	Reusable procedures
Repo files	Project-local state and conventions
Multica	Task ownership and routing
Session search	Historical recall
Human approval	External side effects

The operating rule:

Memory for stable facts. Skills for reusable procedures. Repos for project state. Multica for task ownership. Session search for history. Human approval for side effects.

That turns Hermes from a chat assistant into a small agent operating layer.

Before / after

Before	After
Task state buried in chat	Task state lives in Multica
Reusable fixes lost in history	Reusable fixes become Hermes skills
Project rules mixed with global memory	Project rules live in `AGENTS.md` / `CLAUDE.md`
Agent repeats setup mistakes	Skills + repo harness reduce rediscovery
Local Kanban drifts from reality	Multica becomes the source of truth
Claims of completion are implicit	Evidence reports verify artifacts

The important shift is not more memory. It is routing each kind of state to the layer with the right durability.

The lowest durable layer rule

The key rule is:

Store information in the lowest layer that is durable enough for its expected lifetime.

Examples:

A stable user preference goes to Hermes memory.
A repeated procedure becomes a Hermes skill.
A project convention goes to AGENTS.md or CLAUDE.md.
Current task ownership belongs in Multica.
Historical context can stay in session search.
Public side effects require human approval.

This keeps memory useful instead of turning it into a junk drawer.

Demo

The architecture is intentionally small:

Multica task layer ←→ Hermes Agent ←→ Session search
                         ↓
                  Evidence Loop
        Intent → Action → Artifact → Verification → Report
                         ↓
              Human Approval Gate, if external
                         ↓
              publish / send / deploy / push

Durable layers:
- Hermes memory: stable facts only
- Hermes skills: reusable procedures
- Repo harness: project-local state

Hermes routes work through durable layers, then through an evidence loop. External side effects stop at the Human Approval Gate.

The concrete task was:

Create a repeatable convention for repo-local agent state, verify it, and keep task ownership outside chat.

The workflow:

A Multica issue defined the work.
Hermes recovered prior context through session search.
Hermes wrote the repo-local harness files:

AGENTS.md
CLAUDE.md
agent-progress.md
AGENT_LESSONS.md
session-handoff.md
feature_list.json
.agent-harness/validate_feature_list.py
1. Reusable procedure was promoted into Hermes skills.
2. Project-specific state stayed in the repository.
3. Active ownership stayed in Multica.
4. The harness was verified with tests and a validator command.

Task ownership in Multica: repo harness setup and validator test suite are done, while skill promotion and the DEV.to submission are still in progress.

The point is not that an agent edited files. The point is that the workflow forced each kind of information into the correct durability layer.

Evidence loop

The workflow uses this loop:

Intent -> Tool action -> Artifact -> Verification -> Evidence report -> Approval if external

Examples:

A repo update is verified by reading the changed file or checking the diff.
A harness update is verified by running tests.
A task completion is verified by a Multica comment or linked artifact.
A reusable procedure is verified by a committed Hermes skill.
A public action, like pushing a repo or publishing a post, stops at the approval gate.

This changes the agent contract from “trust me, I did it” to “here is the artifact and here is how it was verified.”

Code

Repository: https://github.com/Levash0v/verifiable-agent-harness

The public artifact is intentionally small, but it has a real project shape:

templates/      AGENTS.md, CLAUDE.md, handoff files
examples/       feature_list.example.json
agent_harness/  validator
tests/          validator tests
docs/           evidence loop, diagram, article draft

Each repository gets a small operating contract.

Excerpt from templates/AGENTS.md:

# Agent Guide

This repository uses a repo-local agent harness. Treat these files as source of truth for agent work state:

- feature_list.json
- agent-progress.md
- session-handoff.md
- AGENT_LESSONS.md

## Startup protocol

1. Run `pwd`.
2. Run `git status --short --branch`.
3. Read this file and `CLAUDE.md` if present.
4. Read `feature_list.json`, `agent-progress.md`, `session-handoff.md`, and `AGENT_LESSONS.md`.
5. Run `python .agent-harness/validate_feature_list.py`.
6. Pick one unfinished feature only.

That contract means the next agent session does not need to reconstruct the project from chat. The repository carries its own operating state: current features, verified progress, and repo-specific lessons.

The repo is not only documentation. It has an executable validator path:

python3 -m agent_harness validate examples/feature_list.example.json
python3 -m unittest discover -s tests -v

The harness is executable: the feature list validator passes, and the test suite verifies both valid and invalid project-state files.

This is deliberately small. The goal is to make the convention executable and testable instead of purely narrative.

My Tech Stack

Hermes Agent — agent runtime, memory, skills, tools, session search, scheduled jobs, and gateways
Multica — active task ownership and routing
Python — repo harness validator
unittest — validation tests
Markdown — repo-local operating contracts
JSON — machine-readable feature state
Git / GitHub — versioned repo artifacts and proof trail
DEV.to — publication and challenge submission

How I Used Hermes Agent

Hermes Agent powered the project as the orchestrator and verifier.

I used Hermes memory only for stable facts: user preferences, environment facts, and long-lived workflow conventions.

I used Hermes skills as procedural memory: repo harness setup, publication workflow, clean-state checks, task handoff patterns, and debugging or routing procedures discovered during work.

I used session search for historical recall: prior decisions, old implementation attempts, and context reconstruction before updating a repo or task.

I used Hermes tools for concrete work: reading and editing files, running terminal commands, checking diffs, executing validators, and verifying test output.

Repo-local state lives in files such as:

AGENTS.md
CLAUDE.md
feature_list.json
agent-progress.md
AGENT_LESSONS.md
session-handoff.md
clean-state-checklist.md
evaluator-rubric.md
.agent-harness/validate_feature_list.py

Multica handles active task ownership and routing: what is being worked on, who owns it, what needs approval, and what result was reported back.

External side effects remain gated: GitHub pushes, DEV.to publishing, social posts, Discord messages, infrastructure deploys, and irreversible task comments.

Hermes can draft, edit, verify, and stage. The human approves the public action.

The biggest change was operating discipline:

Hermes stopped using global memory as a scratchpad.
Repeated fixes became skills instead of disappearing into chat history.
Project rules moved into repo-local files.
Task ownership moved from local Kanban to Multica.
Completion claims became evidence-backed reports.

This made the system less magical and more reliable.

Limitations

This is not a full agent platform by itself.

The harness validates conventions, not semantic correctness.
Multica is an external coordination layer, not required by the repo template.
Human approval is still required for external effects.
Evidence quality depends on disciplined updates to files, tasks, and skills.

That is intentional. The system is boring at the boundaries because those boundaries are where long-running agents usually fail.

Next steps

Next, I want to add more validators, richer handoff examples for Hermes / Claude Code / Codex, a stricter approval protocol, and more examples of skill promotion from repeated work.

The lesson I took from this build is simple:

Agent memory should be designed like infrastructure, not treated like a magic notebook.

Hermes gave me the primitives: memory, skills, tools, session search, scheduled jobs, and gateways.

The harness turns those primitives into an operating discipline.

Top comments (5)

Mykola Kondratiuk • Jun 8

the memory-type split makes sense but I’d push back on who decides the categories at runtime. if the agent routes its own work to "global memory" vs "project state", that routing call is doing load-bearing work with no audit trail.

Harjot Singh • May 31

"Verifiable agent operating system" is an ambitious and right-headed framing. The OS metaphor works because what agents need is the same thing an OS provides, process isolation, resource limits, scheduling, and a permission model, but for non-deterministic actors. The "verifiable" part is the hard, important bit: an agent OS where you can't prove what ran and why is just a fancier black box. Auditability and reproducibility are what make it trustworthy enough to actually run real things. That verifiable-execution-layer idea is core to how I think about Moonshift. What does "verifiable" concretely mean in yours, a signed action log, replayable runs, or formal checks on each step?

Levash0v • May 31

Thanks — the OS framing wasn't accidental. Agents fail the same way processes fail without an OS: unbounded resource consumption, no isolation, no audit trail. The metaphor earns its keep.
On "verifiable" concretely: in the current implementation it's structured evidence, not formal proofs. Every action exits through a fixed loop — Intent → Tool Action → Artifact → Verification → Evidence Report. The artifact is the proof: a file, a diff, a log that exists independently of the agent's claim about what it did. Verification is a separate step that checks the artifact against the intent before the loop closes. Nothing self-reports.
External side effects (push, publish, deploy) require an explicit Human Approval Gate before execution — so the agent can't exit the loop silently on actions with real-world consequences.
It's not replayable runs or formal checks yet — closer to a structured audit log with mandatory checkpoints. The gap you're pointing at (signed logs, replayability) is real and on the roadmap.
Curious what "verifiable" looks like in Moonshift — are you doing execution traces, or something closer to a constraint layer at the tool-call level?

Harjot Singh • May 31

In Moonshift verifiable is closer to your structured-evidence model than to formal proofs, with a constraint layer at the tool-call boundary as the other half. Two things together. First, artifacts as proof, exactly your point: every phase emits a real artifact (a file, a diff, a passing test run, a deployed URL) that exists independently of the agent's claim, and a separate verify step checks the artifact against the phase's intent before the loop closes. Nothing self-reports. Second, the constraint layer: irreversible actions (spend, deploy, publish) can't be executed by the agent freely, they pass through a tool boundary that enforces what's allowed and gates the dangerous ones, so safety is structural rather than the agent promising to behave. So: execution traces plus mandatory artifact-verification checkpoints, with a hard constraint layer on the tool calls. Same gap you named is on my list too, signed/replayable runs, right now it's a structured audit log with enforced checkpoints, not cryptographic proof. The Intent to Tool Action to Artifact to Verification to Evidence loop you described is almost exactly the phase contract I run. Are you enforcing the tool-call constraints at the boundary, or relying on the verify step to catch a bad action after it executes?

Levash0v • Jun 5

Same architecture, arrived independently — that's a good signal we're solving the real problem.
Honest answer: currently the gate is in the verify step, not at the tool boundary. It stops the action, but structurally yours is tighter — the boundary refuses the call before it fires, not after it's staged.
Moving toward boundary-level enforcement. Verify handles artifact integrity well; irreversible actions need the wall earlier.
Static whitelist per phase, or does the agent earn broader tool access as the run progresses?