Muhammad Ahmad

Posted on May 18

The Agent That Writes Its Own Manual: A Deep Dive Into Hermes Agent's Self-Improving Architecture

#hermesagentchallenge #devchallenge #agents

Hermes Agent Challenge Submission

This is a submission for the Hermes Agent Challenge

The Agent That Writes Its Own Manual: A Deep Dive Into Hermes Agent's Self-Improving Architecture

Most AI agents have a memory problem — and not the kind you fix with a bigger context window.

You spend an afternoon building context. You explain your project structure, your deployment quirks, your naming conventions. The agent follows along beautifully. Then the session ends. You open a new one and it's back to square one. Blank slate. You're teaching the same class to the same student, every single day.

I've been running Hermes Agent — the open-source agent from Nous Research — for several weeks now. What pulled me in wasn't the feature list. It was one sentence from the README:

"The only agent with a built-in learning loop."

That's a bold claim. So I decided to actually pull apart how it works.

This post is that breakdown — how the learning architecture functions under the hood, what the memory system looks like, what changed in v0.13.0, and honestly, where it still falls short.

Why Most Agents Don't Actually Learn

Before getting into Hermes specifically, it's worth understanding the standard agent loop — because most frameworks follow the same pattern:

receive task → plan → execute → return result

Session ends. Nothing persists. Run the same type of task a hundred times, and on the 101st, the agent approaches it like a brand new problem. It has no memory of how it solved the previous 100, what worked, what failed, or what shortcuts it discovered.

This is fine for one-shot tasks. But for developers using an agent as an ongoing workflow partner — something that handles deploys, monitors logs, drafts weekly reports, maintains docs — that reset is a real productivity tax.

Hermes makes a different architectural bet.

The Closed Learning Loop

The core architectural decision in Hermes is what Nous Research calls the Reflective Phase — a step added after task execution, not before or during.

The standard Hermes loop looks like this:

receive task → plan → execute → [Reflective Phase] → return result

In the Reflective Phase, Hermes does something unusual: it analyzes its own performance on the task it just completed, extracts reusable patterns from how it solved it, and writes a skill file — a markdown document encoding the exact steps, tools, and decision logic it used.

The next time a similar task arrives, the agent doesn't reason from scratch. It queries its skill library first.

Here's what a generated skill file actually looks like in practice:

# Skill: Deploy to Staging via SSH

## Trigger
User asks to deploy, push to staging, or update the staging environment.

## Steps
1. SSH into staging-01 using stored credentials
2. Run `git pull origin main` in /var/www/app
3. Execute `npm run build && pm2 restart app`
4. Verify with `pm2 status` — confirm "online" state
5. Report deployment URL with commit hash

## Notes
- If pm2 reports "errored", run `pm2 logs app --lines 50` before escalating
- Database migrations run separately — never automatic on staging

These files follow the agentskills.io open standard — the same format used by Claude Code and Cursor — which means skills are portable between tools.

Over time, this library grows from Hermes' 40+ bundled skills to hundreds of domain-specific ones shaped entirely by your own workflows. The institutional knowledge compounds. This is the part that's genuinely hard to replicate by just adding a longer system prompt.

The Three-Layer Memory System

Hermes doesn't run on a single memory store. There are three distinct layers working together:

1. Working Memory — the standard LLM context window, cleared between sessions.

2. Episodic Memory — a searchable log of past conversations, stored locally in ~/.hermes/. Hermes can query this explicitly when it needs to recall how something was handled before. As of v0.10.0, this layer is fully pluggable — you can swap in vector stores, Honcho, or custom databases via a plugin interface.

3. Skill Memory — the generated skill library described above. This is the persistent, growing layer. Unlike episodic memory (which stores what happened), skill memory stores how to do things in an executable, reusable form.

Critical gotcha for new users: Persistent memory and skill generation are disabled by default. If you miss this in ~/.hermes/config.toml, Hermes behaves like a standard single-session agent. The "grows with you" promise doesn't materialize until you explicitly enable it.

# ~/.hermes/config.toml

[memory]
enabled           = true   # REQUIRED — disabled by default
skill_generation  = true   # enables the learning loop
user_modeling     = true   # builds a persistent model of your preferences

I didn't catch this on my first install. Ran it for three days wondering why nothing was carrying over. Read the config docs.

What v0.13.0 "Tenacity" Actually Changed

On May 7, 2026, Hermes shipped v0.13.0 with 864 commits and 295 contributors. Most coverage focused on the new Kanban board UI. That's not the interesting part.

Buried in the changelog are three new primitives that solve real production failure modes. They're easy to miss because they're disabled by default and have no dedicated blog post.

1. `/goal` — Persistent Goal Tracking

The problem it solves: Agent drift. Long multi-step tasks gradually lose sight of the original objective. By step 8 of a 12-step task, the agent is solving a subtask so intently it forgets the actual goal.

/goal lets you set a sticky objective that persists across the entire task execution. Hermes checks its progress against the stated goal at each decision point rather than only evaluating the immediate next step.

/goal deploy the new payment service to production with zero downtime

Once set, every tool call, every plan revision, every sub-task the agent spawns is evaluated against that anchor. It doesn't just ask "is this step correct?" — it asks "does this step move toward the goal?"

2. The Ralph Loop — Reflective Hallucination Prevention

The problem it solves: Silent corruption. The agent runs a command, gets ambiguous output, and assumes it succeeded. Or it fabricates a plausible-sounding result when the actual output was empty. This is the failure mode that causes the most downstream damage in production workflows.

The Ralph Loop adds a reflection step after each tool call. Before proceeding, Hermes explicitly asks itself:

Did this tool call actually return what I expected?
Is my interpretation of the output grounded in the actual output or in what I assumed the output would be?
Should I run a verification step before treating this as confirmed?

It's named after Ralph Waldo Emerson's idea of self-reliance applied to verification — the agent learning not to take its own assumptions at face value.

Enable it in config:

[agent]
ralph_loop = true

It adds latency. On long tasks, sometimes meaningfully. But on tasks where correctness matters — database operations, deployments, financial data processing — it's the difference between catching a silent failure and finding out about it an hour later.

3. Hallucination Gate

The problem it solves: The agent confidently invents file paths, variable names, API endpoints, or command outputs that don't exist.

The Hallucination Gate adds a lightweight verification pass before any factual claim about the environment gets used as input to the next step. If Hermes is about to reference a file path, it checks that the path actually exists before building the next action on top of it. If it's about to use an API endpoint, it validates the endpoint is reachable before constructing the full request.

These three primitives together address something important: the failure modes that matter most in production aren't the dramatic ones. Agents rarely fail by spectacularly hallucinating something obviously wrong. They fail by quietly assuming something is true when it isn't, then building five correct steps on top of a false premise.

Getting Started: The Short Version

Hermes runs on Linux, macOS, and WSL2. One command installs everything — no prerequisites, no manual dependency management:

curl -fsSL https://raw.githubusercontent.com/NousResearch/hermes-agent/main/scripts/install.sh | bash

Then:

hermes setup   # interactive wizard — connects your LLM provider
hermes         # start the CLI

For model choice, Hermes works with Nous Portal (native OAuth), OpenRouter (200+ models), OpenAI, Anthropic, local vLLM, or any OpenAI-compatible endpoint. Switch providers with hermes model — no code changes, no reconfiguration.

For messaging platform integration (Telegram, Discord, Slack, WhatsApp, and 15+ others):

hermes gateway setup
hermes gateway install   # runs as a systemd service

The cost profile is genuinely low. On a $5 VPS running budget models via OpenRouter, you're looking at approximately $0.30 per complex task. On serverless infrastructure, idle costs are near zero — you only pay when the agent is actively reasoning.

The Curator: v0.12's Other Major Addition

Before v0.13.0, the skill library had a long-term problem: skill rot. Skills written six months ago for a workflow that no longer exists, skills that were specific to a one-off task but got written as general-purpose, skills that became redundant after a newer, better skill was created.

v0.12 introduced the Curator — an autonomous background process that monitors skill library health. It tracks which skills are being used, which are being skipped in favor of ad-hoc reasoning, and which are producing errors. It surfaces suggestions for refactoring, consolidation, or deletion, and with permission can apply those changes automatically.

It uses rubric-based quality assessment rather than the ad-hoc feedback loops from earlier versions — meaning it evaluates each skill against a consistent set of criteria rather than just tracking whether the skill "worked" in a narrow sense.

Where It Still Falls Short

I want to be direct about this because most coverage glosses over it.

Cold start is real. A fresh Hermes install is not impressive. You won't see the compound learning benefits until you've built up a meaningful skill library — roughly 20+ domain-specific skills. That takes time and consistent usage. If you're evaluating Hermes on a one-day trial, you're evaluating the wrong thing.

The skill generation quality varies. Not every skill the agent writes for itself is good. Early in a deployment, before the Curator has had time to audit the library, you'll accumulate some low-quality auto-generated skills. The Hallucination Gate helps, but it doesn't eliminate this.

Multi-agent coordination is still maturing. The parallel sub-agents feature works well for independent workstreams. Cross-agent coordination on shared state is technically possible but requires manual plumbing. It's not as seamless as the docs imply.

WSL2/Windows caveats apply. The docs call native Windows support "experimental" and recommend WSL2. This is accurate. If you're on Windows, budget extra time for setup.

The Design Bet Worth Paying Attention To

What Hermes is really arguing — architecturally — is that the long-term value of an AI agent is in accumulated operational knowledge, not in real-time reasoning capability alone.

Every other agent framework optimizes primarily for the quality of the LLM doing the reasoning. Hermes optimizes for that too — it's model-agnostic and you can use the best available model — but it adds a second axis: the quality of the skill library built from your specific workflows.

Two developers can run Hermes on identical hardware with identical models. After six months, their agents will be meaningfully different, because their skill libraries will reflect six months of their individual workflows, preferences, and domain knowledge.

That's the claim worth taking seriously. Not "Hermes is better than X today." But: an agent that learns your specific operational context over time is qualitatively different from one that doesn't — regardless of which underlying model it runs.

Whether that bet pays off at scale, and whether the Curator can keep skill library quality high as it grows, is still an open question. But it's the right question to be asking.

If you're exploring Hermes Agent:

GitHub repo — start here, read the config docs before you run it
Official documentation — the quick start is accurate
agentskills.io — community skills, the open standard reference
r/hermesagent — active community, good place for operational questions

The v0.13.0 changelog is worth reading in full if you're already running it — the three primitives above are documented there, just not highlighted.

DEV Community

The Agent That Writes Its Own Manual: A Deep Dive Into Hermes Agent's Self-Improving Architecture

The Agent That Writes Its Own Manual: A Deep Dive Into Hermes Agent's Self-Improving Architecture

Why Most Agents Don't Actually Learn

The Closed Learning Loop

The Three-Layer Memory System

What v0.13.0 "Tenacity" Actually Changed

1. `/goal` — Persistent Goal Tracking

2. The Ralph Loop — Reflective Hallucination Prevention

3. Hallucination Gate

Getting Started: The Short Version

The Curator: v0.12's Other Major Addition

Where It Still Falls Short

The Design Bet Worth Paying Attention To

Top comments (0)

The Agent That Writes Its Own Manual: A Deep Dive Into Hermes Agent's Self-Improving Architecture

Why Most Agents Don't Actually Learn

The Closed Learning Loop

The Three-Layer Memory System

What v0.13.0 "Tenacity" Actually Changed

1. /goal — Persistent Goal Tracking

2. The Ralph Loop — Reflective Hallucination Prevention

3. Hallucination Gate

Getting Started: The Short Version

The Curator: v0.12's Other Major Addition

Where It Still Falls Short

The Design Bet Worth Paying Attention To

1. `/goal` — Persistent Goal Tracking