Todd Sullivan

Posted on Jun 19

I Replaced My Old AI Agent Stack with Hermes — and It Finally Feels Like Infrastructure

#ai #automation #llm #devops

I recently moved my personal AI automation setup over to Hermes.

Not because I wanted another chatbot. I already had one of those.

The useful shift was that Hermes feels less like “an AI you talk to” and more like a small operating layer for personal automation: persistent memory, scheduled jobs, skills, gateway routing, tool access, and enough system awareness to actually do maintenance work instead of just narrating it.

That distinction matters.

Most AI assistant demos stop at the prompt. Real usage starts after the third week, when you need the assistant to remember your preferences, avoid stale work contexts, run scheduled checks, post to the right channel, and not accidentally resurrect some old service you migrated away from months ago.

That last one is not theoretical.

After the migration I noticed stale references to my old agent stack still appearing in messages. So I asked Hermes to verify the machine state rather than just reassure me.

It checked:

the live launchd services
the running gateway process
old LaunchAgent plists
cron job delivery metadata
process names and command lines
stale references in config/state

The important part: it found the actual answer, not just the obvious one.

Hermes itself was running cleanly through launchd as ai.hermes.gateway. The old gateway labels were disabled. But there were still two leftover MCP-related LaunchAgents from the previous setup:

com.clawmcp.server
com.clawmcp.tunnel

One of them was still using an old SSH key path from that previous world. Not catastrophic, but exactly the kind of residue that turns into weird behaviour later.

Hermes stopped them, disabled the launchd labels, moved the plist files into a timestamped disabled folder, and re-verified that only the Hermes gateway remained active.

That is the bit I care about.

A useful AI agent is not just a text generator. It needs to be able to close the loop:

inspect the real system
distinguish stale text from live processes
make a reversible change
verify the change actually worked
remember the durable lesson for next time

Hermes also gives me a cleaner mental model than my previous setup.

Skills are reusable procedures. Memory is for stable facts. Cron jobs are durable scheduled agents. The gateway lets the same assistant operate from Discord, email, terminal, or wherever I am. Profiles keep different contexts isolated. Toolsets make capabilities explicit.

That sounds boring, which is why I like it.

Boring infrastructure is what makes automation trustworthy.

The more I use AI agents, the less interested I am in “look what this model can say” and the more interested I am in whether the surrounding system can behave predictably over time.

Can it run every morning and only notify me when something needs attention?
Can it publish an article without leaking secrets?
Can it check the actual machine state before making a claim?
Can it avoid old work contexts I no longer want referenced?
Can it improve its own operating instructions when a workflow changes?

That is where the productivity gain is.

The model matters, obviously. But the scaffolding around the model matters more than people admit: state, tools, memory, permissions, scheduling, logs, profiles, and recovery paths.

Hermes is starting to feel like the right layer for that: not a magic assistant, not a toy, not a demo — just a practical agent runtime I can keep shaping around my own workflows.

That is probably the highest compliment I can give an automation system:

I am starting to trust it with boring jobs.

Top comments (2)

Todd Sullivan • Jun 30

Thanks Mateo — yeah, exactly. The model is the fun bit to demo, but the useful bit is usually all the unglamorous stuff around it: permissions, state, scheduling, recovery, and being able to prove what actually happened.

“Verify the machine state” has become my personal line in the sand for whether an agent is doing work or just narrating. Appreciate you reading it.

Mateo Ruiz • Jun 19

This resonates with something a lot of teams discover as they move beyond demos: the model is rarely the bottleneck. The real challenge is everything around it memory, permissions, scheduling, state management, tooling, verification, and recovery paths.

I especially liked the emphasis on "verify the machine state rather than just reassure me." That's a big shift from conversational AI to operational AI. Systems become genuinely useful when they can inspect reality, take controlled action, and confirm the result instead of simply generating plausible explanations.

We've seen the same pattern while working on AI agent infrastructure at IT Path Solutions. The projects that survive long-term are usually the ones that treat agents as part of a broader system with guardrails, observability, and predictable workflows not just a model behind a chat interface.

"Boring infrastructure is what makes automation trustworthy" is probably the most important line in the article.