Nawi

Posted on Jun 10

Running Hermes Agent in the Cloud Safely: A Reader's Guide to Their Trust Model

#agents #ai #cloud #security

You can run Hermes Agent on a $5 VPS. You can run it on a GPU cluster. You can run it serverless on Daytona or Modal so it hibernates when idle and costs almost nothing between sessions. You can talk to it from Telegram while it works on a cloud VM. That flexibility is the headline feature - and it's also the security question this post is about.

NousResearch publishes a detailed security policy for Hermes Agent. It is unusually clear about what the project treats as load-bearing and what it does not. If you operate Hermes in the cloud, read it first; this post is the operator-friendly companion, not a replacement.

What Hermes already gives you

Three things in the box are worth knowing about up front, because they shape the rest of the deployment story.

1. A trust model that names the boundary. Hermes Agent's policy says, in so many words: the only security boundary against an adversarial LLM is the operating system. Not the approval gate. Not output redaction. Not the Skills Guard. Those are useful - they catch the cooperative-mode mistakes that account for most real-world incidents - but they are heuristics operating on an attacker-influenced string, and the project does not pretend they are containment.

That's an honest framing, and it determines what "safely" means: you are responsible for choosing an OS-level isolation posture that matches the trust you've extended to the content flowing through the agent.

Worth separating two questions that often get blurred. Containment asks what a compromised agent can damage - and on that axis the policy is right: the OS is the boundary. Visibility asks something the policy doesn't address: what the agent actually did inside the boundary - what it touched, what it cost, what left through a channel you allowed. Containment can be perfect and you can still be blind on the second axis. The rest of this guide covers containment first, because it's load-bearing, then comes back to visibility.

2. Seven terminal backends. The terminal() tool - the one through which shell commands run - is pluggable. The supported backends are local, docker, ssh, singularity, modal, daytona, and vercel_sandbox. Switching from the default local backend to a containerized or remote one moves the agent's shell out of your host. Pick deliberately; the default does not isolate anything.

3. A non-trivial gateway. Hermes can be talked to from Telegram, Discord, Slack, WhatsApp, Signal, Matrix, Email, SMS, and a dozen other platforms. Each adapter is a network-exposed surface. Each one needs an allowlist. The project's policy treats every adapter without an allowlist as a code bug, but operator configuration is required - adapters do not magically lock themselves to your account.

With those three points in mind, the rest of this guide is the practical follow-through.

Step 1 - Pick your isolation posture

Hermes Agent's policy describes two postures. They protect against different things, and operators should choose one explicitly rather than fall into the default.

Terminal-backend isolation swaps the local backend for a sandboxed one - Docker, an SSH'd container, Modal, Daytona, Vercel Sandbox. Shell and file tools (terminal, read_file, write_file, patch) execute inside the sandbox. What this confines: anything the agent does through the shell contract. What it does not confine: the code-execution tool (which spawns a host subprocess), MCP server subprocesses, plugins, hooks, and skill loading. Those all run on the host, either inside the agent's own Python process or as children of it.

Whole-process wrapping runs the entire agent process tree - shell, code-execution, MCP, plugins, hooks, skills - inside a single sandbox. Hermes supports this two ways:

The project's own Docker image and Compose setup. Lighter weight; standard container with operator-configured mounts and network policy.
NVIDIA OpenShell. Per-session sandboxes with declarative policy across filesystem, network L7 egress, process/syscall layers, and inference routing. Credentials live in a Provider store and never touch the sandbox filesystem.

The decision rule from the policy: if the content flowing into the agent comes from surfaces you do not control - the open web, inbound email, multi-user Slack channels, untrusted MCP servers - you want whole-process wrapping. If the operator is trusted and the concern is just LLM-emitted destructive shell, terminal-backend isolation is the supported posture.

In practice: most cloud deployments fall into the first category. If your Hermes can be messaged from Telegram, you are accepting input from a surface you don't fully control. Plan accordingly.
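The decision rule is small enough to write down as a check. A minimal sketch in Python, where the surface labels and the function name are invented for illustration and are not part of Hermes:

```python
from enum import Enum


class Posture(Enum):
    TERMINAL_BACKEND = "terminal-backend isolation"
    WHOLE_PROCESS = "whole-process wrapping"


# Hypothetical labels for input surfaces the operator does not control.
UNTRUSTED_SURFACES = frozenset(
    {"open_web", "inbound_email", "multi_user_chat", "messaging_gateway", "untrusted_mcp"}
)


def choose_posture(input_surfaces: set) -> Posture:
    """Pick whole-process wrapping as soon as one input surface is untrusted."""
    if set(input_surfaces) & UNTRUSTED_SURFACES:
        return Posture.WHOLE_PROCESS
    return Posture.TERMINAL_BACKEND


if __name__ == "__main__":
    print(choose_posture({"operator_cli"}).value)
    print(choose_posture({"operator_cli", "messaging_gateway"}).value)
```

The point of writing it this way is that a single untrusted surface decides the posture; adding trusted surfaces never moves a deployment back to the lighter option.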

Step 2 - Lock down the gateway

If you have any gateway adapter enabled, this is your largest remote attack surface. Three rules apply uniformly across all of them.

Allowlist. Every adapter must refuse to dispatch agent work, resolve approvals, or relay output until a caller allowlist is configured. The policy treats fail-open behavior as a bug - but the operator still has to set the list. For Telegram, that means the chat IDs (or user IDs) that are authorized to talk to your bot. For Slack, the workspace and user IDs. For email, the From addresses. Until you set this, anyone who can reach the adapter - by finding your bot's username, or by posting in a Slack channel the bot listens to - has the same access you do.

Session IDs are routing handles, not authorization. Knowing another caller's session ID grants no access. Authorization is re-checked against the allowlist on every call. Do not rely on session IDs or session URLs staying secret, and do not build authorization checks on top of them.

Treat token leakage as account compromise. Telegram bot tokens, Slack tokens, Discord webhook URLs - if any of these leaks, an attacker with the token has whatever the bot can do. Rotate proactively, store in a secrets manager (not in the repo, not in your shell history), and audit access logs.
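The first two rules can be sketched together. This is an illustration of the fail-closed pattern, not Hermes' adapter code; the class and method names are hypothetical:

```python
from dataclasses import dataclass
from typing import FrozenSet, Optional


@dataclass(frozen=True)
class AdapterAuth:
    """Hypothetical per-adapter authorization check (not a Hermes API)."""

    allowed_callers: FrozenSet[str] = frozenset()

    def is_authorized(self, caller_id: str, session_id: Optional[str] = None) -> bool:
        # Fail closed: with no allowlist configured, nobody is authorized.
        if not self.allowed_callers:
            return False
        # session_id is accepted only for routing and is deliberately ignored
        # here, so knowing someone else's session ID grants nothing.
        return caller_id in self.allowed_callers


if __name__ == "__main__":
    auth = AdapterAuth(frozenset({"123456789"}))
    print(auth.is_authorized("123456789"))
    print(auth.is_authorized("999", session_id="someone-elses-session"))
```

Calling `is_authorized` on every inbound message, rather than once per session, is what keeps a removed caller from riding an old session.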

For network-exposed HTTP surfaces (the dashboard plugin, the API server, the kanban plugin), the defaults bind to loopback. Switching them to --host 0.0.0.0 is a break-glass operator decision. If you make it, you own the public-internet hardening - TLS, auth, rate limiting - none of which Hermes provides for you on that path.
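If you template your own launch scripts, a small guard can make the break-glass decision explicit. A sketch using only the standard library; the function name is hypothetical, and it treats any hostname it cannot parse as exposed:

```python
import ipaddress


def is_loopback_bind(host: str) -> bool:
    """Return True only when the bind address stays on the local machine."""
    if host == "localhost":
        return True
    try:
        return ipaddress.ip_address(host).is_loopback
    except ValueError:
        # Not an IP literal: assume it may resolve to a reachable interface.
        return False


if __name__ == "__main__":
    for candidate in ("127.0.0.1", "::1", "0.0.0.0", "10.0.0.5"):
        print(candidate, is_loopback_bind(candidate))
```

A deployment script could refuse to start a non-loopback bind unless an explicit override is set, which turns an accidental exposure into a deliberate one.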

Step 3 - Match the cloud environment to the posture

The cloud-deployment specifics that Hermes' own docs leave to the operator:

On AWS or any VPC-based cloud:

Run Hermes in a private subnet. The agent does not need to be reachable from the public internet for any of the messaging gateways to work - those poll outbound or accept webhooks via a separate ingress that you control.
Outbound security group: deny by default. Allowlist (a) your secrets manager endpoint, (b) the inference provider's endpoint (whichever LLM you've selected via hermes model), (c) the messaging platforms in use, (d) your logging endpoint. Nothing else.
Inbound security group: deny by default. If a gateway uses inbound webhooks (Slack events API, Telegram webhook mode), put them behind an ALB with WAF and explicit path-based routing.
IAM role attached to the EC2 / Fargate task: scoped to specific resources. The agent does not need *:* on anything. If it does file work, scope to specific S3 buckets. If it queries databases, scope to specific Secrets Manager secrets and specific RDS hosts.
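The IAM point can be linted before the role is ever attached. A rough sketch that reads a standard IAM policy document and flags Allow statements with a wildcard action or resource; the function names are made up, and it does not evaluate conditions, NotAction, or NotResource:

```python
import json


def _as_list(value):
    """IAM allows a single string or a list for Action and Resource."""
    if value is None:
        return []
    return value if isinstance(value, list) else [value]


def find_wildcard_statements(policy_json: str) -> list:
    """Return the Allow statements that grant a wildcard action or resource."""
    policy = json.loads(policy_json)
    findings = []
    for statement in _as_list(policy.get("Statement")):
        if statement.get("Effect") != "Allow":
            continue
        actions = _as_list(statement.get("Action"))
        resources = _as_list(statement.get("Resource"))
        broad_action = any(a in ("*", "*:*") or a.endswith(":*") for a in actions)
        broad_resource = "*" in resources
        if broad_action or broad_resource:
            findings.append(statement)
    return findings


if __name__ == "__main__":
    example = json.dumps({
        "Version": "2012-10-17",
        "Statement": [
            {"Effect": "Allow", "Action": "s3:GetObject",
             "Resource": "arn:aws:s3:::example-bucket/*"},
            {"Effect": "Allow", "Action": "*", "Resource": "*"},
        ],
    })
    print(len(find_wildcard_statements(example)))
```

The bucket name is a placeholder. A prefix wildcard such as `arn:aws:s3:::example-bucket/*` is left alone on purpose, since that is the scoped form the guidance above asks for.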

On Modal or Daytona (the serverless paths):

The "hibernates when idle" feature is genuinely great for cost. It is also a different security model from a persistent VM: cold-start state and warm-start state may both be reachable. Keep credentials in the Modal/Daytona secret store, not in the image.
Default network policy on Modal lets containers reach the internet broadly. If your workload doesn't need that, restrict it.

On a $5 VPS (the path most casual users will take):

Disable password auth on SSH. Use keys. This is table stakes, but password-based SSH login is also the most common compromise vector for a small VPS.
Run Hermes as a dedicated non-root user (the Docker image already does this - UID 10000). If you're running outside Docker, create the user explicitly.
Put ufw (or equivalent) in front of everything. Allow SSH from your IP only. Deny everything else inbound.
If you need the dashboard or API surfaces externally, front them with Tailscale or WireGuard rather than exposing them to the public internet. A free-tier Tailscale account covers a small operator deployment indefinitely.
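The SSH item is easy to verify from a script. A simplified sketch that reads the text of an sshd_config file and reports whether password login is still on; it ignores Include files and Match blocks, so treat `sshd -T` as the authoritative view of the effective configuration:

```python
def password_auth_enabled(sshd_config_text: str) -> bool:
    """Return True unless the first PasswordAuthentication directive says 'no'."""
    for raw_line in sshd_config_text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue
        parts = line.split()
        if parts[0].lower() == "passwordauthentication" and len(parts) >= 2:
            # sshd keeps the first value it reads for a keyword.
            return parts[1].lower() != "no"
    # No directive present: OpenSSH allows password auth by default.
    return True


if __name__ == "__main__":
    sample = "# PasswordAuthentication no\nPermitRootLogin no\n"
    print(password_auth_enabled(sample))
```

The commented-out line in the sample is the common trap: the file looks hardened, but the directive is inactive and the default applies.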

Step 4 - Skills and plugins are code you're installing

Hermes' Skills system is one of the things that makes it powerful: the agent creates skills from experience, improves them during use, and can install community-contributed skills from external repositories. Plugins extend the architecture further - they load into the agent's Python process and run with full agent privileges.

This is - and the project says so explicitly - operator review surface. Skills Guard exists as a review aid that scans installable content for injection patterns. It is not a boundary. The supported workflow for third-party skills is:

Read the actual Python and shell scripts in the skill, not just SKILL.md. Skills execute arbitrary Python at import time.
Treat plugin installs the same way you treat installing any package from PyPI on a production system. Pin versions. Review the source. Treat the install audit log as evidence of what you've actually run.
If you wouldn't install the underlying code on the system without review, don't install the wrapping skill or plugin either.
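To make "read the actual Python" a little faster, you can list the statements a file would run the moment it is imported. A triage sketch built on the standard `ast` module; it is a reading aid in the same spirit as Skills Guard, not a safety check, and the function name is hypothetical:

```python
import ast


def import_time_statements(source: str) -> list:
    """List (line, node type) for top-level statements that run on import."""
    tree = ast.parse(source)
    # Definitions and imports are skipped to cut noise. They are NOT proof of
    # safety: imports run other modules, and class bodies and decorators
    # also execute at import time.
    skipped = (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef,
               ast.Import, ast.ImportFrom)
    findings = []
    for node in tree.body:
        if isinstance(node, skipped):
            continue
        is_docstring = (
            isinstance(node, ast.Expr)
            and isinstance(node.value, ast.Constant)
            and isinstance(node.value.value, str)
        )
        if is_docstring:
            continue
        findings.append((node.lineno, type(node).__name__))
    return findings


if __name__ == "__main__":
    skill_source = 'import os\n\ndef helper():\n    pass\n\nos.system("echo hi")\nX = 1\n'
    print(import_time_statements(skill_source))
```

Anything this reports deserves a close look first, and then the function bodies still need reading, because the agent will call them later with full privileges.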

For credential handling, Hermes filters the environment passed to shell, MCP, and code-execution subprocesses (provider API keys and gateway tokens are stripped by default), but anything running inside the agent process - every skill, every plugin, every hook handler - can read whatever the agent itself can read. The mitigation is review-before-install. There is no in-process containment of plugin code, and the policy is explicit about that.
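The distinction between filtered subprocesses and unfiltered in-process code is easier to see in a few lines. This is a generic illustration of environment filtering, not Hermes' implementation; the marker list and function names are assumptions:

```python
import os
import subprocess
import sys

# Hypothetical substrings that mark a variable as sensitive.
SENSITIVE_MARKERS = ("API_KEY", "TOKEN", "SECRET", "PASSWORD")


def filtered_env(env) -> dict:
    """Copy an environment mapping without sensitive-looking variables."""
    return {
        name: value
        for name, value in env.items()
        if not any(marker in name.upper() for marker in SENSITIVE_MARKERS)
    }


def run_filtered(cmd, env=None) -> subprocess.CompletedProcess:
    """Run a child process that only sees the filtered environment."""
    base = os.environ if env is None else env
    return subprocess.run(cmd, env=filtered_env(base),
                          capture_output=True, text=True, check=False)


if __name__ == "__main__":
    parent_env = {**os.environ, "DEMO_TOKEN": "s3cret"}
    child = [sys.executable, "-c", "import os; print(os.environ.get('DEMO_TOKEN'))"]
    print(run_filtered(child, env=parent_env).stdout.strip())
```

The child never sees `DEMO_TOKEN`, but any code running in the parent can still read the parent's `os.environ` directly. That asymmetry is why review-before-install is the mitigation for skills and plugins.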

Step 5 - Add an in-process gate, knowing what it is

OS-level isolation is the boundary. In-process heuristics catch most of the day-to-day mistakes - the agent generating a rm -rf because of a confusing prompt, the agent reaching for git push --force because the LLM concluded that was the simplest path. These mistakes are common, they are usually not adversarial, and a sharper in-process gate prevents them from reaching the boundary at all.

Hermes' built-in approval gate does some of this. It detects common destructive shell patterns and asks the operator before execution. The policy is upfront that it's a denylist over shell strings and structurally incomplete - "the gate catches cooperative-mode mistakes, not adversarial output."

If you want the in-process gate to be sharper, you can layer one on. This is where Node9 fits in a Hermes deployment: an AST-based policy engine that parses a command's structure rather than pattern-matching its raw text. The difference shows up in two failure modes. A text denylist fires on the string rm -rf sitting inside a commit message (a false positive that trains operators to click through warnings), and it misses a real destructive command split across a heredoc or command substitution (a false negative). A structural engine resolves what the command actually does before deciding. Node9 also runs a per-call inspection layer that flags credentials in outbound arguments, anomalously large payloads, and force-push patterns that simple denylists miss. No in-process gate wins the obfuscation arms race outright - that's exactly what the OS boundary is for - but moving from text-matching to structural parsing closes the gap that trips naive denylists. The approach is covered in detail in Why Regex Is Not Enough.
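A toy comparison shows the gap between matching text and looking at structure. This is not Node9's engine or Hermes' gate: the regex is a deliberately naive stand-in, and `shlex` only tokenizes a single command, which is far short of a real shell parser:

```python
import os
import re
import shlex

NAIVE_PATTERN = re.compile(r"rm\s+-rf")


def naive_gate(command: str) -> bool:
    """Flag a command if the raw text contains 'rm -rf'."""
    return NAIVE_PATTERN.search(command) is not None


def structural_gate(command: str) -> bool:
    """Flag a command only if the program being run is rm with -r and -f."""
    try:
        argv = shlex.split(command)
    except ValueError:
        return True  # Unbalanced quotes: fail closed and ask the operator.
    if not argv or os.path.basename(argv[0]) != "rm":
        return False
    options = [arg for arg in argv[1:] if arg.startswith("-")]
    short = "".join(arg[1:] for arg in options if not arg.startswith("--"))
    recursive = "r" in short or "R" in short or "--recursive" in options
    force = "f" in short or "--force" in options
    return recursive and force


if __name__ == "__main__":
    samples = [
        'git commit -m "stop using rm -rf in cleanup"',
        "rm -r -f /tmp/test",
        "r''m -rf /tmp/test",
    ]
    for sample in samples:
        print(naive_gate(sample), structural_gate(sample), sample)
```

On these three samples the naive gate is wrong every time and the structural one is right. Feed the structural gate a pipeline that decodes a base64 payload into `sh` and it stays silent, which is the arms-race caveat in practice.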

The same layer answers the visibility question the OS boundary can't. Isolation contains a leak; it doesn't tell you that a credential sat in a tool argument on its way to an allowlisted endpoint, or that the agent spent an hour looping on the same file. A per-call inspection layer reads that content and keeps an audit trail of what every tool call actually did - which is orthogonal to containment, not a substitute for it. Your egress security group will happily pass a secret it can't see inside.
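What a per-call inspection record might look like, as a sketch. The patterns, the size threshold, and the record shape are all assumptions made for illustration, and they do not describe any particular product's output:

```python
import json
import re

# AWS access key IDs start with AKIA (long-term) or ASIA (temporary).
AWS_ACCESS_KEY_ID = re.compile(r"\b(?:AKIA|ASIA)[0-9A-Z]{16}\b")
PRIVATE_KEY_HEADER = re.compile(r"-----BEGIN [A-Z ]*PRIVATE KEY-----")
MAX_PAYLOAD_BYTES = 64 * 1024  # Arbitrary threshold for this sketch.


def inspect_tool_call(tool_name: str, arguments: dict) -> dict:
    """Build an audit record for one tool call, with any findings attached."""
    serialized = json.dumps(arguments, sort_keys=True, default=str)
    size = len(serialized.encode("utf-8"))
    findings = []
    if AWS_ACCESS_KEY_ID.search(serialized):
        findings.append("aws_access_key_id")
    if PRIVATE_KEY_HEADER.search(serialized):
        findings.append("private_key")
    if size > MAX_PAYLOAD_BYTES:
        findings.append("large_payload")
    return {"tool": tool_name, "bytes": size, "findings": findings}


if __name__ == "__main__":
    record = inspect_tool_call(
        "http_post",
        {"url": "https://api.example.com", "body": "key=AKIAIOSFODNN7EXAMPLE"},
    )
    print(record)
```

The key in the example is the placeholder value AWS uses in its own documentation. The destination here could be fully allowlisted at the network layer and the finding would still be recorded, which is the sense in which visibility is orthogonal to containment.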

The honest framing - and the one Hermes' policy would approve of - is that this is belt and suspenders on top of OS isolation, not a replacement for it. It reduces the rate at which the boundary has to do the catching, which makes the whole system more livable. It isn't the containment boundary, and doesn't pretend to be - it's load-bearing on the other axis: whether the boundary ever had to act, and what slipped through the channels you opened on purpose.

A 30-minute audit checklist

If you have Hermes running in the cloud right now, walk through this before you close your laptop:

Isolation posture. Have you explicitly chosen one - terminal-backend or whole-process - and is the configuration consistent with the choice? "I'm not sure" is a finding.
Gateway allowlists. Does every enabled adapter (Telegram, Slack, Discord, email, etc.) have an explicit allowlist? Send a message from a non-allowlisted account - does it actually refuse, or does it accept and let the LLM-level gate sort it out?
Inbound security group / firewall. From an external IP, what's reachable? The answer should be either nothing or only your ALB / webhook endpoint, never the agent process directly.
Outbound security group. From inside the agent container, curl https://example.com. It should fail. If it succeeds, you don't have egress control.
Secret storage. Are your gateway tokens, LLM provider keys, and SSH keys in a secrets manager / OpenShell Provider store, or in a .env file on disk?
Skills installed. List every skill installed. For each: did someone read the Python before running it? Or did it just get added via a one-line command?
Approval gate behavior. Trigger a destructive shell command (rm -rf /tmp/test) through a chat. Does the gate intercept it? Now try the obfuscated form (r''m -rf /tmp/test, \rm -rf /tmp/test, echo cm0gLXJmIC90bXAvdGVzdA== | base64 -d | sh). Does the gate still intercept it? The honest answer is usually "not all of them" - that's why OS isolation is the boundary and the gate is a heuristic.

If any of these come back as "I'm not sure", that's the next thing to fix.
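For the approval-gate item, the three obfuscated probes can be generated for any test command instead of typed by hand. A small sketch; the function name is made up, and the probes should only ever point at a scratch path:

```python
import base64


def obfuscated_variants(command: str) -> list:
    """Return quote-split, backslash-prefixed, and base64-piped forms."""
    if not command:
        raise ValueError("command must not be empty")
    encoded = base64.b64encode(command.encode("utf-8")).decode("ascii")
    return [
        command[0] + "''" + command[1:],      # Empty quotes inside the program name.
        "\\" + command,                        # Leading backslash before the name.
        f"echo {encoded} | base64 -d | sh",    # Payload decoded at run time.
    ]


if __name__ == "__main__":
    for probe in obfuscated_variants("rm -rf /tmp/test"):
        print(probe)
```

For `rm -rf /tmp/test` this reproduces the three strings in the checklist, so you can confirm the base64 probe really decodes to the command you think you are testing.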

Closing

Hermes Agent is one of the more interesting things to land in the open-source agent space - model-agnostic, multi-platform, runs on infrastructure ranging from a Raspberry Pi to a GPU cluster. The flexibility is genuine. It also means the operator has more setup work than a typical desktop agent: gateway allowlists, isolation posture, terminal backend choice, skill review, credential scoping. None of it is exotic; all of it has to be done.

Node9 wires into Hermes through the same shell-hooks system this guide describes. One command auto-detects Hermes and routes every tool call through an AST-based gate and inspection layer before it executes:

npx node9-ai init

It appends a hooks: block to ~/.hermes/config.yaml and pre-populates the allowlist - the Step 5 in-process gate, live on your running agent, plus the per-call visibility and audit trail. Local-only, no telemetry. Apache 2.0.

(If you also run Claude Code, Codex, Gemini, or Cursor, npx node9-ai scan reads their existing session logs and reports the risky tool calls already in your history. Hermes history scanning is on the roadmap - Hermes integrates live via hooks rather than writing session files, which is why the live init path is the one to use there. If you hit a snag, drop a note in the issues.)

If you've deployed Hermes Agent in the cloud and have war stories worth sharing - failure modes, hardening tricks, edge cases that surprised you - drop them in the comments. The boundary work is the part nobody writes about.
