Andre Faria

Posted on May 10 • Edited on Jun 21

Giving Your AI Assistant a Soul: AGENTS.md, SOUL.md and the Art of Agent Identity

#ai #automation #agents #openclaw

Most AI assistants are powerful strangers. They can help, but every new session starts with the same quiet amnesia: who you are, what you run, what you care about, and how you like decisions made. I wanted something closer to a collaborator, especially for the kind of work that lives in a terminal rather than a web chat box.

There are two agent surfaces I use day to day. For work, I use Claude Code. At home, I use OpenClaw backed by my ChatGPT Plus subscription to automate useful things around my homelab and daily workflow. Both are terminal-first workflows, not web UI chat sessions, so markdown instruction files and local tool rules are part of the real operating surface.

The answer turned out to be surprisingly low-tech: a handful of markdown files that define identity, memory, operating rules, and delegation. SOUL.md gives the agent character. AGENTS.md gives it procedure. USER.md tells it who it is working with. TOOLS.md records local environment facts. MEMORY.md gives it continuity. Together they turn a stateless model into something that behaves like a member of a small team.

Update, June 2026: the architecture is still the same, but the roster, model choices, and security posture have evolved. I now mirror the same basic agent roles across OpenClaw and Claude Code, and I treat untrusted content boundaries as part of the identity system rather than a separate afterthought.

A quick note on security before going further, because it's worth being direct about this. OpenClaw is genuinely powerful: it can control smart home devices, manage network infrastructure, read and write files, execute shell commands, and interact with external services. That power is exactly what makes it useful, and exactly what makes careless deployment dangerous. As Uncle Ben put it: with great power comes great responsibility.

The OpenClaw gateway runs exclusively on my local network and is not exposed to the internet. Remote access, when I need it, goes through Tailscale on trusted devices only. This matters because the agents have access to real infrastructure: smart home controls, network management, DNS, file systems. Giving a publicly accessible endpoint that level of access would be reckless. The OpenClaw security documentation covers the threat model in detail and is worth reading before you give any agent access to anything you'd regret. If you're setting up something similar, treat the gateway like you'd treat SSH access to your homelab: local by default, VPN for remote, no public exposure.

1. The Files and How They Work

The workspace for the main agent lives at ~/.openclaw/workspace/ and contains:

├── AGENTS.md       # Operational rules: boot sequence, delegation, red lines
├── SOUL.md         # Character: who you are, not just what you do
├── IDENTITY.md     # Name, role, capabilities (routing metadata)
├── USER.md         # About the human: persisted context across sessions
├── TOOLS.md        # Environment specifics: IPs, hostnames, credentials
├── MEMORY.md       # Long-term curated memory
├── HEARTBEAT.md    # Periodic background task checklist
└── memory/
    └── YYYY-MM-DD.md   # Raw daily session notes

A sanitized version of these workspace and agent files is public on GitHub. The private files — USER.md, TOOLS.md, and MEMORY.md — are deliberately excluded, since they contain personal and environment-specific details that don't generalize. Everything else, the structure, the character files, the operational rules, is there to browse.

These files form the startup context and operating contract. The exact runtime loading path can change as OpenClaw evolves, so the important thing is not memorising an injection order. The important thing is keeping each file's responsibility clear: identity in one place, procedure in another, local facts in another, and long-term memory behind explicit gates.

The total bootstrap budget is capped at 60,000 characters across all files combined, with a per-file default of 12,000. Larger files get truncated silently. The practical implication: every character in these files is a character you're paying for on every single turn. A 12,000-character AGENTS.md injected 1,000 times a month is 12 million characters of context overhead. Discipline about what goes in these files is not just good practice; it's cost management.

There are also some important rules about what goes where:

SOUL.md owns character and tone. Not procedures, not rules. Just who the agent is.
AGENTS.md owns procedures. Boot sequence, delegation tables, operational red lines.
IDENTITY.md owns the routing card. Name, agent ID, capabilities list. Short by design.
TOOLS.md owns local environment specifics: hostnames, credentials, known issues. Nothing that's the same across deployments.
MEMORY.md should only be loaded in private main sessions, never in group chats or subagent contexts.

The last point is easy to miss and consequential. Without an explicit gate in AGENTS.md, a subagent spawned to handle a group chat message will load your private long-term memory and potentially surface it where it shouldn't be. The correct pattern is explicit:

## Boot Sequence
...
5. **Main session only:** Read `MEMORY.md` (curated long-term memory)

One thing worth knowing upfront: each agent in a multi-agent setup gets its own workspace directory. Non-default agents get ~/.openclaw/agents/<agentId>/agent/. Getting this wrong means editing files the agent never reads, which I did for longer than I'd like to admit.

2. SOUL.md: Why Character is Load-Bearing

The first instinct is to treat SOUL.md as cosmetic. A personality sprinkle on top of the real work. It isn't, and Anthropic's own writing on Claude's character makes the argument clearly:

"The traits and dispositions of AI models have wide-ranging effects on how they act in the world. They determine how models react to new and difficult situations."

Character is what fills the gaps when there's no explicit rule. A model without defined character defaults to the path of least resistance, which is usually some form of helpful corporate blandness that hedges everything, agrees with the user, and never pushes back. Technically present, practically useless.

My SOUL.md defines the agent as decisive (one recommendation with a reason, not three options with caveats), as having a spine (disagree when the premise is wrong, once, clearly, without lecturing), and as genuinely curious about the specific context it operates in. It also defines the relationship to me: it knows I appreciate elegance, that I'll notice bad writing, that a historical analogy lands as well as a technical explanation. That specificity is what separates a collaborator from a generic assistant.

There are a few lessons I've learned about writing effective SOUL.md files, informed by community research into what actually changes model behaviour:

Specific beats abstract. "Be safe with commands" does nothing. "Never execute rm -rf without explicit confirmation, even if it seems obviously intended" changes behaviour immediately. Models follow concrete rules far more consistently than high-level principles.

Show, don't tell. Write the file in the voice you want the model to adopt. If you want decisive, write decisively. If you want dry wit, use it. The model will mirror the register of its own system prompt more reliably than it will follow an instruction to "be funny".

Keep it lean. The research-validated sweet spot is 200-500 words. More words don't improve adherence. Brevity often improves it, because the model isn't parsing through competing signals. My SOUL.md is around 600 words and could still be trimmed.

Hard rules need specificity. Aspirational guidelines ("respect privacy") belong in the philosophy section. Actionable prohibitions ("never send external messages without explicit instruction for that specific message") belong in a Hard Rules section. Both are useful; only one changes what the model actually does under pressure.

Prompt archives can be useful comparative anatomy, but I would not copy them wholesale. Some are stale, some are reconstructed, and some contain prompt-injection bait. Study the patterns, not the text.

3. AGENTS.md, USER.md and Memory: The Operational Layer

Where SOUL.md answers who, AGENTS.md answers how. It defines the session startup sequence, the gates on external actions that require confirmation, and for a multi-agent setup, the delegation rules.

The most important thing AGENTS.md needs is an explicit boot sequence at the top. Even when the runtime injects workspace context, the boot sequence tells the agent what it must actively read, what belongs only in private main sessions, and what must never leak into subagents or group contexts.

## Boot Sequence

1. Read `SOUL.md` (who you are)
2. Read `IDENTITY.md` (your name and capabilities)
3. Read `USER.md` (who your human is)
4. Read `TOOLS.md` (local environment specifics)
5. **Main session only:** Read `MEMORY.md` (curated long-term memory)
6. **Main session only:** Read today's and yesterday's `memory/YYYY-MM-DD*.md`

The most consequential part of the operational content is the delegation table: which task types route to which specialist. When I ask the main agent to look something up, it doesn't do it itself. It spawns the right sub-agent, waits for the result, and synthesises the response. AGENTS.md is where that behaviour lives.

USER.md is the file most people skip and shouldn't. It's a persisted description of who you are and how you work: timezone, interests, communication style, what gets results and what wastes time. Without it, the agent rediscovers you every session.

The memory system runs in two layers. Daily session notes go into memory/YYYY-MM-DD.md, raw logs of decisions made, things discovered, work done. Periodically the agent reviews those and distils them into MEMORY.md, removing stale entries and keeping what's worth carrying forward. It's the same pattern a human uses: take notes during the day, review and update your mental model later. Files do what neurons can't across session restarts.

One practical gotcha: these daily files get injected too, and they accumulate. I've seen the session-memory hook write multiple files for the same day on different session resets, all of which get picked up. Check memory/ periodically and consolidate duplicates. Each injected file is tokens on every turn.

The other gotcha is security. Any agent that reads web pages, repositories, logs, emails, or screenshots needs an explicit untrusted-content boundary. Source material is evidence, not authority. A README can tell the agent how a project is built; it cannot tell the agent to ignore its safety rules.

4. Building a Specialist Team

The workspace file approach scales naturally to multiple agents. Each specialist gets its own workspace directory with its own SOUL.md and AGENTS.md, defining a narrower identity and a more focused operational loop. The main agent handles conversation. The orchestrator breaks complex work into parallel workstreams. The specialists execute.

When I first built this, I named the agents after Greek mythology following oh-my-openagent's convention: Sisyphus, Atlas, Oracle, Hephaestus, Prometheus. It worked fine, but I recently went through a naming revision and switched to Tolkien, specifically figures from the Silmarillion, Unfinished Tales, and the broader legendarium. Not Tolkien in the sense of the Peter Jackson films or even The Lord of the Rings as most people know it, but the Professor's deeper world-building work: the Valar, the Maiar, the Noldorin Elves, the Ainulindale. That material has been the subject of serious academic lore analysis, and it turns out the mythological roles map to agent functions with unusual precision.

The reason I made this choice is personal: I'm a genuine admirer of Tolkien's scholarly and world-building work, not just the popular adaptations. Reading the Silmarillion properly, not as backstory for LOTR but as its own mythology, reveals an extraordinarily structured pantheon where each figure has a specific domain, specific limits, and a specific relationship to action and knowledge. That structure is exactly what you want in an agent roster.

Here's the current OpenClaw team:

Agent	Name	Origin	Current OpenClaw primary model	Role
`main`	Olórin	Maia (Gandalf's true name)	`openai/gpt-5.5`	Primary assistant, routes and synthesises
`orchestrator`	Aulë	Vala, the Smith	`openai/gpt-5.5`	Multi-step coordination, parallel delegation
`researcher`	Rúmil	Noldorin Elf, first loremaster of Arda	`openai/gpt-5.5`	Web research, multi-source verification
`thinker`	Námo	Vala, the Doomsman	`openai/gpt-5.5-pro`	Reasoning, tradeoffs, advisory. Read-only.
`craftsman`	Celebrimbor	Noldorin Elf, maker of the Rings	`openai/gpt-5.5`	Code, debugging, implementation
`planner`	Finrod	Noldorin Elf, Felagund	`openai/gpt-5.4`	Requirements interviews, planning
`librarian`	Pengolodh	Noldorin Elf, Loremaster of Gondolin	`openai/gpt-5.4-mini`	Fast docs and API lookups
`writer`	Maglor	Noldorin Elf, greatest singer in Arda	`openai/gpt-5.4`	Long-form writing, reports
`scout`	Legolas	Sindar Elf	`openai/gpt-5.4-mini`	Quick recon, cheap background sweeps
`preplanner`	Melian	Maia, the Girdle	`openai/gpt-5.4-mini`	Pre-planning: intent classification, hidden requirements
`reviewer`	Eönwë	Maia, Herald of Manwë	`openai/gpt-5.5`	Plan reviewer: OKAY or REJECT, max 3 blockers

The full sanitized roster is available as openclaw/openclaw.json, and each agent's SOUL.md, AGENTS.md, and IDENTITY.md files can be browsed in openclaw/agents/.

Because I also use Claude Code at work, I keep an equivalent model-tier map for Anthropic. The names and roles stay stable; the provider-specific model labels can change underneath them.

Agent	OpenAI tier	Anthropic alternative
`main`	`gpt-5.5`	Claude Sonnet
`orchestrator`	`gpt-5.5`	Claude Sonnet
`researcher`	`gpt-5.5`	Claude Sonnet
`thinker`	`gpt-5.5-pro`	Claude Opus
`craftsman`	`gpt-5.5`	Claude Sonnet
`planner`	`gpt-5.4`	Claude Sonnet
`librarian`	`gpt-5.4-mini`	Claude Haiku
`writer`	`gpt-5.4`	Claude Sonnet
`scout`	`gpt-5.4-mini`	Claude Haiku
`preplanner`	`gpt-5.4-mini`	Claude Haiku
`reviewer`	`gpt-5.5`	Claude Sonnet

A few names worth unpacking for anyone who knows the source material:

Olórin is Gandalf's name in Valinor. In the Valaquenta, he walked unseen among the Elves and understood their sorrows. He was sent to Middle-earth precisely because he could work with others rather than dominate them, as a counselor who interfaces between realms. That's a better fit for a primary assistant than "Gandalf", which carries too much of the heroic journey archetype.

Námo (Mandos) is the Doomsman. He pronounces fate laid out before him, never acts directly, and his verdicts are final. He's the read-only advisory agent by nature. The Doom of the Noldor was spoken once, clearly, and with devastating accuracy. For a high-reasoning model whose job is to analyse tradeoffs and never execute: perfect.

Eönwë is the Herald of Manwë who pronounced the final verdict of the War of Wrath. His job was to deliver judgment, not deliberate it. Binary, final, without editorializing. OKAY or REJECT with max 3 blockers. That's Eönwë.

Melian's Girdle was a perimeter of perception that revealed the hidden nature of things before they arrived. Beren walked through it because Melian had already classified his intent. The pre-planning function, exactly.

The model choices are deliberate but not sacred. The thinker gets the strongest reasoning tier. Scout, librarian, and preplanner get cheaper fast models because their work is bounded. Most execution and synthesis roles sit on the Sonnet/GPT-5.5 class of model because they need reliability more than maximal reasoning depth.

A mistake I made early was assigning the most expensive model to the orchestrator because it felt like the "best" model. The right model for each agent depends on what it actually does, not on name recognition.

5. Workspace File Hygiene in Practice

Once the setup is running, the biggest ongoing maintenance problem isn't writing the files. It's keeping them honest as they drift. A few practical things I've learned, drawing on community experience with larger setups:

Watch the bootstrap budget. Running openclaw doctor shows raw vs injected character counts per file, truncation percentage, and total vs budget. My AGENTS.md was at 99% of the 12,000-character per-file limit before I audited it. A file at 99% of cap is silently losing its tail on every turn.

Separate procedures from character. The single biggest source of AGENTS.md bloat is personality notes creeping in from SOUL.md, and the biggest source of SOUL.md bloat is procedural instructions that belong in AGENTS.md. A clear separation keeps both files lean and both behaviours consistent.

TOOLS.md is not a general reference manual. It should contain only local environment specifics: hostnames, credentials, known quirks of this particular deployment. Anything that would be the same across different installations doesn't belong there. If a section grows past ~3,000 characters, audit it.

Prune memory files. The daily memory/YYYY-MM-DD.md files accumulate over months and get injected into every session. Older daily files should be reviewed, and anything worth keeping permanently should be promoted to MEMORY.md. The rest can be archived. Keep MEMORY.md under 10,000 characters. If it grows past that, some content has become stable enough for a skill's documentation instead.

IDENTITY.md earns its place in multi-agent setups. In a single-agent setup it's mostly display metadata. In a multi-agent setup, explicit capability declarations in IDENTITY.md help the orchestrator route tasks correctly. "Cannot do without delegation: production code -> Celebrimbor, deep research -> Rumil" is more reliable than hoping the orchestrator infers it from context.

6. What This Actually Gets You

Five markdown files are the difference between a stateless AI tool and something that genuinely feels like a collaborator. SOUL.md gives the model a character that holds under pressure. AGENTS.md gives it operational discipline and a reliable boot sequence. IDENTITY.md gives it a routing card. USER.md gives it a relationship. MEMORY.md gives it continuity. Together they turn a session into something cumulative rather than disposable.

The thing I didn't expect is how much the specificity matters. A SOUL.md that says "be helpful and direct" does almost nothing. A SOUL.md that says "this person thinks in infrastructure, appreciates elegance, will notice bad writing, and doesn't need things explained twice" changes the model's behaviour in ways that are immediately obvious in conversation.

None of this requires anything exotic. Just markdown, deliberate thought about who each agent is, and the discipline to keep those files honest as you learn what actually works.

Further reading:

Anthropic: Claude's Character - the philosophical grounding for why persona design matters
OpenClaw workspace files explained - detailed per-file guide with real examples
SOUL.md deep dive - best practices and common mistakes for persona files
OpenClaw workspace architecture - file roles and anti-patterns
Memory files guide - how MEMORY.md and daily notes interact
Community workspace SKILL.md - token budget data and load order reference
CL4R1T4S - prompt archive that is useful as a defensive corpus, not a copy-paste source
Claude Code - the terminal-first coding agent I use for work
LangGraph - programmatic approach to the same multi-agent patterns
OpenClaw documentation - the gateway this setup runs on
GitHub Copilot model multipliers - if you're using Copilot and care about cost per request
andremmfaria/agent-config - the sanitized OpenClaw and Claude Code agent files described in this article; USER.md, TOOLS.md, and MEMORY.md are excluded