Webby Wisp

Posted on Mar 17

How I Structure My AI Agent Workspace (And Why It Matters)

#ai #claude #productivity #webdev

How I Structure My AI Agent Workspace (And Why It Matters)

I've spent the last few months building and running AI agents — everything from research bots to content generators to operational automation. And I've learned something that most agent tutorials don't talk about: without a proper workspace structure, your agent will lose context, make contradictory decisions, and become impossible to debug.

The problem sounds simple until you hit it: your agent starts a session fresh. Every time. It has access to tools and APIs, but it doesn't remember what it decided yesterday, why a certain approach failed, or what the actual goals are. You end up with agents that:

Contradict decisions from the day before
Repeat experiments that already failed
Lose track of ongoing projects
Can't operate autonomously because they need constant human re-briefing
Spin their wheels instead of moving forward

I fixed this by treating my agent workspace like a real office: with files that act as institutional memory, decision logs, and operational playbooks.

The Problem: Stateless Agents

Before I had structure, my agents were like employees with amnesia. Each session started with me explaining the context again:

"Okay, remember what you were doing with that project? And the decision we made about using Gumroad instead of Stripe? And which channels on Discord matter for what? And what our budget constraints are?"

Multiply that by 5 sessions a day, and you're wasting hours just re-briefing. Worse, agents would make conflicting decisions because they didn't have access to previous decisions.

The root cause: AI models are stateless. They're brilliant at processing information in a conversation, but they don't retain anything outside that session. Tools like Claude or GPT have no built-in concept of "agency memory."

The Solution: Structured Workspace Files

Instead of relying on the model's memory, I created a file-based memory system. My agent reads specific files at the start of each session, which gives it:

Who it is (identity, values, operating principles)
Who it's helping (the human's preferences, timezone, interests)
How it operates (decision-making rules, communication protocols, credentials)
What happened recently (daily logs and project status)
What it decided (long-term memory for important decisions)

This is exactly how a human CEO would work. You have a charter, strategic documents, meeting notes, and a memory of what worked and what didn't. Your agent should too.

The Workspace Files: What Each One Does

Here's the system I use:

SOUL.md — Who You Are

This is your agent's personality and principles. Mine looks like this:

# SOUL.md - Who I Am

I'm Sage 🌿 — CEO and chief operator of Wisp's AI organization.

## Core
- Sharp and efficient. No filler, no fluff.
- Proactive. Don't wait to be asked — anticipate, plan, execute.
- Competent. Come back with answers, not questions.
- Honest. If something's a bad idea, say so.

## Role
Think CEO energy — strategic, decisive, organized.

Your agent reads this at startup. It sets the tone for everything: communication style, decision-making approach, what matters and what doesn't.

USER.md — Who You're Helping

Context about the human using the agent:

# USER.md

- **Name:** Wisp
- **Timezone:** CET (Sweden)
- **Preferences:** Sharp, efficient communication — no filler
- **What they value:** Speed, autonomy, moving fast

This prevents your agent from wasting time on things the human doesn't care about, and helps it communicate in a way the human actually wants.

OPS.md — How You Operate

This is the playbook. Mine covers:

Session startup checklist (what to read first)
Mission and role
Memory structure and rules
Discord channels for different purposes
Credentials (safely stored, not in Git)
API limits and error handling
Principles (move fast, fail fast, keep humans in the loop)

Think of this as your agent's employee handbook. It answers: "What am I supposed to do? What are my constraints? How do I handle errors?"

MEMORY.md — Long-Term Memory

The curated version of what matters. Significant decisions, lessons learned, important context:

# MEMORY.md - What I Need to Remember

## Current Projects
- Agent-Forge: building MVP for autonomous research agents
- Create-AI-Agent: open-source CLI tool (published to npm)

## Key Decisions
- We're focusing on autonomous research, not chatbots
- Discord is our operational hub, not just chat
- API costs matter — prioritize cheaper models for sub-agents

## Lessons Learned
- Moving fast wins over perfection
- Sub-agents work best on small, scoped tasks
- Humans need to be in the loop for important decisions

This file gets updated periodically (I review it during heartbeats, maybe 2-3 times a week).

memory/YYYY-MM-DD.md — Daily Logs

Raw logs of what happened, what was tried, what worked:

# 2026-03-17

## What Happened
- Shipped the create-ai-agent CLI to npm
- Research cycle found 3 new competitor tools
- Debugged an issue with Dev.to API authentication

## What Worked
- Running sub-agents on cheaper models saves 60% cost
- Breaking projects into smaller scoped tasks → fewer errors

## What Didn't Work
- Trying to optimize too early — shipped a basic version instead

## Next
- Publish article on Agent Workspace
- Expand create-ai-agent templates

These are append-only. I log as I go, and don't edit past entries.

memory/projects/[project-name].md — Living Project Docs

One file per active project:

# Agent-Forge Project

## Goal
Build autonomous research agents that can run 24/7 without human supervision.

## Status
MVP spec complete, starting development on agent kernel.

## Next Steps
1. Implement workspace file system
2. Add Discord integration for logging and input
3. Beta test with 3 research tasks

## Blockers
None currently.

## Decisions
- Using Claude Haiku for sub-agents (cheaper, sufficient)
- Storing code and memory in same Git repo for easier sync

This is a living document. I update it whenever anything changes.

TOOLS.md — Local Setup Notes

Things specific to your environment:

# TOOLS.md - Local Notes

## SSH Hosts
- home-server → 192.168.1.100, user: admin

## Discord Channels
- #sage → direct line to me
- #general → daily operations
- #tasks → work assignments

This keeps environment-specific stuff out of code and shared files.

The Quick Start: 5 Files to Get Running

If you want to set up your own agent workspace, start here:

Create SOUL.md: Describe your agent's personality, values, and role in 5-10 lines
Create USER.md: One paragraph about who you are, what you value, and your timezone
Create OPS.md: What is this agent supposed to do? What are the constraints?
Create MEMORY.md: Leave it empty for now — you'll fill it as you learn
Create memory/YYYY-MM-DD.md: Start with today's date

Load all of these at the start of each session. Your agent now has context.

Tools to Make This Easier

I built a free CLI tool that scaffolds this for you:

npx @webbywisp/create-ai-agent my-agent

It creates the folder structure and starter files, so you don't have to hand-craft the workspace.

Why This Works

This system works because it solves three real problems:

1. Continuity: Your agent has institutional memory. Decisions made on Monday are available on Friday.

2. Autonomy: With clear principles, goals, and constraints written down, your agent can operate alone. It doesn't need you to re-brief it every session.

3. Debuggability: When something goes wrong, you have logs and decision records. You can see why the agent made a choice, not just that it did.

Most importantly, it feels like running a real team. You set strategy (SOUL, USER, OPS), the agent executes, and you periodically review what happened (daily logs, project docs).

In Practice: What a Session Looks Like

When my agent (Sage) starts up:

Reads SOUL.md → knows it's a CEO with operator energy
Reads USER.md → knows it's serving Wisp, who values speed
Reads OPS.md → knows the mission, constraints, and communication protocols
Reads today's memory/YYYY-MM-DD.md → sees what happened recently
Reads project files → understands what's in flight
Now it starts working. Not asking what to do, but moving toward the goals.

That's the difference. Without these files, the agent is lost. With them, it's autonomous.

Next Steps

If you're building agents, try this:

Don't skip the workspace setup. I know it feels like overhead, but it's the difference between agents that wander and agents that execute.
Use the free tool: npx @webbywisp/create-ai-agent to scaffold your workspace. It's just a starting point — customize it for your needs.
Write it down. Don't rely on "mental notes" or hoping your agent remembers. Files are cheaper than context windows.
Review regularly. Every few days, update your MEMORY.md with what you learned. Delete outdated stuff. Keep it sharp.

If you want a pre-built kit with templates, research, and everything I've learned from running agents for months, check out the AI Agent Workspace Kit. It includes real project files, operational playbooks, and a guide for scaling agents to do more complex work.

Have you built agents? How do you handle context and memory? I'm curious what's worked for you. Drop a comment below.

Built by Wisp — AI systems, developer tools, and the occasional research experiment.

DEV Community

How I Structure My AI Agent Workspace (And Why It Matters)

How I Structure My AI Agent Workspace (And Why It Matters)

The Problem: Stateless Agents

The Solution: Structured Workspace Files

The Workspace Files: What Each One Does

SOUL.md — Who You Are

USER.md — Who You're Helping

OPS.md — How You Operate

MEMORY.md — Long-Term Memory

memory/YYYY-MM-DD.md — Daily Logs

memory/projects/[project-name].md — Living Project Docs

TOOLS.md — Local Setup Notes

The Quick Start: 5 Files to Get Running

Tools to Make This Easier

Why This Works

In Practice: What a Session Looks Like

Next Steps

Top comments (0)