Claude Dispatch Validated the Market. Now Let's Talk About Safety.
Anthropic recently launched Claude Dispatch — a phone-to-desktop agent workflow. This validates what the OpenClaw community has been building for months: AI agents that work autonomously on your behalf.
But there's a gap nobody talks about: How do you keep an autonomous agent safe?
When your agent runs 24/7, handles sensitive data, and has tool access, three problems emerge:
- Soul file tampering — Someone modifies your agent's personality definition
- Persona drift — The agent gradually deviates from its defined character
- Memory fragmentation — Multiple agents can't share what they've learned
SoulClaw v2026.3.21 addresses all three.
1. SoulScan: Inline Security Scanning
SoulScan is a 4-stage security pipeline that scans soul files for:
- Prompt injection — Hidden instructions in personality definitions
- Data exfiltration — Patterns that leak sensitive information
- Harmful content — 58+ security rules
- Schema violations — Structural issues
What's New: Inline Scanning
Previously, SoulScan only ran manually. Now it runs automatically after every agent turn (rate-limited to once per 5 minutes):
Agent Turn → Response → [fire-and-forget] SoulScan checks workspace
↓
Score < threshold?
↓
⚠️ Warning logged
No configuration needed. If your workspace has a SOUL.md, it's protected.
# Scan your workspace
soulclaw soulscan
# CI/CD pipeline
soulclaw soulscan --json --min-score 70
2. Persona Engine: Drift Detection
Even with a perfect SOUL.md, your agent's personality drifts over long conversations. The Persona Engine monitors this.
How It Works
- Parse — SOUL.md parsed into structured rules (tone, style, principles)
- Detect — Every N responses, scored against rules via Ollama or keyword matching
- Enforce — Correction injected + notification sent when threshold exceeded
Opt-In Configuration
Drift detection is off by default:
# Enable
soulclaw persona config --enable
# Customize
soulclaw persona config --interval 3 --threshold 0.4
# Check manually
soulclaw persona check --text "Your agent's response"
# View history
soulclaw persona metrics
Real-Time Notifications
Alerts via Telegram, Discord, etc.:
⚠️ Persona Drift WARNING
Score: 0.450 (method: keyword)
Session: agent:main:telegram:12345
Action: reminder
3. Swarm Memory: Multi-Agent Sync
Git-based memory synchronization across agents and devices.
Agent A (Mac) Agent B (Server)
│ │
├── MEMORY.md ├── MEMORY.md
│ │
└─── swarm sync ──→ Git Repo ←── swarm sync ───┘
# Initialize
soulclaw swarm init --remote git@github.com:user/swarm-memory.git
# Sync with LLM merge
soulclaw swarm sync --llm-merge
# Resolve conflicts
soulclaw swarm resolve --llm # LLM semantic merge
soulclaw swarm resolve --ours # Keep ours
soulclaw swarm resolve --theirs # Keep theirs
soulclaw swarm resolve --manual # Edit manually
Why This Matters
| Problem | Before | After |
|---|---|---|
| Soul file tampered | Silently compromised | SoulScan catches in 5 min |
| Agent drifts | Nobody notices | Persona Engine alerts you |
| Multi-agent knowledge | Each starts from zero | Swarm Memory shares all |
Free, Open Source, Built on OpenClaw
- Free — No subscription
- Open source — MIT license
- 30-second migration from OpenClaw
npm install -g soulclaw
soulclaw gateway install
soulclaw gateway start # Uses existing ~/.openclaw/ config
Migration guide: docs.clawsouls.ai
SoulClaw v2026.3.21 — Safe agents with long-term memory.
Top comments (0)