OpenClaw Token Saver Skill: Optimize Your AI Costs

#news #insights #ginie #openclaw

What is the OpenClaw Token Saver Skill?

The Token Saver skill is a model-aware optimization tool that reduces OpenClaw
AI costs by intelligently compressing workspace files. Every API call sends
your workspace files (SOUL.md, USER.md, MEMORY.md, AGENTS.md, etc.) along with
your message, and these files count toward your context window, slowing
responses and costing real money on every message.

How It Works

Token Saver v3 is model-aware—it knows your model's context window and adapts
recommendations accordingly. Using Gemini's 1M context? Presets scale up. On
GPT-4o's 128K? Presets adjust down. This intelligent approach ensures optimal
compression without losing essential information.

Key Features of Token Saver v3

Model-Aware Dashboard

The skill provides a comprehensive dashboard showing your current model,
context window, and usage percentage. For example:

🤖 Model: Claude Opus 4.5 (200K context)

Detected: openclaw.json

📊 Context Usage: [████████░░░░░░░░░░░░] 42% (84K/200K)

Workspace File Compression

The skill scans all .md files in your workspace, showing token count and
potential savings. It features smart bypass that skips already-optimized
files, ensuring you don't waste time re-compressing content that's already
efficient.

File-Aware Compression Strategies

Each file type receives optimized compression:

SOUL.md — Light compression, keeps personality language
AGENTS.md — Medium compression, dense instructions
USER.md / MEMORY.md — Heavy compression, key:value format
PROJECTS.md — No compression (user structure preserved)

### Dynamic Compaction Presets

Presets adapt to your model's context window:

Preset	% of Context	Claude 200K	GPT-4o 128K	Gemini 1M
Aggressive	40%	80K	51K	400K
Balanced	60%	120K	77K	600K
Conservative	80%	160K	102K	800K
Off	95%	190K	122K	950K

### Robust Model Detection

The skill features a sophisticated detection system with multiple fallback
options:

 1. Runtime injection (--model=...)
 2. Environment variables (SKILL_MODEL, OPENCLAW_MODEL)
 3. Config file (~/.openclaw/openclaw.json)
 4. File inference (TOOLS.md, MEMORY.md mentions)
 5. Fallback: Claude Sonnet 4 (safe default)

### 24+ Model Registry

The skill supports extensive model coverage:

 * **Claude:** Opus 4.6 (1M), Opus 4.5, Sonnet 4.5, Sonnet 4, Haiku 4.5, Haiku 3.5 (200K)
 * **OpenAI:** GPT-5.2, GPT-5.1, GPT-5-mini, GPT-5-nano (256K), GPT-4.1, GPT-4o (128K), o1, o3, o4-mini
 * **Gemini:** 3 Pro (2M), 2.5 Pro, 2.0 Flash (1M)
 * **Others:** DeepSeek V3 (64K), Kimi K2.5 (128K), Llama 3.3 70B, Mistral Large

## Available Commands

The skill provides several commands for different optimization needs:

 * **/optimize** — Full dashboard showing files, models, and context usage percentage
 * **/optimize tokens** — Compress workspace files with auto-backup
 * **/optimize compaction** — Chat compaction control with model-aware settings
 * **/optimize compaction balanced** — Apply balanced preset (60% of context)
 * **/optimize compaction 120** — Custom threshold (compact at 120K)
 * **/optimize models** — Detailed model audit with registry
 * **/optimize revert** — Restore backups, disable persistent mode

## Installation and Setup

Installing the Token Saver skill is straightforward:

 clawhub install token-saver --registry "https://www.clawhub.ai"

After installation, the skill automatically detects your model and applies
optimal compression settings.

## Safety and Integrity Features

The skill prioritizes safety and data integrity:

 * **Auto-backup** — All modified files get .backup extension
 * **Integrity > Size** — Never sacrifices meaning for smaller tokens
 * **Smart bypass** — Skips already-optimized files
 * **Revert anytime** — /optimize revert restores everything
 * **No external calls** — All analysis runs locally

## Persistent Mode

The skill includes a persistent mode that adds writing guidance to AGENTS.md
for continued token efficiency:

 * **SOUL.md** — Evocative, personality-shaping language
 * **AGENTS.md** — Dense instructions, symbols OK
 * **USER.md** — Key:value facts
 * **MEMORY.md** — Ultra-dense data

## Version History

The skill has evolved significantly:

 * **3.0.0** — Model registry, dynamic presets, robust detection, smart bypass
 * **2.0.1** — Chat compaction, file-aware compression, persistent mode
 * **1.0.0** — Initial release

## Why Use Token Saver?

The Token Saver skill can significantly reduce your AI costs by optimizing
how your workspace files are sent with each API call. By intelligently
compressing files based on your specific model's context window, it ensures
you're not paying for unnecessary tokens while maintaining the quality and
functionality of your OpenClaw experience.

Whether you're using Claude, GPT, Gemini, or other models, the Token Saver
skill adapts to your needs, providing automatic optimization that saves you
money without requiring manual intervention or technical expertise.

Skill can be found at:
saver/SKILL.md>