Guyoung Studio

Posted on Jun 7

BoxAgnts Runtime (7) — Sandboxed Execution, Rebuilding Agent Infrastructure

#ai #agents #rust #webassembly

The AI industry is moving fast. Every week brings a new agent framework, coding assistant, autonomous workflow engine, or multi-agent platform. Most discussions focus on model capabilities—reasoning performance, planning ability, context window size, tool selection accuracy.

Yet a more fundamental problem receives far less attention:

Most AI agents lack a trustworthy execution environment.

Current agent systems are becoming increasingly capable of interacting with the real world—executing code, modifying repositories, browsing websites, accessing databases, operating cloud infrastructure. As they gain operational authority, execution safety—not model intelligence—is becoming the decisive challenge.

The Industry Is Optimizing Intelligence, Not Execution

Most AI infrastructure investment goes toward making models smarter—larger models, better reasoning, longer context, more sophisticated planning. These technologies answer the question: "How can agents make better decisions?"

But production systems must answer a different question: "What happens when those decisions are wrong?"

Traditional software engineering has long assumed that failures will occur, designing fault isolation, permission boundaries, process containment, resource governance, and recovery mechanisms accordingly. Many AI systems still lack these properties—they rely on the fragile assumption that the model will behave correctly.

BoxAgnts rejects this assumption outright. In boxagnts/query/src/query.rs, the query loop has multiple defense layers built in:

// Protection mechanisms in the query loop
const MAX_TOKENS_RECOVERY_LIMIT: u32 = 3;   // Recovery attempt cap
const MAX_TOKENS_RECOVERY_MSG: &str = "...";  // Recovery message

// Inside the loop:
// - turn counter (prevents infinite loops)
// - max_tokens recovery mechanism (prevents token-exhaustion deadlock)
// - budget checking (prevents cost runaway)
// - cancel_token signal (interruptible at any time)

These aren't prompt-level "suggestions"—they are runtime-level hard constraints.

AI Agents Are Execution Systems, Not Chatbots

Viewing agents as advanced chat interfaces is an outdated and dangerous perspective. Modern agents can create files, run commands, call APIs, update databases, and deploy infrastructure—once an agent produces actions rather than text, the consequences of mistakes grow exponentially.

BoxAgnts' core execution loop clearly demonstrates the execution-system nature of agents:

User Request → LLM Planning → Tool Selection → Tool Execution → Environment Modification

The critical step isn't planning—it's execution. And every step of execution operates under runtime constraints.

Why Current Architectures Are Fragile

Most agent architectures boil down to:

LLM → Tool Call → Python Runtime → Shell Command → Host System

It's not a Python problem—it's a trust boundary problem. The model decides what to execute, what to access, and when to stop, but the model itself is exposed to prompt injection, adversarial documents, and untrusted content. This creates the architectural paradox: "untrusted planner → trusted execution."

BoxAgnts solves this by inserting runtime boundaries between Planner and Executor:

LLM (Planner)
    ↓
Query Loop (run_query_loop — execution governance)
    ↓
Tool Interface (permission_level check)
    ↓
WASM Sandbox (hard constraints)
    ↓
Host Resources (protected)

Each layer is an independent governance point. No implicit trust exists between layers.

Sandbox as a First-Class Runtime Primitive

BoxAgnts elevates sandboxed execution from "optional feature" to "architectural foundation." All WASM tools run inside sandboxes by default. The sandbox is the lowest infrastructure component (boxagnts/wasm-sandbox/), sitting beneath tools and gateway.

This design ensures security isn't a bolt-on—no matter how upper layers change, execution constraints remain in effect.

Tool Runtime and Workflow Engine Are Different Layers

A common confusion in the AI ecosystem is the relationship between workflow engines and runtime engines.

Workflow engines (chains, graphs, planners) determine "what should happen."
Runtime engines determine "what is allowed to happen."

BoxAgnts has three explicit orchestration layers:

Query Layer (boxagnts/query/): Workflow orchestration—manages conversation loops, auto-compaction, context management
Tool Layer (boxagnts/tools/ + boxagnts/wasm-tools/): Tool interface—permission checks, parameter validation
Sandbox Layer (boxagnts/wasm-sandbox/): Execution constraints—memory limits, network allowlists, timeout control

Workflow coordinates. Runtime governs. Both are necessary. Only the runtime provides security guarantees.

Multi-Agent Systems Make Isolation More Important

BoxAgnts' Managed Agent mode supports parallel Executors, each running in independent sandboxes. This improves specialization and scalability, but also amplifies risk.

Without proper isolation, malicious outputs propagate, context contamination spreads, capability escalation becomes possible, and debugging becomes nearly impossible. BoxAgnts' response is process-level thinking—each Executor has independent capabilities, isolated resources, independent context, and optional Git worktree isolation. This directly mirrors the process isolation model in modern operating systems.

Resource Governance

BoxAgnts enforces system-level resource control across all Executors through the WASM runtime:

Governance Dimension	Implementation
CPU Usage	`wasm_fuel` (instruction-level fuel) + `wasm_timeout`
Memory Usage	`wasm_max_memory_size` + `wasm_max_wasm_stack`
Network Access	`allowed_outbound_hosts` + `block_networks` + `block_url`
File Access	`work_dir` + `map_dirs` (precise directory mounts)
Token Budget	`total_budget_usd` (Managed Agent mode)
Concurrency Control	`max_concurrent_executors`

Without this governance, the more autonomous an agent becomes, the greater its destructive potential.

AI Agents Need Runtime Engineering

The AI industry has spent years on prompt engineering, model engineering, and workflow engineering. A new discipline is emerging: runtime engineering—focusing on execution boundaries, capability systems, resource governance, fault containment, sandboxed tooling, and orchestration safety.

As agents gain authority over real environments, runtime engineering is no longer optional—it's infrastructure necessity.

The Future Looks More Like an Operating System

Many current AI products are designed as applications. Future AI infrastructure will more closely resemble operating systems—providing scheduling, isolation, permissions, process management, and resource governance.

BoxAgnts' module architecture already shows this embryonic form:

Operating System              BoxAgnts
──────────────              ─────────
Process Scheduling    ←→    Query Loop (run_query_loop)
Process Isolation     ←→    WASM Sandbox
File Permissions      ←→    PermissionLevel + RunOption
Network Filtering     ←→    allowed_outbound_hosts + block_networks
Memory Management     ←→    wasm_max_memory_size
Timeout Control       ←→    wasm_timeout + wasm_fuel
Task Scheduling       ←→    Cron Scheduler (gateway/cron/)
State Management      ←→    Workspace Persistence (workspace/)

This isn't coincidence—when AI agents become autonomous execution units, managing them requires operating-system-level thinking. Prompt engineering is a user-space tool; true security guarantees come from the kernel-space runtime.

Conclusion

The next generation of AI systems will not be defined by model intelligence alone—they will be defined by execution reliability. The "model-is-correct, therefore system-is-correct" assumption that current architectures depend on does not hold in production.

BoxAgnts' engineering practice points to the right direction: sandboxed execution, capability isolation, resource governance, deterministic boundaries, secure orchestration. These are runtime problems, and solving them will be the most important engineering challenge in AI infrastructure over the coming decade.

The future of AI agents isn't about making models smarter—it's about making execution trustworthy.