Custodia-Admin

Posted on Mar 11 • Originally published at pagebolt.dev

Building with Claude Agent SDK? Here's the MCP Tool Stack That Gives You Visual Proof

#mcp #claude #agentsdk #observability

Building with Claude Agent SDK? Here's the MCP Tool Stack That Gives You Visual Proof

You're building with Claude Agent SDK. You've chosen it because it's MCP-native — which means you're not locked into a closed ecosystem. You want MCP tools that extend what your agent can do.

But here's the gap: Claude can use tools. Claude can call APIs. Claude can navigate web apps. But when you ask "what did my agent actually see?" — Claude's log output doesn't answer that.

That's where PageBolt fits into your MCP stack.

The MCP Tool Stack Problem

Claude Agent SDK gives you the agent infrastructure. You add tools via MCP servers. A typical stack looks like:

Core MCP server — your custom agent logic
Web navigation tool — browser automation (Puppeteer, Playwright)
Data fetching tools — APIs, databases
Action tools — write, delete, modify operations

But nobody asks: "Did the agent actually see what we expected?" You have logs. You have API responses. You don't have proof.

PageBolt as the Visual Proof Layer

PageBolt is an MCP server that adds three critical capabilities:

1. Screenshot capture — Claude can call capture_screenshot at any step and get a PNG of the current page state.

@mcp.tool()
async def capture_screenshot(url: str, selector: str = None) -> str:
    """Capture visual proof of what the agent sees."""

2. Step replay — Record the full step-by-step execution as video. See the exact moment an agent navigates, fills a form, or encounters an error.

3. Audit trails — Every screenshot is timestamped and stored with your logs. Correlate visual proof with agent execution traces.

Adding PageBolt to Your Claude Agent Stack

One install:

npm install pagebolt-mcp

{
  "mcp_servers": [
    {
      "name": "pagebolt",
      "command": "npx",
      "args": ["pagebolt-mcp"],
      "env": {
        "PAGEBOLT_API_KEY": "your_api_key"
      }
    }
  ]
}

Now your Claude agent has access to:

capture_screenshot(url) — Get a PNG of the current state
record_video(start_step, end_step) — Record a workflow as video
inspect_page(url) — Lightweight DOM inspection (60-80% cheaper token cost than full snapshots)

Practical Example

Agent workflow: "Navigate to GitHub, check PR status, report findings."

1. Agent navigates to https://github.com/my-org/my-repo/pulls
2. Agent calls: capture_screenshot("https://github.com/my-org/my-repo/pulls")
   → Returns PNG of actual PR board
3. Agent parses the page and identifies open PRs
4. Agent navigates to each PR
5. Agent captures screenshot of each PR detail page
6. Agent compiles report: "3 PRs open, all passing tests"
7. You review: original screenshots prove the agent saw the real GitHub interface, not a fake one

No logs can give you that confidence. Screenshots do.

Why This Matters for Claude Agent SDK Developers

You chose Claude Agent SDK for its flexibility and MCP-native architecture. You want tools you control, not a locked platform.

PageBolt is a native MCP tool that respects that philosophy. It doesn't replace your agent framework. It adds a single, powerful capability: visual proof.

When auditors ask "did your agent do what you say it did?" you show them the screenshot timeline. When you're debugging why an agent failed, you see exactly what the page looked like at the moment it failed.

That's the visual proof layer your stack needs.

Getting Started

Install PageBolt MCP: npm install pagebolt-mcp
Add to your agent config (see above)
Set PAGEBOLT_API_KEY (free tier: 100 requests/month)
Call capture_screenshot in your agent workflows at critical steps
Store screenshots alongside logs for audit compliance

Your Claude Agent SDK now has visual proof built in.

PageBolt MCP is open source and fully self-hosted compatible. Free tier: 100 requests/month. Get started.

DEV Community

Building with Claude Agent SDK? Here's the MCP Tool Stack That Gives You Visual Proof

Building with Claude Agent SDK? Here's the MCP Tool Stack That Gives You Visual Proof

The MCP Tool Stack Problem

PageBolt as the Visual Proof Layer

Adding PageBolt to Your Claude Agent Stack

Practical Example

Why This Matters for Claude Agent SDK Developers

Getting Started

Top comments (0)