DEV Community

Custodia-Admin
Custodia-Admin

Posted on • Originally published at pagebolt.dev

How to Give Local LLMs Browser Automation Superpowers with MCP

How to Give Local LLMs Browser Automation Superpowers with MCP

You run Ollama or LM Studio locally. Your LLM is fast, private, and under your control. One problem: it can't see the web or automate browser tasks.

You could call Claude API for those tasks, but then you're not really running local. You're hybrid — privacy for inference, cloud for everything else.

There's a better way. PageBolt's MCP server works with any MCP-compatible runtime, including Ollama, LM Studio, and other local clients. Your local LLM can call browser tools natively: take screenshots, generate PDFs, inspect page elements, run multi-step sequences, and record demo videos.

No cloud dependency. Pure local automation.

The Setup

Install PageBolt MCP:

npm install pagebolt-mcp
Enter fullscreen mode Exit fullscreen mode

Configure it in your local MCP client (Ollama MCP, LM Studio, or whatever supports the spec):

{
  "mcpServers": {
    "pagebolt": {
      "command": "node",
      "args": ["node_modules/pagebolt-mcp/dist/index.js"],
      "env": {
        "PAGEBOLT_API_KEY": "your_api_key"
      }
    }
  }
}
Enter fullscreen mode Exit fullscreen mode

Your local model now has five tools available.

Concrete Example: Screenshot a Competitor's Pricing Page

Your local Llama-2 instance can now run this:

User: "Take a screenshot of https://competitors-site.com/pricing and tell me what they're charging."

Llama-2 (using PageBolt MCP):
→ Calls take_screenshot with url=https://competitors-site.com/pricing
→ Gets back a PNG
→ Analyzes the image
→ Returns: "They're charging $99/month for the Pro plan, $299/month for Enterprise."
Enter fullscreen mode Exit fullscreen mode

No screenshot library needed. No Puppeteer. No infrastructure. Just HTTP calls to PageBolt, results back to your local model.

What This Enables

Pricing monitors: Run a local LLM that screenshots competitor sites daily and alerts you to price changes.

Automated testing: Local models run web tests, take screenshots on failure, generate reports — all without leaving your machine.

Document generation: Local LLM inspects pages, extracts data, generates PDFs — invoices, reports, certificates — no cloud calls.

Demo videos: Your local model describes what it wants to show, PageBolt records it with narration, you get an MP4.

Multi-step workflows: Navigate → fill form → submit → screenshot result. All orchestrated by your local model.

The Tradeoff

You keep inference local (privacy, control, no vendor lock-in). Browser automation goes to a managed service (no infrastructure overhead, no Puppeteer headaches).

This is the right split: reasoning stays local, heavy lifting goes to a managed API.

Getting Started

Free tier: 100 requests/month. Enough to run daily competitor monitoring, weekly test suites, or monthly report generation without paying.

Download the API key, install pagebolt-mcp, add it to your MCP config, and you're done.

Your local LLM now has browser superpowers.

Get started at https://pagebolt.dev

Top comments (0)