DEV Community

Broxhq

I built an AI agent that runs manual test cases in a real browser

The problem

Every deploy means the same manual test steps: log in, open the form,
fill the fields, check the result. Over and over.

I wanted to skip the Playwright/Selenium boilerplate and just
paste my existing test cases as plain text.

What I built

qpilot — an AI agent that reads your manual test case and
executes it in a real Chrome browser step by step.

You write this:

  1. Go to https://myapp.com/login
  2. Enter email and password
  3. Click Login
  4. Verify dashboard is visible

The agent opens Chrome, clicks, fills forms, and reports
pass/fail/warn per step with evidence from the page.
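
To make the input format concrete, here is a minimal sketch of how a pasted test case like the one above could be split into individual steps. It assumes each step sits on its own numbered line; `TestStep` and `parseTestCase` are hypothetical names for illustration, not qpilot's actual parser.

```typescript
// Hypothetical helper, not part of qpilot: splits a pasted test case into steps.
interface TestStep {
  index: number; // step number as written in the test case
  text: string;  // instruction with the numbering stripped
}

function parseTestCase(raw: string): TestStep[] {
  const steps: TestStep[] = [];
  for (const line of raw.split(/\r?\n/)) {
    // Matches lines such as "1. Go to ..." or "2) Click Login".
    const match = /^\s*(\d+)[.)]\s+(.+?)\s*$/.exec(line);
    if (!match) continue; // skip blank lines and anything that is not numbered
    const [, num = "", text = ""] = match;
    steps.push({ index: Number(num), text });
  }
  return steps;
}

const example = `
  1. Go to https://myapp.com/login
  2. Enter email and password
  3. Click Login
  4. Verify dashboard is visible
`;

for (const step of parseTestCase(example)) {
  console.log(`${step.index}: ${step.text}`);
}
```

Lines that do not start with a number are skipped in this sketch, so titles or notes pasted along with the steps would be ignored rather than executed.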

If it hits an OTP or a captcha, it pauses and asks you directly.

How it works

  • Playwright controls the real Chrome browser
  • Each step: snapshot → action → snapshot → report
  • Claude Haiku reads the snapshot (ARIA tree) and decides what to click
  • Element refs (e.g. e12) are used for precise targeting
  • Context window is managed to avoid hitting token limits
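
The snapshot → action → snapshot → report cycle from the list above can be sketched roughly as follows. This is an illustration under assumptions, not qpilot's source: `BrowserDriver`, `Agent`, `Decision`, `runStep` and `runTestCase` are hypothetical names, the browser and the model are hidden behind interfaces, and the sketch simply runs every step in order.

```typescript
// All names here are hypothetical; they illustrate the loop, not qpilot's internals.
type Status = "pass" | "fail" | "warn";

interface Decision {
  action: "click" | "fill" | "verify";
  ref?: string;   // element ref taken from the snapshot, e.g. "e12"
  value?: string; // text to type when action is "fill"
}

interface StepReport {
  step: string;
  status: Status;
  evidence: string; // what on the page supports the status
}

// Stand-in for the Playwright-driven browser.
interface BrowserDriver {
  snapshot(): Promise<string>; // ARIA tree rendered as text
  perform(decision: Decision): Promise<void>;
}

// Stand-in for the model calls.
interface Agent {
  decide(step: string, snapshot: string): Promise<Decision>;
  report(step: string, before: string, after: string): Promise<StepReport>;
}

async function runStep(
  step: string,
  browser: BrowserDriver,
  agent: Agent,
): Promise<StepReport> {
  const before = await browser.snapshot();           // snapshot
  const decision = await agent.decide(step, before); // choose what to do
  try {
    await browser.perform(decision);                 // action
  } catch (err) {
    return { step, status: "fail", evidence: `action failed: ${String(err)}` };
  }
  const after = await browser.snapshot();            // snapshot
  return agent.report(step, before, after);          // report
}

async function runTestCase(
  steps: string[],
  browser: BrowserDriver,
  agent: Agent,
): Promise<StepReport[]> {
  const reports: StepReport[] = [];
  for (const step of steps) {
    // Steps run strictly in order because each one depends on the page state
    // left behind by the previous one.
    reports.push(await runStep(step, browser, agent));
  }
  return reports;
}
```

Keeping the browser and the model behind small interfaces means the loop can be exercised with fakes, without launching Chrome or calling an API.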

Try it

npx qpilot

No code. No config. No Selenium.

Stack

TypeScript, Playwright, Claude Haiku via Anthropic API.

Open source: qpilot

Curious what you think — especially about edge cases
you'd want it to handle.
