<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: zmy</title>
    <description>The latest articles on DEV Community by zmy (@zmysysz).</description>
    <link>https://dev.to/zmysysz</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3964320%2F159920e3-933f-41c4-92a4-04daa061334f.png</url>
      <title>DEV Community: zmy</title>
      <link>https://dev.to/zmysysz</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/zmysysz"/>
    <language>en</language>
    <item>
      <title>Browser-CLI: Let Your AI Agent Control the Browser from the Command Line</title>
      <dc:creator>zmy</dc:creator>
      <pubDate>Tue, 02 Jun 2026 10:07:18 +0000</pubDate>
      <link>https://dev.to/zmysysz/browser-cli-let-your-ai-agent-control-the-browser-from-the-command-line-2p4n</link>
      <guid>https://dev.to/zmysysz/browser-cli-let-your-ai-agent-control-the-browser-from-the-command-line-2p4n</guid>
      <description>&lt;p&gt;Ever wanted your AI coding assistant to actually &lt;em&gt;use&lt;/em&gt; a browser? Not just read web pages, but click buttons, fill forms, take screenshots, and extract data — all from the terminal?&lt;/p&gt;

&lt;p&gt;That's exactly why I built &lt;strong&gt;Browser-CLI&lt;/strong&gt;.&lt;/p&gt;

&lt;h2&gt;
  
  
  What is it?
&lt;/h2&gt;

&lt;p&gt;Browser-CLI is a Go-based command-line tool that wraps Playwright to give AI agents full browser control through simple shell commands. No API keys, no browser extensions, no complex setup — just run a command and you're off.&lt;/p&gt;

&lt;p&gt;👉 &lt;strong&gt;GitHub&lt;/strong&gt;: &lt;a href="https://github.com/zmysysz/browser-cli" rel="noopener noreferrer"&gt;https://github.com/zmysysz/browser-cli&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;⭐ Stars and feedback are appreciated!&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# Install&lt;/span&gt;
git clone https://github.com/zmysysz/browser-cli
&lt;span class="nb"&gt;cd &lt;/span&gt;browser-cli &lt;span class="o"&gt;&amp;amp;&amp;amp;&lt;/span&gt; make build &lt;span class="o"&gt;&amp;amp;&amp;amp;&lt;/span&gt; make &lt;span class="nb"&gt;install
&lt;/span&gt;make setup-browsers  &lt;span class="c"&gt;# first time only&lt;/span&gt;

&lt;span class="c"&gt;# Use&lt;/span&gt;
browser-cli navigate https://example.com
browser-cli fill &lt;span class="s2"&gt;"#search"&lt;/span&gt; &lt;span class="s2"&gt;"hello world"&lt;/span&gt;
browser-cli click &lt;span class="s2"&gt;"button[type=submit]"&lt;/span&gt;
browser-cli text
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  Why not just use Playwright directly?
&lt;/h2&gt;

&lt;p&gt;Playwright is great, but it's a library — you need to write code to use it. Browser-CLI turns it into a &lt;strong&gt;universal CLI interface&lt;/strong&gt; that any AI agent can call without writing a single line of automation code.&lt;/p&gt;

&lt;p&gt;This means:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Claude Code&lt;/strong&gt; can browse the web&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;OpenAI Codex&lt;/strong&gt; can fill forms and extract data&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Cursor&lt;/strong&gt; can take screenshots and interact with pages&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Any AI agent&lt;/strong&gt; can automate browser tasks through shell commands&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  Key Features
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;🤖 &lt;strong&gt;AI-First Design&lt;/strong&gt; — Structured JSON output, auto-managed server, clear command semantics&lt;/li&gt;
&lt;li&gt;🔒 &lt;strong&gt;Session Isolation&lt;/strong&gt; — Each agent gets its own browser instance via &lt;code&gt;--session&lt;/code&gt;
&lt;/li&gt;
&lt;li&gt;🍪 &lt;strong&gt;Cookie Persistence&lt;/strong&gt; — Auto save/load, login states preserved across sessions&lt;/li&gt;
&lt;li&gt;🌐 &lt;strong&gt;Proxy Support&lt;/strong&gt; — &lt;code&gt;--proxy http://host:port&lt;/code&gt; for restricted networks&lt;/li&gt;
&lt;li&gt;🎯 &lt;strong&gt;Web Components&lt;/strong&gt; — &lt;code&gt;smart-click&lt;/code&gt; and &lt;code&gt;pick&lt;/code&gt; for custom elements and Shadow DOM&lt;/li&gt;
&lt;li&gt;⌨️ &lt;strong&gt;Full Keyboard&lt;/strong&gt; — Shortcuts, combos, Tab/Enter/Escape, Ctrl+A/C/V&lt;/li&gt;
&lt;li&gt;📄 &lt;strong&gt;PDF &amp;amp; Screenshot&lt;/strong&gt; — Export pages as PDF or PNG&lt;/li&gt;
&lt;li&gt;📁 &lt;strong&gt;File Upload&lt;/strong&gt; — Upload files to any &lt;code&gt;&amp;lt;input type="file"&amp;gt;&lt;/code&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  30 Commands at a Glance
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Category&lt;/th&gt;
&lt;th&gt;Commands&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Navigate&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;navigate&lt;/code&gt;, &lt;code&gt;back&lt;/code&gt;, &lt;code&gt;forward&lt;/code&gt;, &lt;code&gt;reload&lt;/code&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Click&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;click&lt;/code&gt;, &lt;code&gt;click-js&lt;/code&gt;, &lt;code&gt;smart-click&lt;/code&gt;, &lt;code&gt;right-click&lt;/code&gt;, &lt;code&gt;dblclick&lt;/code&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Input&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;fill&lt;/code&gt;, &lt;code&gt;type&lt;/code&gt;, &lt;code&gt;select&lt;/code&gt;, &lt;code&gt;keyboard&lt;/code&gt;, &lt;code&gt;upload&lt;/code&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Extract&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;text&lt;/code&gt;, &lt;code&gt;screenshot&lt;/code&gt;, &lt;code&gt;elements&lt;/code&gt;, &lt;code&gt;eval&lt;/code&gt;, &lt;code&gt;pdf&lt;/code&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Utility&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;wait&lt;/code&gt;, &lt;code&gt;scroll&lt;/code&gt;, &lt;code&gt;pick&lt;/code&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Tabs&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;tab-new&lt;/code&gt;, &lt;code&gt;tab-list&lt;/code&gt;, &lt;code&gt;tab-switch&lt;/code&gt;, &lt;code&gt;tab-close&lt;/code&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Dialogs&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;dialog-status&lt;/code&gt;, &lt;code&gt;dialog-accept&lt;/code&gt;, &lt;code&gt;dialog-dismiss&lt;/code&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Session&lt;/td&gt;
&lt;td&gt;
&lt;code&gt;status&lt;/code&gt;, &lt;code&gt;stop&lt;/code&gt;, &lt;code&gt;session-list&lt;/code&gt;, &lt;code&gt;cookie&lt;/code&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  Integration with AI Tools
&lt;/h2&gt;

&lt;p&gt;Browser-CLI ships with ready-to-use integration files:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;File&lt;/th&gt;
&lt;th&gt;Tool&lt;/th&gt;
&lt;th&gt;How to Use&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;code&gt;integrations/claude/browser.md&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;Claude Code&lt;/td&gt;
&lt;td&gt;Copy to &lt;code&gt;.claude/commands/&lt;/code&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;code&gt;integrations/codex/browser-cli.md&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;OpenAI Codex&lt;/td&gt;
&lt;td&gt;Copy to &lt;code&gt;~/.codex/skills/&lt;/code&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;code&gt;AGENTS.md&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;Cursor, Windsurf&lt;/td&gt;
&lt;td&gt;Already in project root&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;code&gt;skills/browser-cli/SKILL.md&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;GAL&lt;/td&gt;
&lt;td&gt;Copy to &lt;code&gt;~/.gal/skills/&lt;/code&gt;
&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  Real-World Example
&lt;/h2&gt;

&lt;p&gt;Here's how an AI agent can search GitHub and extract results:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# Navigate to GitHub&lt;/span&gt;
browser-cli navigate https://github.com/search?q&lt;span class="o"&gt;=&lt;/span&gt;browser+automation

&lt;span class="c"&gt;# Extract search results&lt;/span&gt;
browser-cli &lt;span class="nb"&gt;eval&lt;/span&gt; &lt;span class="s2"&gt;"JSON.stringify(
  Array.from(document.querySelectorAll('.repo-list-item a.v-align-middle'))
  .map(a =&amp;gt; ({name: a.textContent.trim(), url: a.href}))
)"&lt;/span&gt;

&lt;span class="c"&gt;# Take a screenshot&lt;/span&gt;
browser-cli screenshot github-results.png
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  Architecture
&lt;/h2&gt;

&lt;p&gt;Browser-CLI uses a client-server architecture over Unix sockets:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;AI Agent → shell command → browser-cli (client) → Unix socket → server → Playwright → Browser
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The server auto-starts on first command and stays running. Multiple agents can connect simultaneously with isolated sessions.&lt;/p&gt;

&lt;h2&gt;
  
  
  No CGO Required
&lt;/h2&gt;

&lt;p&gt;Pure Go binary, compiles with &lt;code&gt;CGO_ENABLED=0&lt;/code&gt;:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# Static Linux build&lt;/span&gt;
make build-static

&lt;span class="c"&gt;# Cross-compile for Windows&lt;/span&gt;
&lt;span class="nv"&gt;CGO_ENABLED&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;0 &lt;span class="nv"&gt;GOOS&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;windows &lt;span class="nv"&gt;GOARCH&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;amd64 go build &lt;span class="nt"&gt;-o&lt;/span&gt; browser-cli.exe &lt;span class="nb"&gt;.&lt;/span&gt;

&lt;span class="c"&gt;# Cross-compile for macOS&lt;/span&gt;
&lt;span class="nv"&gt;CGO_ENABLED&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;0 &lt;span class="nv"&gt;GOOS&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;darwin &lt;span class="nv"&gt;GOARCH&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;arm64 go build &lt;span class="nt"&gt;-o&lt;/span&gt; browser-cli-mac &lt;span class="nb"&gt;.&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  Get Started
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;git clone https://github.com/zmysysz/browser-cli
&lt;span class="nb"&gt;cd &lt;/span&gt;browser-cli &lt;span class="o"&gt;&amp;amp;&amp;amp;&lt;/span&gt; make build &lt;span class="o"&gt;&amp;amp;&amp;amp;&lt;/span&gt; make &lt;span class="nb"&gt;install
&lt;/span&gt;make setup-browsers
browser-cli navigate https://example.com
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Star ⭐ the repo if you find it useful! Feedback and contributions welcome.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;This article was drafted with the help of an AI agent — but the tool itself was built by hand. 😉&lt;/em&gt;&lt;/p&gt;

</description>
      <category>go</category>
      <category>ai</category>
      <category>webdev</category>
    </item>
  </channel>
</rss>
