<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Shen Huang</title>
    <description>The latest articles on DEV Community by Shen Huang (@shenhuang_).</description>
    <link>https://dev.to/shenhuang_</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F936104%2F47f8fca8-4790-49fe-889b-9f4179202580.jpg</url>
      <title>DEV Community: Shen Huang</title>
      <link>https://dev.to/shenhuang_</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/shenhuang_"/>
    <language>en</language>
    <item>
      <title>I got tired of all the desktop agent tools being macOS only, so I built one for Linux.</title>
      <dc:creator>Shen Huang</dc:creator>
      <pubDate>Sun, 01 Feb 2026 06:26:59 +0000</pubDate>
      <link>https://dev.to/shenhuang_/i-got-tired-of-all-the-desktop-agent-tools-being-macos-only-so-i-built-one-for-linux-4mh5</link>
      <guid>https://dev.to/shenhuang_/i-got-tired-of-all-the-desktop-agent-tools-being-macos-only-so-i-built-one-for-linux-4mh5</guid>
      <description>&lt;p&gt;Like many of you, I’ve been messing around with OpenClaw  (formerly clawdbot) and the whole "vibe coding" concept. It's cool, but finding a decent tool that actually drives the UI on Linux was a pain. Everything seems to be Mac-first right now. &lt;/p&gt;

&lt;p&gt;Since I do all my local inference on Linux, I built a dedicated tool for it.&lt;/p&gt;

&lt;p&gt;It's called &lt;strong&gt;Peepbo&lt;/strong&gt;: &lt;a href="https://github.com/LichAmnesia/peepbo" rel="noopener noreferrer"&gt;https://github.com/LichAmnesia/peepbo&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Basically, it's a lightweight Node/TS wrapper that connects your local VLM (LLaVA, Qwen-VL, etc.) to your Linux desktop environment.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;How it works:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Vision:&lt;/strong&gt; Wraps &lt;code&gt;scrot&lt;/code&gt;, &lt;code&gt;gnome-screenshot&lt;/code&gt;, or &lt;code&gt;gdbus&lt;/code&gt; so the model can see the screen.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Control:&lt;/strong&gt; Uses &lt;code&gt;xdotool&lt;/code&gt; to handle mouse/keyboard inputs.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Wayland:&lt;/strong&gt; Yes, it works on GNOME Wayland, but you'll need to run in unsafe mode (details in the readme).&lt;/li&gt;
&lt;/ul&gt;
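
&lt;p&gt;To give a rough idea of what that wrapping looks like, here's a minimal sketch. It is not Peepbo's actual code: the &lt;code&gt;Action&lt;/code&gt; type and the &lt;code&gt;toCommand&lt;/code&gt; helper are made up for illustration, and it only covers the &lt;code&gt;scrot&lt;/code&gt; and &lt;code&gt;xdotool&lt;/code&gt; paths on X11.&lt;/p&gt;

```typescript
// Hypothetical action shape for illustration; Peepbo's real types may differ.
type Action =
  | { kind: "screenshot"; path: string }
  | { kind: "click"; x: number; y: number }
  | { kind: "type"; text: string };

// Translate a model-chosen action into the argv of the CLI tool that performs it.
function toCommand(action: Action): string[] {
  switch (action.kind) {
    case "screenshot":
      // scrot writes a full-screen capture to the given file.
      return ["scrot", action.path];
    case "click":
      // Move the pointer, then press the left mouse button (button 1).
      return ["xdotool", "mousemove", String(action.x), String(action.y), "click", "1"];
    case "type":
      // Send the text as keystrokes to the focused window.
      return ["xdotool", "type", action.text];
  }
}
```

&lt;p&gt;The returned array can be handed to something like Node's &lt;code&gt;child_process.execFile&lt;/code&gt; (first element as the binary, the rest as arguments), which keeps the shell out of the picture so you don't have to worry about quoting.&lt;/p&gt;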

&lt;p&gt;It's open source. Give it a shot if you're trying to build agents on Linux and let me know if it breaks anything.&lt;/p&gt;

</description>
      <category>agents</category>
      <category>ai</category>
      <category>linux</category>
      <category>showdev</category>
    </item>
  </channel>
</rss>
