<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: David Laurenvil</title>
    <description>The latest articles on DEV Community by David Laurenvil (@david_laurenvil_5d43bea48).</description>
    <link>https://dev.to/david_laurenvil_5d43bea48</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3943337%2F89e8cbdb-5378-49d6-8325-f688f5fd73ce.jpg</url>
      <title>DEV Community: David Laurenvil</title>
      <link>https://dev.to/david_laurenvil_5d43bea48</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/david_laurenvil_5d43bea48"/>
    <language>en</language>
    <item>
      <title>I Built The UnoQ's Claw: A Tiny Agentic AI Assistant That Lives Inside an Arduino Uno Q</title>
      <dc:creator>David Laurenvil</dc:creator>
      <pubDate>Thu, 21 May 2026 05:42:00 +0000</pubDate>
      <link>https://dev.to/david_laurenvil_5d43bea48/i-built-the-unoqs-claw-a-tiny-agentic-ai-assistant-that-lives-inside-an-arduino-uno-q-g48</link>
      <guid>https://dev.to/david_laurenvil_5d43bea48/i-built-the-unoqs-claw-a-tiny-agentic-ai-assistant-that-lives-inside-an-arduino-uno-q-g48</guid>
      <description>&lt;p&gt;Every "AI on hardware" demo you have ever seen has a LLM behind it. The user talks to a board via a terminal or Telegram, and the board calls an API to have a cloud model do the work. QClaw flips that arrangement. The Arduino Uno Q hosts the language model, runs the agent loop, drives the compile toolchain, and flashes its own microcontroller. &lt;/p&gt;

&lt;p&gt;Ask QClaw to scroll "QClaw" across the LED matrix and it does. End to end. On the board. Offline. &lt;/p&gt;

&lt;p&gt;QClaw has an eight-tool agentic surface, a fifteen-skill pre-router, and a direct OpenOCD flash route that makes autonomous uploads actually execute. A dual-path runtime lets you pick speed or full hardware control on the same model. &lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Why the Uno Q Is the Right Board for This&lt;/strong&gt; &lt;/p&gt;

&lt;p&gt;The Arduino Uno Q is a split-silicon device. It looks like a classic Arduino on the outside, but it is two boards in a trench-coat: &lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fh2tr6jh65wq303cwsaec.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fh2tr6jh65wq303cwsaec.png" alt=" " width="799" height="436"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The MPU and MCU share the same PCB. The MPU can hold the MCU in reset and reprogram its flash directly through GPIO pins wired to SWD via the &lt;code&gt;linuxgpiod&lt;/code&gt; driver. No USB cable between them. No probe. No second machine. That is the genius of QClaw. &lt;/p&gt;

&lt;p&gt;The agentic loop orchestrates the full sketch lifecycle across the Arduino Uno Q's dual-silicon topology, with the MPU driving the loop and the MCU executing the resulting firmware. This is how QClaw generates, compiles, flashes, and observes. &lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fbi5nwiprupq93ivozb4m.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fbi5nwiprupq93ivozb4m.png" alt=" " width="799" height="436"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;The QClaw &lt;code&gt;arduino&lt;/code&gt; tool invokes OpenOCD directly at the correct address. The tool compiles with &lt;code&gt;arduino-cli compile --fqbn arduino:zephyr:unoq --export-binaries&lt;/code&gt;, picks up the resulting &lt;code&gt;.elf-zsk.bin&lt;/code&gt;, and pipes it through OpenOCD over the GPIO SWD bridge. No SSH, no network credentials, no remote OCD tunnel. Just MPU to MCU, on the same board. Sub-second flash once the binary is on disk. &lt;/p&gt;

&lt;p&gt;Four gigs of RAM is also more than enough to host a Qwen3.5 0.8B Q4_0 model with an 8K context window, mlocked, with q8_0 KV cache. QClaw lives on the Uno Q at around 1.3 GB. Decode runs at roughly 8 tokens per second. Slow next to a desktop GPU, fast enough that a sketch compiles and flashes before you have finished your coffee. &lt;/p&gt;

&lt;p&gt;*&lt;em&gt;How To Use QClaw *&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;QClaw ships two runtimes on top of the same llama-server backend, the same &lt;code&gt;SOUL.md&lt;/code&gt;, and the same 23-rule pre-router. They differ in what wraps the LLM call. &lt;/p&gt;

&lt;p&gt;Agentic path (&lt;code&gt;make qclaw-agentic&lt;/code&gt;). the qclaw Go gateway sits in front of the model. It runs channel adapters (terminal, SSH, Telegram), the multi-iteration agent loop, the pre-router, and the eight-tool dispatcher. This is the production default. It is the only path that can actually compile and flash a sketch. &lt;br&gt;
Direct path (&lt;code&gt;make qclaw-direct&lt;/code&gt;). A thin Python REPL POSTs directly to &lt;code&gt;llama-server&lt;/code&gt; after running the same pre-router rules in Python. No loop, no tools, no Telegram. About 33 percent lower latency on pure factual prompts at equivalent correctness, because there is no tool schema in the prompt and no second iteration.&lt;/p&gt;

&lt;p&gt;Use the agentic path when you want a sketch flashed or a frame captured. Use the direct path when you just want to ask which pins on the Uno Q do PWM. &lt;/p&gt;

&lt;p&gt;Drop in two commands and you have a session:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;git clone https://github.com/laurenvil/Uno-QClaw.git ~/ArduinoApps/QClaw     

&lt;span class="nb"&gt;cd&lt;/span&gt; ~/ArduinoApps/QClaw     

git submodule update &lt;span class="nt"&gt;--init&lt;/span&gt; &lt;span class="nt"&gt;--recursive&lt;/span&gt;     

&lt;span class="c"&gt;# Download the inference engine     &lt;/span&gt;

&lt;span class="nb"&gt;cd &lt;/span&gt;yzma &lt;span class="o"&gt;&amp;amp;&amp;amp;&lt;/span&gt; make download-llama.cpp &lt;span class="o"&gt;&amp;amp;&amp;amp;&lt;/span&gt; &lt;span class="nb"&gt;cd&lt;/span&gt; ..     

&lt;span class="c"&gt;# Download the model (~490 MB for Q4_0)     &lt;/span&gt;

&lt;span class="nb"&gt;mkdir&lt;/span&gt; &lt;span class="nt"&gt;-p&lt;/span&gt; ~/models     

wget &lt;span class="nt"&gt;-O&lt;/span&gt; ~/models/Qwen_Qwen3.5-0.8B-Q4_0.gguf &lt;span class="se"&gt;\ &lt;/span&gt;    

     &lt;span class="s1"&gt;'https://huggingface.co/Qwen/Qwen3.5-0.8B-GGUF/resolve/main/Qwen3.5-0.8B-Q4_0.gguf'&lt;/span&gt;     

&lt;span class="c"&gt;# Build, install arduino-cli, configure (one time)     &lt;/span&gt;

make qclaw-install     

&lt;span class="c"&gt;# Start a session — pick a path     &lt;/span&gt;

make qclaw-agentic    &lt;span class="c"&gt;# full agent loop + 8 tools (compile/upload/camera/sysfs_led/network/i2cdetect)     &lt;/span&gt;

make qclaw-direct     &lt;span class="c"&gt;# pre-router + direct API (fast Q&amp;amp;A, no tools)&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;code&gt;make qclaw-install&lt;/code&gt; builds the Go binary, copies the system prompt and the fifteen-skill tree into &lt;code&gt;~/.qclaw/workspace/&lt;/code&gt;, installs &lt;code&gt;arduino-cli&lt;/code&gt; plus the &lt;code&gt;arduino:zephyr&lt;/code&gt; core, and runs an interactive wizard that sets up the optional Telegram gateway. &lt;/p&gt;

&lt;p&gt;Once it is running, the agent has eight narrowly-scoped tools available: &lt;/p&gt;

&lt;p&gt;&lt;code&gt;read_file&lt;/code&gt;, &lt;code&gt;write_file&lt;/code&gt;, &lt;code&gt;list_dir&lt;/code&gt; for workspace navigation &lt;br&gt;
&lt;code&gt;arduino&lt;/code&gt; for compile and flash via OpenOCD &lt;br&gt;
&lt;code&gt;camera&lt;/code&gt; for single-frame V4L2 capture through GStreamer &lt;br&gt;
&lt;code&gt;sysfs_led&lt;/code&gt; for the MPU-side RGB LEDs at &lt;code&gt;/sys/class/leds/*&lt;/code&gt; &lt;br&gt;
&lt;code&gt;network&lt;/code&gt; for hostname, interfaces, and the default gateway, all read-only stdlib Go &lt;br&gt;
&lt;code&gt;i2cdetect&lt;/code&gt; for listing and scanning Linux I²C buses with &lt;code&gt;-y -r&lt;/code&gt; only &lt;br&gt;
No general &lt;code&gt;exec&lt;/code&gt;. No general shell. Every tool validates its arguments against an allow-list. The total tool schema is around 3.4K characters, leaving plenty of room for the system prompt at an 8K context window. &lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The Pre-Router: Skills, Not RAG&lt;/strong&gt; &lt;/p&gt;

&lt;p&gt;The pre-router is the part of QClaw that does the heavy lifting on a 0.8B model. It is not RAG. It is a flat table of 23 keyword regex rules across 15 skills. When you send a message, the pre-router scans it, finds matching rules, and inlines the relevant &lt;code&gt;SKILL.md&lt;/code&gt; plus its referenced files directly into the system prompt before the LLM call. &lt;/p&gt;

&lt;p&gt;The model never has to call &lt;code&gt;read_file&lt;/code&gt; for canonical skill content. The content is already there. At 0.8B scale, a &lt;code&gt;read_file&lt;/code&gt; call costs a full LLM iteration, roughly 10 to 20 minutes of cold prefill plus decode. The pre-router amortizes that to zero. &lt;/p&gt;

&lt;p&gt;*&lt;em&gt;The skills cover: *&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Sketch fundamentals: blink, breathe, button, potentiometer, servo, compile and upload, CAN bus, DAC, OPAMP &lt;br&gt;
The 13x8 LED matrix with the canonical Arduino_LED_Matrix template &lt;br&gt;
Uno Q hardware: pin tables, voltage rules, connectors, power &lt;br&gt;
Dual-chip workflow: Bridge RPC, App Lab, Bricks &lt;br&gt;
Linux-side capabilities: Wi-Fi, Bluetooth, camera, OpenCV, microphone, sysfs LEDs &lt;br&gt;
Plug-and-play Modulino sensors &lt;br&gt;
Each skill is just a directory under &lt;code&gt;workspace/skills/&amp;lt;name&amp;gt;/&lt;/code&gt; with a &lt;code&gt;SKILL.md&lt;/code&gt; and optional reference files. Adding a new one is a matter of writing the markdown and adding a regex rule. &lt;/p&gt;

&lt;p&gt;*&lt;em&gt;Try It *&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Repo:&lt;/strong&gt;&lt;a href="https://github.com/laurenvil/Uno-QClaw" rel="noopener noreferrer"&gt;https://github.com/laurenvil/Uno-QClaw&lt;/a&gt; &lt;/p&gt;

&lt;p&gt;Issues, forks, and pull requests are welcome at &lt;a href="https://github.com/laurenvil/Uno-QClaw" rel="noopener noreferrer"&gt;https://github.com/laurenvil/Uno-QClaw&lt;/a&gt;. If you have an Arduino Uno Q on your desk, you can have a self-flashing AI assistant sitting on it tonight, with the Ethernet cable unplugged. &lt;/p&gt;

</description>
      <category>arduino</category>
      <category>ai</category>
      <category>vibecoding</category>
      <category>iot</category>
    </item>
  </channel>
</rss>
