<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Xybrid</title>
    <description>The latest articles on DEV Community by Xybrid (@xybrid).</description>
    <link>https://dev.to/xybrid</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Forganization%2Fprofile_image%2F12929%2F2d642c02-2d61-43d8-b912-2fbe655c4422.png</url>
      <title>DEV Community: Xybrid</title>
      <link>https://dev.to/xybrid</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/xybrid"/>
    <language>en</language>
    <item>
      <title>Run AI Models On-Device — Zero Config, Five Minutes</title>
      <dc:creator>Glenn Sonna</dc:creator>
      <pubDate>Mon, 06 Apr 2026 13:55:40 +0000</pubDate>
      <link>https://dev.to/xybrid/run-ai-models-on-device-zero-config-five-minutes-21k1</link>
      <guid>https://dev.to/xybrid/run-ai-models-on-device-zero-config-five-minutes-21k1</guid>
      <description>&lt;p&gt;You already know why on-device AI matters. Privacy, latency, cost. You've read the guides.&lt;/p&gt;

&lt;p&gt;Now you want to actually do it. Here's what that looks like with &lt;a href="https://github.com/xybrid-ai/xybrid" rel="noopener noreferrer"&gt;Xybrid&lt;/a&gt; — no tensor shapes, no preprocessing scripts, no ML expertise.&lt;/p&gt;




&lt;h2&gt;
  
  
  Install
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# macOS / Linux&lt;/span&gt;
curl &lt;span class="nt"&gt;-sSL&lt;/span&gt; https://raw.githubusercontent.com/xybrid-ai/xybrid/master/install.sh | sh
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;





&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# Windows (PowerShell)&lt;/span&gt;
irm https://raw.githubusercontent.com/xybrid-ai/xybrid/master/install.ps1 | iex
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  Text-to-Speech
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;xybrid run &lt;span class="nt"&gt;--model&lt;/span&gt; kokoro-82m &lt;span class="nt"&gt;--input&lt;/span&gt; &lt;span class="s2"&gt;"Hello from the edge"&lt;/span&gt; &lt;span class="nt"&gt;--output&lt;/span&gt; hello.wav
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;That's it. Xybrid resolved the model from the registry, downloaded it, ran inference, and saved a WAV file. You configured nothing.&lt;/p&gt;

&lt;p&gt;Kokoro is an 82M parameter TTS model with 24 voices. First run downloads ~80MB and caches it locally. Subsequent runs are instant.&lt;/p&gt;

&lt;h2&gt;
  
  
  Speech Recognition
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;xybrid run &lt;span class="nt"&gt;--model&lt;/span&gt; whisper-tiny &lt;span class="nt"&gt;--input&lt;/span&gt; recording.wav
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Whisper Tiny transcribes audio in real-time on any modern laptop. Outputs plain text.&lt;/p&gt;

&lt;h2&gt;
  
  
  Text Generation
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;xybrid run &lt;span class="nt"&gt;--model&lt;/span&gt; qwen3.5-0.8b &lt;span class="nt"&gt;--input&lt;/span&gt; &lt;span class="s2"&gt;"Explain quantum computing in one sentence"&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Qwen 3.5 0.8B runs locally via llama.cpp. 201 languages, fits in 500MB quantized.&lt;/p&gt;

&lt;h2&gt;
  
  
  Browse the Registry
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;xybrid models list
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;25+ models, all hosted on HuggingFace, downloaded on-demand, cached locally:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Model&lt;/th&gt;
&lt;th&gt;Task&lt;/th&gt;
&lt;th&gt;Size&lt;/th&gt;
&lt;th&gt;Notes&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;kokoro-82m&lt;/td&gt;
&lt;td&gt;Text-to-Speech&lt;/td&gt;
&lt;td&gt;82M&lt;/td&gt;
&lt;td&gt;24 voices, high quality&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;kitten-tts-nano-0.8&lt;/td&gt;
&lt;td&gt;Text-to-Speech&lt;/td&gt;
&lt;td&gt;15M&lt;/td&gt;
&lt;td&gt;Ultra-lightweight&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;qwen3-tts-0.6b&lt;/td&gt;
&lt;td&gt;Text-to-Speech&lt;/td&gt;
&lt;td&gt;600M&lt;/td&gt;
&lt;td&gt;Multilingual&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;whisper-tiny&lt;/td&gt;
&lt;td&gt;Speech Recognition&lt;/td&gt;
&lt;td&gt;39M&lt;/td&gt;
&lt;td&gt;Real-time, multilingual&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;wav2vec2-base-960h&lt;/td&gt;
&lt;td&gt;Speech Recognition&lt;/td&gt;
&lt;td&gt;95M&lt;/td&gt;
&lt;td&gt;CTC-based&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;lfm2.5-350m&lt;/td&gt;
&lt;td&gt;Text Generation&lt;/td&gt;
&lt;td&gt;354M&lt;/td&gt;
&lt;td&gt;9 languages, edge-optimized&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;smollm2-360m&lt;/td&gt;
&lt;td&gt;Text Generation&lt;/td&gt;
&lt;td&gt;360M&lt;/td&gt;
&lt;td&gt;Best tiny LLM&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;qwen3.5-0.8b&lt;/td&gt;
&lt;td&gt;Text Generation&lt;/td&gt;
&lt;td&gt;800M&lt;/td&gt;
&lt;td&gt;201 languages&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;gemma-4-e2b&lt;/td&gt;
&lt;td&gt;Text Generation&lt;/td&gt;
&lt;td&gt;5.1B&lt;/td&gt;
&lt;td&gt;Multimodal&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;mistral-7b&lt;/td&gt;
&lt;td&gt;Text Generation&lt;/td&gt;
&lt;td&gt;7B&lt;/td&gt;
&lt;td&gt;Function calling&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  Beyond the CLI
&lt;/h2&gt;

&lt;p&gt;The CLI is the fastest way to evaluate. When you're ready to integrate into an app, Xybrid has SDKs for Flutter, Swift, Kotlin, Unity, and Rust — same models, same behavior, every platform.&lt;/p&gt;




&lt;p&gt;Xybrid is in beta (v0.1.0-beta9), open-source under Apache 2.0.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;GitHub&lt;/strong&gt;: &lt;a href="https://github.com/xybrid-ai/xybrid" rel="noopener noreferrer"&gt;github.com/xybrid-ai/xybrid&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;Questions? Drop them in the comments — happy to help you get running.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>rust</category>
      <category>machinelearning</category>
      <category>opensource</category>
    </item>
  </channel>
</rss>
