<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Taylor</title>
    <description>The latest articles on DEV Community by Taylor (@cerberusai).</description>
    <link>https://dev.to/cerberusai</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3960651%2Fb40c6b21-ce18-4789-b7dc-69c5728c6a32.png</url>
      <title>DEV Community: Taylor</title>
      <link>https://dev.to/cerberusai</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/cerberusai"/>
    <language>en</language>
    <item>
      <title>Anti Refusal LLM Service</title>
      <dc:creator>Taylor</dc:creator>
      <pubDate>Sun, 31 May 2026 02:24:48 +0000</pubDate>
      <link>https://dev.to/cerberusai/anti-refusal-llm-service-478o</link>
      <guid>https://dev.to/cerberusai/anti-refusal-llm-service-478o</guid>
      <description>&lt;p&gt;I Built a 12MB Desktop App for Running Uncensored AI Models Locally (Tauri + Rust + Ollama) published: true description: How I built Cerberus AI — a local-first desktop app that auto-detects your GPU, pulls the right model quantization, and gives you uncensored AI chat without sending a single prompt to the cloud. Every major language model ships with an alignment layer that refuses certain prompts. Sometimes that's reasonable. Sometimes you're a security researcher, a creative writer, or just someone who doesn't want a corporation deciding what questions you're allowed to ask.&lt;/p&gt;

&lt;p&gt;I built Cerberus AI to fix that — and to make the whole experience local-first, lightweight, and dead simple to install.&lt;/p&gt;

&lt;p&gt;What Is Cerberus AI?&lt;br&gt;
Cerberus AI is a platform for running open-weight, refusal-ablated language models on your own hardware. It has three parts:&lt;/p&gt;

&lt;p&gt;A native desktop app (~12 MB) built with Tauri + Rust — not Electron&lt;br&gt;
Open-weight GGUF models hosted on a public CDN&lt;br&gt;
An OpenAI-compatible managed API for when you don't want to run local&lt;br&gt;
The desktop app integrates directly with Ollama, auto-detects your GPU VRAM, and recommends the right model quantization for your hardware. From 4 GB laptops to 24 GB workstations, it just works.&lt;/p&gt;

&lt;p&gt;Cerberus AI Desktop Chat&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fh0ubd683daurj1ar4ict.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fh0ubd683daurj1ar4ict.png" alt=" "&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2y91v9w6v394c21xa72s.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2F2y91v9w6v394c21xa72s.png" alt=" "&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwf0787tg2gmr9r5ubrtx.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Farticles%2Fwf0787tg2gmr9r5ubrtx.png" alt=" "&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;What Is Refusal Ablation?&lt;br&gt;
This is the core technical innovation behind Cerberus models. Here's the short version:&lt;/p&gt;

&lt;p&gt;Language models learn a refusal direction in their activation space during alignment training. When a prompt triggers this direction, the model produces refusal text ("I can't help with that") regardless of whether the underlying model actually lacks the knowledge.&lt;/p&gt;

&lt;p&gt;Refusal ablation surgically removes this direction from the model weights. The technique:&lt;/p&gt;

&lt;p&gt;Identifies the refusal direction vector in the model's residual stream&lt;br&gt;
Projects it out of the weight matrices&lt;br&gt;
Preserves all other reasoning capabilities&lt;br&gt;
The result is a model that treats every prompt equally. No refusals. No moralizing. Just direct, unfiltered output from the model's actual knowledge.&lt;/p&gt;

&lt;p&gt;We apply this to multiple base architectures:&lt;/p&gt;

&lt;p&gt;Model   Base    Parameters  Use Case&lt;br&gt;
Cerberus 4B v2  Qwen 3.5    4B  General purpose, fits on 4-8 GB GPU&lt;br&gt;
Arbiter GL9b    GLM-4   9B  Heavier reasoning, needs 6+ GB&lt;br&gt;
Gamma3 1B BDPO  Custom  1B  Edge devices, CPU-only inference&lt;br&gt;
All models are distributed as GGUF files — the same format llama.cpp uses. Download once, run anywhere.&lt;/p&gt;

&lt;p&gt;Why Tauri Instead of Electron?&lt;br&gt;
Electron bundles an entire Chromium browser. That's 150+ MB just for the runtime. For a chat app that talks to a local Ollama instance, that's absurd.&lt;/p&gt;

&lt;p&gt;Tauri uses your system's existing WebView (WebView2 on Windows) and writes the backend in Rust. The result:&lt;/p&gt;

&lt;p&gt;~12 MB installer vs 150+ MB for Electron&lt;br&gt;
Native performance — the Rust backend handles hardware detection, Ollama management, and streaming&lt;br&gt;
Lower memory footprint — no spare Chrome process eating your RAM&lt;br&gt;
System-level integration — direct access to GPU info, filesystem, and process management&lt;br&gt;
The frontend is Vue 3 + Vite, but the heavy lifting (VRAM detection, model management, MCP plugin orchestration) all happens in Rust.&lt;/p&gt;

&lt;p&gt;Install in 30 Seconds&lt;br&gt;
On Windows, open PowerShell and run:&lt;/p&gt;

&lt;p&gt;powershell&lt;/p&gt;

&lt;p&gt;&lt;code&gt;irm https://cerberusai.dev/get | iex&lt;br&gt;
&lt;/code&gt;This single command:&lt;/p&gt;

&lt;p&gt;Checks for (and installs) WebView2 if missing&lt;br&gt;
Installs Ollama if missing&lt;br&gt;
Detects your GPU VRAM&lt;br&gt;
Downloads the recommended model quantization&lt;br&gt;
Installs the Cerberus desktop app&lt;br&gt;
Launches it&lt;br&gt;
You paste your API key (free to generate at access.cerberusai.dev) and you're chatting locally.&lt;/p&gt;

&lt;p&gt;The API — OpenAI-Compatible&lt;br&gt;
Don't want to run local? Use the managed API. It follows the OpenAI chat completions shape, so any existing OpenAI SDK code works with a one-line change:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="n"&gt;python&lt;/span&gt;

&lt;span class="kn"&gt;from&lt;/span&gt; &lt;span class="n"&gt;openai&lt;/span&gt; &lt;span class="kn"&gt;import&lt;/span&gt; &lt;span class="n"&gt;OpenAI&lt;/span&gt;
&lt;span class="n"&gt;client&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="nc"&gt;OpenAI&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;
    &lt;span class="n"&gt;base_url&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;https://api.cerberusai.dev/v1&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="n"&gt;api_key&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;YOUR_API_KEY&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;
&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="n"&gt;response&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="n"&gt;client&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;chat&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;completions&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;create&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;
    &lt;span class="n"&gt;model&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;cerberus-4b-v2-abliterated&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="n"&gt;messages&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;
        &lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;role&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;system&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;content&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;You are Cerberus, an unfiltered AI.&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;},&lt;/span&gt;
        &lt;span class="p"&gt;{&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;role&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;user&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;content&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;Explain how refusal ablation works&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;}&lt;/span&gt;
    &lt;span class="p"&gt;],&lt;/span&gt;
    &lt;span class="n"&gt;stream&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="bp"&gt;True&lt;/span&gt;
&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="k"&gt;for&lt;/span&gt; &lt;span class="n"&gt;chunk&lt;/span&gt; &lt;span class="ow"&gt;in&lt;/span&gt; &lt;span class="n"&gt;response&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;
    &lt;span class="k"&gt;if&lt;/span&gt; &lt;span class="n"&gt;chunk&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;choices&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="mi"&gt;0&lt;/span&gt;&lt;span class="p"&gt;].&lt;/span&gt;&lt;span class="n"&gt;delta&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;content&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;
        &lt;span class="nf"&gt;print&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;chunk&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;choices&lt;/span&gt;&lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="mi"&gt;0&lt;/span&gt;&lt;span class="p"&gt;].&lt;/span&gt;&lt;span class="n"&gt;delta&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;content&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;end&lt;/span&gt;&lt;span class="o"&gt;=&lt;/span&gt;&lt;span class="sh"&gt;""&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
&lt;span class="n"&gt;Streaming&lt;/span&gt; &lt;span class="n"&gt;via&lt;/span&gt; &lt;span class="n"&gt;SSE&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;standard&lt;/span&gt; &lt;span class="n"&gt;error&lt;/span&gt; &lt;span class="nf"&gt;codes &lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="mi"&gt;401&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;402&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="mi"&gt;429&lt;/span&gt;&lt;span class="p"&gt;),&lt;/span&gt; &lt;span class="ow"&gt;and&lt;/span&gt; &lt;span class="n"&gt;a&lt;/span&gt; &lt;span class="n"&gt;public&lt;/span&gt; &lt;span class="n"&gt;model&lt;/span&gt; &lt;span class="n"&gt;CDN&lt;/span&gt; &lt;span class="n"&gt;at&lt;/span&gt; &lt;span class="n"&gt;llm&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;cerberusai&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="n"&gt;dev&lt;/span&gt; &lt;span class="n"&gt;that&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;s fully CORS-enabled — you can even fetch model metadata from browser-based apps.

curl Example
bash

curl -X POST https://api.cerberusai.dev/v1/chat/completions &lt;/span&gt;&lt;span class="se"&gt;\
&lt;/span&gt;&lt;span class="s"&gt;  -H &lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;Authorization: Bearer YOUR_API_KEY&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt; &lt;/span&gt;&lt;span class="se"&gt;\
&lt;/span&gt;&lt;span class="s"&gt;  -H &lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;Content-Type: application/json&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt; &lt;/span&gt;&lt;span class="se"&gt;\
&lt;/span&gt;&lt;span class="s"&gt;  -d &lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="p"&gt;{&lt;/span&gt;
    &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;model&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;cerberus-4b-v2-abliterated&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt;
    &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;messages&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="p"&gt;[{&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;role&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;user&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;content&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;Hello&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;}],&lt;/span&gt;
    &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;stream&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt; &lt;span class="n"&gt;false&lt;/span&gt;
  &lt;span class="p"&gt;}&lt;/span&gt;&lt;span class="sh"&gt;'&lt;/span&gt;&lt;span class="s"&gt;

&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Model Downloads — Public CDN&lt;br&gt;
All GGUF model files are hosted on llm.cerberusai.dev with a public JSON API:&lt;/p&gt;

&lt;p&gt;bash&lt;/p&gt;

&lt;h1&gt;
  
  
  List all available models
&lt;/h1&gt;

&lt;p&gt;curl &lt;a href="https://llm.cerberusai.dev/api/models/" rel="noopener noreferrer"&gt;https://llm.cerberusai.dev/api/models/&lt;/a&gt;&lt;/p&gt;

&lt;h1&gt;
  
  
  List files for a specific model (with exact byte sizes)
&lt;/h1&gt;

&lt;p&gt;curl &lt;a href="https://llm.cerberusai.dev/api/models/cerberus-4b-v2-abliterated/" rel="noopener noreferrer"&gt;https://llm.cerberusai.dev/api/models/cerberus-4b-v2-abliterated/&lt;/a&gt;&lt;/p&gt;

&lt;h1&gt;
  
  
  Download a specific quantization
&lt;/h1&gt;

&lt;p&gt;wget &lt;a href="https://llm.cerberusai.dev/models/cerberus-4b-v2-abliterated/cerberus-4b-v2-abliterated-Q4_K_M.gguf" rel="noopener noreferrer"&gt;https://llm.cerberusai.dev/models/cerberus-4b-v2-abliterated/cerberus-4b-v2-abliterated-Q4_K_M.gguf&lt;/a&gt;&lt;/p&gt;

&lt;h1&gt;
  
  
  Resume interrupted downloads
&lt;/h1&gt;

&lt;p&gt;wget -c &lt;a href="https://llm.cerberusai.dev/models/Arbiter-GL9b/Arbiter-GL9b-Q8_0.gguf" rel="noopener noreferrer"&gt;https://llm.cerberusai.dev/models/Arbiter-GL9b/Arbiter-GL9b-Q8_0.gguf&lt;/a&gt;&lt;br&gt;
Range requests are supported, CORS is enabled for all origins, and GGUF files are served with proper Content-Disposition: attachment headers.&lt;/p&gt;

&lt;p&gt;Built-In Features&lt;br&gt;
Beyond chat, the desktop app includes:&lt;/p&gt;

&lt;p&gt;Model Manager — browse local Ollama models, pull from the Cerberus cloud catalog, import raw GGUF files, switch active models, see disk usage&lt;br&gt;
MCP Plugin System — browse and install Model Context Protocol plugins from inside the app. There's also a public MCP Skills Server at api.cerberusai.dev/skills-sse&lt;br&gt;
Hardware Monitoring — CPU, RAM, and VRAM activity displayed in the interface&lt;br&gt;
Zero Telemetry — no prompts leave your machine during local inference. No analytics. No phone-home.&lt;br&gt;
Pricing&lt;br&gt;
Every account gets 50,000 free monthly credits. That's enough for casual use and testing.&lt;/p&gt;

&lt;p&gt;If you need more:&lt;/p&gt;

&lt;p&gt;Plan    Price   Monthly Credits&lt;br&gt;
Free    $0  50,000&lt;br&gt;
Lite    $8/mo   300,000&lt;br&gt;
Mid $15/mo  900,000&lt;br&gt;
Exp $22/mo  2,000,000&lt;br&gt;
One-time top-ups start at $5 (125,000 credits). Stripe and PayPal supported. The free tier has no time limit — it refreshes every month.&lt;/p&gt;

&lt;p&gt;Local inference through Ollama costs zero credits. Credits only apply to the managed API.&lt;/p&gt;

&lt;p&gt;Try It&lt;br&gt;
🌐 Website: cerberusai.dev&lt;br&gt;
📦 GitHub: github.com/tjcrims0nx/CerberusAI-Desktop&lt;br&gt;
🧠 Models: llm.cerberusai.dev&lt;br&gt;
📖 API Docs: cerberusai.dev/docs/api&lt;br&gt;
💬 Discord: discord.gg/YdVj7hEtv5&lt;br&gt;
🔑 Get API Key: access.cerberusai.dev&lt;br&gt;
If you've ever been frustrated by a language model refusing a perfectly reasonable prompt, or if you just want to run AI locally without cloud dependencies — give Cerberus a try. The install is one command, the free tier is permanent, and the weights are open.&lt;/p&gt;

&lt;p&gt;I'd love to hear feedback. Drop into the Discord or open an issue on GitHub.&lt;/p&gt;

&lt;p&gt;Cerberus AI is an open-weight project. The desktop app source is on GitHub. Models are distributed as GGUF under open licenses. The managed API is a pay-as-you-go service.****&lt;/p&gt;

</description>
    </item>
  </channel>
</rss>
