<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: carlosortet</title>
    <description>The latest articles on DEV Community by carlosortet (@carlosortet).</description>
    <link>https://dev.to/carlosortet</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3731777%2F134ab90c-169c-476a-8ce9-83665d0c706b.png</url>
      <title>DEV Community: carlosortet</title>
      <link>https://dev.to/carlosortet</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/carlosortet"/>
    <language>en</language>
    <item>
      <title>From expensive tokens to intelligent compression: how we optimize LLM costs in production</title>
      <dc:creator>carlosortet</dc:creator>
      <pubDate>Thu, 26 Mar 2026 09:24:16 +0000</pubDate>
      <link>https://dev.to/carlosortet/from-expensive-tokens-to-intelligent-compression-how-we-optimize-llm-costs-in-production-18ig</link>
      <guid>https://dev.to/carlosortet/from-expensive-tokens-to-intelligent-compression-how-we-optimize-llm-costs-in-production-18ig</guid>
      <description>&lt;p&gt;We spend absurd amounts on AI tokens. And that number is only going up.&lt;/p&gt;

&lt;p&gt;At 498Advance we run multiple LLMs in production — Claude for development, Gemini for multimodal, DeepSeek and OpenAI models locally for routine tasks. Every model does something well and fails at something else. That is why they coexist.&lt;/p&gt;

&lt;p&gt;But this creates a problem: &lt;strong&gt;dependency and cost&lt;/strong&gt;. What happens when a provider goes down? What happens when pricing changes overnight?&lt;/p&gt;

&lt;p&gt;Here is how we deal with it, and why a new Google Research paper caught our attention this week.&lt;/p&gt;

&lt;h2&gt;
  
  
  Layer 1: Fallback policies
&lt;/h2&gt;

&lt;p&gt;If a model fails, the system automatically redirects to the next available model. No human intervention, no perceptible downtime.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight python"&gt;&lt;code&gt;&lt;span class="c1"&gt;# Simplified fallback logic
&lt;/span&gt;&lt;span class="n"&gt;models&lt;/span&gt; &lt;span class="o"&gt;=&lt;/span&gt; &lt;span class="p"&gt;[&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;claude-opus&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;gpt-4o&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;gemini-pro&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="s"&gt;deepseek-local&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;]&lt;/span&gt;

&lt;span class="k"&gt;def&lt;/span&gt; &lt;span class="nf"&gt;inference&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;prompt&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;task_type&lt;/span&gt;&lt;span class="p"&gt;):&lt;/span&gt;
    &lt;span class="k"&gt;for&lt;/span&gt; &lt;span class="n"&gt;model&lt;/span&gt; &lt;span class="ow"&gt;in&lt;/span&gt; &lt;span class="nf"&gt;get_ranked_models&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;task_type&lt;/span&gt;&lt;span class="p"&gt;):&lt;/span&gt;
        &lt;span class="k"&gt;try&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;
            &lt;span class="k"&gt;return&lt;/span&gt; &lt;span class="nf"&gt;call_model&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="n"&gt;model&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="n"&gt;prompt&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
        &lt;span class="k"&gt;except&lt;/span&gt; &lt;span class="n"&gt;ModelUnavailable&lt;/span&gt;&lt;span class="p"&gt;:&lt;/span&gt;
            &lt;span class="n"&gt;log&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;warning&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="sa"&gt;f&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="si"&gt;{&lt;/span&gt;&lt;span class="n"&gt;model&lt;/span&gt;&lt;span class="si"&gt;}&lt;/span&gt;&lt;span class="s"&gt; unavailable, falling back&lt;/span&gt;&lt;span class="sh"&gt;"&lt;/span&gt;&lt;span class="p"&gt;)&lt;/span&gt;
            &lt;span class="k"&gt;continue&lt;/span&gt;
    &lt;span class="k"&gt;raise&lt;/span&gt; &lt;span class="nc"&gt;AllModelsUnavailable&lt;/span&gt;&lt;span class="p"&gt;()&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Simple but effective. The key is having your models ranked per task type, not globally.&lt;/p&gt;

&lt;h2&gt;
  
  
  Layer 2: Router shadow
&lt;/h2&gt;

&lt;p&gt;Not every task needs a frontier model. A two-line summary does not need Claude Opus. A 50-page legal analysis does.&lt;/p&gt;

&lt;p&gt;Router shadow evaluates each incoming task and routes it to the optimal model based on complexity and cost. We categorize tasks into tiers:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Tier&lt;/th&gt;
&lt;th&gt;Task type&lt;/th&gt;
&lt;th&gt;Model class&lt;/th&gt;
&lt;th&gt;Cost&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;1&lt;/td&gt;
&lt;td&gt;Simple extraction, formatting&lt;/td&gt;
&lt;td&gt;Local (DeepSeek 7B)&lt;/td&gt;
&lt;td&gt;~$0&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;2&lt;/td&gt;
&lt;td&gt;Summarization, translation&lt;/td&gt;
&lt;td&gt;Mid-tier API (Haiku, Flash)&lt;/td&gt;
&lt;td&gt;Low&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;3&lt;/td&gt;
&lt;td&gt;Complex analysis, code generation&lt;/td&gt;
&lt;td&gt;Frontier (Opus, GPT-4o)&lt;/td&gt;
&lt;td&gt;High&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;The result: cost optimization per project without sacrificing quality where it matters.&lt;/p&gt;

&lt;h2&gt;
  
  
  Layer 3: Local models
&lt;/h2&gt;

&lt;p&gt;At 498Advance we have been running &lt;strong&gt;DeepSeek&lt;/strong&gt; and &lt;strong&gt;OpenAI&lt;/strong&gt; models locally for three months. They handle a significant portion of production tasks.&lt;/p&gt;

&lt;p&gt;The benefits go beyond cost:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Security&lt;/strong&gt;: data never leaves your infrastructure&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Compliance&lt;/strong&gt;: concrete guarantees about where data is processed&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Latency&lt;/strong&gt;: no network round-trip for simple tasks&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Availability&lt;/strong&gt;: no dependency on external uptime&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The trade-off: local models are not frontier models. You lose capability on complex tasks. The strategy is selective migration — identify what can run locally, move it, keep frontier for what needs it.&lt;/p&gt;

&lt;h2&gt;
  
  
  The compression landscape
&lt;/h2&gt;

&lt;p&gt;At some point, better hardware is not enough. You need &lt;strong&gt;efficiency&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;LLMs keep growing — tens or hundreds of billions of parameters. The compression techniques that make them deployable:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Quantization&lt;/strong&gt; reduces weight precision. A quantized Llama 70B fits on 1 NVIDIA A100. Unquantized, it needs 4.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Pruning&lt;/strong&gt; removes low-relevance weights. 2:4 Sparse Llama achieved 98.4% accuracy recovery on the Open LLM Leaderboard V1, with +30% throughput and -20% latency from sparsity alone.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Knowledge distillation&lt;/strong&gt; trains a small student model to replicate a large teacher's behavior.&lt;/p&gt;

&lt;p&gt;These are not mutually exclusive. Sparsity + quantization yields improvements greater than either alone.&lt;/p&gt;

&lt;h3&gt;
  
  
  Real-world examples
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;LinkedIn&lt;/strong&gt; built domain-adapted EON models on open source LLMs with proprietary data, reducing prompt size by 30%.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Roblox&lt;/strong&gt; scaled from &amp;lt;50 to ~250 concurrent ML inference pipelines using Ray and vLLM.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Red Hat&lt;/strong&gt; maintains pre-optimized models on Hugging Face (Llama, Qwen, DeepSeek, Granite) — quantized and ready for inference with vLLM.&lt;/p&gt;

&lt;h2&gt;
  
  
  TurboQuant: the paper that caught our attention
&lt;/h2&gt;

&lt;p&gt;On March 24, 2026, Google Research published &lt;strong&gt;TurboQuant&lt;/strong&gt; (ICLR 2026). Authors: Amir Zandieh and Vahab Mirrokni.&lt;/p&gt;

&lt;p&gt;The headline numbers:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;6x minimum&lt;/strong&gt; KV cache memory reduction&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;8x speedup&lt;/strong&gt; with 4-bit quantization on H100 GPUs&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;3-bit&lt;/strong&gt; KV cache quantization with &lt;strong&gt;zero accuracy loss&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;No fine-tuning or retraining required&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Why it matters technically
&lt;/h3&gt;

&lt;p&gt;Traditional quantization has a memory overhead problem. Most methods need to store quantization constants for each data block, adding 1-2 extra bits per number. TurboQuant eliminates this.&lt;/p&gt;

&lt;p&gt;It combines two algorithms:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;PolarQuant&lt;/strong&gt; converts vectors from Cartesian to polar coordinates. Instead of normalizing data on a shifting square grid, it maps to a fixed circular grid where boundaries are known. This eliminates the normalization overhead.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;QJL (Quantized Johnson-Lindenstrauss)&lt;/strong&gt; compresses the residual error from PolarQuant to a single sign bit (+1 or -1) using the JL Transform. Zero memory overhead.&lt;/p&gt;

&lt;p&gt;The pipeline:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;PolarQuant compresses with most of the bits&lt;/li&gt;
&lt;li&gt;QJL uses 1 bit to correct residual bias&lt;/li&gt;
&lt;/ol&gt;

&lt;h3&gt;
  
  
  Benchmark results
&lt;/h3&gt;

&lt;p&gt;Tested on LongBench, Needle In A Haystack, ZeroSCROLLS, RULER, and L-Eval with Gemma and Mistral:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;TurboQuant (KV: 3.5 bits)&lt;/strong&gt; scores 50.06 on LongBench — identical to Full Cache (KV: 16 bits)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;KIVI&lt;/strong&gt; needs 5 bits for 50.16, drops to 48.50 at 3 bits&lt;/li&gt;
&lt;li&gt;Perfect needle-in-haystack results with 6x memory reduction&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;For vector search, TurboQuant outperforms PQ and RabbiQ in recall ratio even when those baselines use large codebooks and dataset-specific tuning.&lt;/p&gt;

&lt;h2&gt;
  
  
  What this means in practice
&lt;/h2&gt;

&lt;p&gt;TurboQuant is a research paper, not a product. But the direction is clear:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Same hardware, bigger models&lt;/strong&gt;: 6x KV cache compression means the GPU running an 8B model could handle something significantly larger&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Lower inference costs&lt;/strong&gt;: 8x attention speedup = fewer GPUs for the same workload&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Edge deployment&lt;/strong&gt;: compression is what separates "interesting demo" from "deployable product"&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Simpler compliance&lt;/strong&gt;: smaller models running locally = less data traveling externally&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The race is not for bigger models. It is for models that are smarter about how they use their resources.&lt;/p&gt;

&lt;h2&gt;
  
  
  Further reading
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;TurboQuant paper (ICLR 2026) — Google Research&lt;/li&gt;
&lt;li&gt;PolarQuant (AISTATS 2026)&lt;/li&gt;
&lt;li&gt;Red Hat optimized models repository on Hugging Face&lt;/li&gt;
&lt;li&gt;&lt;a href="https://zoopa.es/en/digital-marketing-en/llm-optimization-compression-tokens-turboquant/" rel="noopener noreferrer"&gt;Full analysis with data visualizations on our blog&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;What compression or optimization techniques are you using in production? Have you tried running models locally? Would love to hear about your setup.&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Carlos Ortet | 498Advance&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>machinelearning</category>
      <category>deeplearning</category>
    </item>
    <item>
      <title>OpenAI's Agentic Commerce Protocol: a technical look at how ChatGPT becomes a shopping agent</title>
      <dc:creator>carlosortet</dc:creator>
      <pubDate>Wed, 25 Mar 2026 08:51:38 +0000</pubDate>
      <link>https://dev.to/carlosortet/openais-agentic-commerce-protocol-a-technical-look-at-how-chatgpt-becomes-a-shopping-agent-20fd</link>
      <guid>https://dev.to/carlosortet/openais-agentic-commerce-protocol-a-technical-look-at-how-chatgpt-becomes-a-shopping-agent-20fd</guid>
      <description>&lt;p&gt;Last week, OpenAI launched a redesigned shopping experience in ChatGPT. 900 million weekly users can now browse products visually, compare options side-by-side, and get real-time pricing — all inside the conversation.&lt;/p&gt;

&lt;p&gt;The protocol powering it is called &lt;strong&gt;ACP (Agentic Commerce Protocol)&lt;/strong&gt;, an open standard co-developed by OpenAI and Stripe under Apache 2.0. And the technical implementation is worth a closer look.&lt;/p&gt;

&lt;h2&gt;
  
  
  What ACP actually is
&lt;/h2&gt;

&lt;p&gt;ACP defines a common language for AI agents and merchants to coordinate transactions. Think of it as an API contract between ChatGPT (the buyer's agent) and a merchant's product catalog.&lt;/p&gt;

&lt;p&gt;Key architectural principle: &lt;strong&gt;OpenAI is not the merchant of record.&lt;/strong&gt; Merchants retain full control over products, pricing, payments and fulfillment. ChatGPT is just the conversational intermediary.&lt;/p&gt;

&lt;p&gt;The protocol lives on GitHub with 1,300+ stars and 192 forks:&lt;br&gt;
&lt;a href="https://github.com/agentic-commerce-protocol/agentic-commerce-protocol" rel="noopener noreferrer"&gt;github.com/agentic-commerce-protocol/agentic-commerce-protocol&lt;/a&gt;&lt;/p&gt;
&lt;h2&gt;
  
  
  The pivot nobody expected
&lt;/h2&gt;

&lt;p&gt;OpenAI originally launched ACP in September 2025 with "Instant Checkout" — buy directly inside the chat. Etsy was the first integration.&lt;/p&gt;

&lt;p&gt;It didn't work. Out of Shopify's millions of merchants, only about 12 activated checkout. Users browsed and compared, but went to the retailer's site to pay.&lt;/p&gt;

&lt;p&gt;OpenAI acknowledged it in March 2026: &lt;em&gt;"the initial version of Instant Checkout did not offer the level of flexibility that we aspire to provide."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The new focus is &lt;strong&gt;product discovery&lt;/strong&gt;: visual browsing, image-based search, comparison tables, budget filtering — then redirect to the merchant's site for checkout.&lt;/p&gt;
&lt;h2&gt;
  
  
  Technical integration: how it works
&lt;/h2&gt;
&lt;h3&gt;
  
  
  Feed specification
&lt;/h3&gt;

&lt;p&gt;Merchants push product data to an OpenAI endpoint via encrypted HTTPS. The feed spec:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Parameter&lt;/th&gt;
&lt;th&gt;Value&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Formats&lt;/td&gt;
&lt;td&gt;CSV, TSV (recommended), XML, JSON&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Encoding&lt;/td&gt;
&lt;td&gt;UTF-8&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Max size&lt;/td&gt;
&lt;td&gt;10 GB&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Compression&lt;/td&gt;
&lt;td&gt;gzip&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Update frequency&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;Every 15 minutes&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Required fields per product:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;code&gt;id&lt;/code&gt; — unique identifier&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;title&lt;/code&gt; — max 150 chars&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;description&lt;/code&gt; — max 5,000 chars&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;price&lt;/code&gt; — with ISO 4217 currency code&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;availability&lt;/code&gt; — in stock / out of stock / preorder&lt;/li&gt;
&lt;li&gt;
&lt;code&gt;image_url&lt;/code&gt; — at least one high-res image&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Recommended: GTIN/UPC/MPN, reviews, rich media, shipping options, performance signals.&lt;/p&gt;
&lt;h3&gt;
  
  
  Three APIs
&lt;/h3&gt;


&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Feeds API      → Upload/manage full product catalogs
Products API   → Individual product upserts
Promotions API → Manage promotions (API-only)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;


&lt;p&gt;For existing Stripe merchants, integration can take as little as one line of code.&lt;/p&gt;
&lt;h3&gt;
  
  
  Delegated payments
&lt;/h3&gt;

&lt;p&gt;The payment flow uses single-use, time-bound, amount-restricted tokens. OpenAI prepares a delegated payment request; the payment service provider (Stripe, PayPal, Checkout.com) handles tokenization. PCI compliant by design.&lt;/p&gt;
&lt;h3&gt;
  
  
  Versioning
&lt;/h3&gt;

&lt;p&gt;ACP uses date-based versioning (YYYY-MM-DD). The latest stable spec is &lt;code&gt;2026-01-30&lt;/code&gt;, which added extensions, discounts, and payment handlers.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;spec/2025-09-29/  → Initial release
spec/2025-12-12/  → Fulfillment enhancements
spec/2026-01-16/  → Capability negotiation
spec/2026-01-30/  → Extensions, discounts, payment handlers (latest)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;h2&gt;
  
  
  ACP vs. Google UCP vs. Amazon
&lt;/h2&gt;

&lt;p&gt;Three ecosystems are forming:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;ACP (OpenAI + Stripe):&lt;/strong&gt; Open standard, Apache 2.0. Feeds update every 15 minutes. 900M weekly users. Focused on conversational product discovery.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;UCP (Google):&lt;/strong&gt; Open standard, launched January 2026. Backed by Shopify, Walmart, Visa, Mastercard. 50 billion indexed products. Covers the full purchase journey.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Amazon (Rufus/Alexa+):&lt;/strong&gt; Closed ecosystem. 300M Rufus users, 60% higher conversion than standard Amazon flow. Blocks OpenAI crawlers. Removed 600M products from ChatGPT results.&lt;/p&gt;

&lt;p&gt;The key difference between ACP and Google Shopping feeds: ACP feeds are designed for AI reasoning, not indexing. Every field can become an argument the agent uses to explain &lt;em&gt;why&lt;/em&gt; a product is relevant — not just that it exists.&lt;/p&gt;

&lt;h2&gt;
  
  
  The data that matters
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Agentic traffic converts at &lt;strong&gt;15-30%&lt;/strong&gt; (Q1 2026 data) — 5-10x over traditional e-commerce&lt;/li&gt;
&lt;li&gt;AI-generated recommendations convert &lt;strong&gt;4.4x&lt;/strong&gt; better than traditional search (McKinsey)&lt;/li&gt;
&lt;li&gt;ChatGPT drives &lt;strong&gt;20%+ of referral traffic to Walmart&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;But: less than &lt;strong&gt;0.2% of total e-commerce sessions&lt;/strong&gt; come from ChatGPT&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The volume is small. The conversion is extraordinary. The year-over-year growth is +805% (Adobe, Black Friday 2025).&lt;/p&gt;

&lt;h2&gt;
  
  
  What this means if you build e-commerce
&lt;/h2&gt;

&lt;p&gt;If you're building or maintaining e-commerce systems, ACP changes the optimization target.&lt;/p&gt;

&lt;h3&gt;
  
  
  1. Feed quality &amp;gt; ad spend
&lt;/h3&gt;

&lt;p&gt;Products surface based on what the AI can parse and verify. Incomplete feeds = invisible products. Every missing attribute is a lost recommendation opportunity. A product without a GTIN, without proper category depth (up to 5 levels), or with a generic "Great product, buy now!" description will never be recommended when a user asks for something specific.&lt;/p&gt;

&lt;h3&gt;
  
  
  2. Descriptions for NLP, not SEO
&lt;/h3&gt;

&lt;p&gt;Attribute-rich descriptions that an AI can reason about beat keyword-stuffed copy. When someone asks "waterproof running shoes for flat feet under $120," the AI needs to find "waterproof," "arch support for flat feet," and the price in your product data. If those attributes are buried in marketing copy instead of structured fields, you lose.&lt;/p&gt;

&lt;h3&gt;
  
  
  3. Schema markup matters more than ever
&lt;/h3&gt;

&lt;p&gt;Product, Offer, AggregateRating with real data. The AI cross-references your structured data against user queries. Complete schema is no longer optional — it's the foundation of agentic visibility.&lt;/p&gt;

&lt;h3&gt;
  
  
  4. Multi-protocol support
&lt;/h3&gt;

&lt;p&gt;Prepare to maintain feeds for both ACP (OpenAI) and UCP (Google). Shopify already offers dual support. The formats are similar enough that a single product data pipeline can serve both, but the update frequencies differ (15 minutes for ACP vs. 24 hours default for UCP).&lt;/p&gt;

&lt;h3&gt;
  
  
  5. Attribution is fundamentally broken
&lt;/h3&gt;

&lt;p&gt;The customer behavior that drives the sale is invisible to existing analytics. A user asks ChatGPT for a recommendation, gets three product suggestions, visits one site, and buys. Your analytics sees a direct visit or an unknown referrer. No impression. No click. No session.&lt;/p&gt;

&lt;p&gt;Server-side webhook tracking from ACP/UCP is the path forward. Start building this infrastructure now — you'll need 18-24 months of data before it becomes reliable.&lt;/p&gt;

&lt;h2&gt;
  
  
  Getting started
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;Merchant portal: &lt;a href="https://chatgpt.com/merchants" rel="noopener noreferrer"&gt;chatgpt.com/merchants&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Developer docs: &lt;a href="https://developers.openai.com/commerce" rel="noopener noreferrer"&gt;developers.openai.com/commerce&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Feed spec: &lt;a href="https://developers.openai.com/commerce/specs/feed" rel="noopener noreferrer"&gt;developers.openai.com/commerce/specs/feed&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;GitHub: &lt;a href="https://github.com/agentic-commerce-protocol/agentic-commerce-protocol" rel="noopener noreferrer"&gt;github.com/agentic-commerce-protocol&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Stripe integration: &lt;a href="https://docs.stripe.com/agentic-commerce" rel="noopener noreferrer"&gt;docs.stripe.com/agentic-commerce&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Currently US-only for merchant onboarding. Product discovery is available globally for all ChatGPT users (Free, Go, Plus, Pro).&lt;/p&gt;




&lt;p&gt;We wrote a detailed analysis with market data, competitive comparisons, and practical recommendations:&lt;br&gt;
&lt;a href="https://zoopa.es/en/digital-marketing-en/agentic-commerce-protocol-acp-chatgpt-shopping/" rel="noopener noreferrer"&gt;zoopa.es/en/digital-marketing-en/agentic-commerce-protocol-acp-chatgpt-shopping/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;What's your take on agentic commerce? Are you already preparing your product feeds for AI agents, or waiting to see how ACP vs. UCP plays out?&lt;/p&gt;

</description>
      <category>ai</category>
      <category>ecommerce</category>
      <category>openai</category>
      <category>webdev</category>
    </item>
    <item>
      <title>robots.txt is a sign, not a fence: 8 technical vectors through which AI still reads your website</title>
      <dc:creator>carlosortet</dc:creator>
      <pubDate>Mon, 23 Mar 2026 07:54:14 +0000</pubDate>
      <link>https://dev.to/carlosortet/robotstxt-is-a-sign-not-a-fence-8-technical-vectors-through-which-ai-still-reads-your-website-37c6</link>
      <guid>https://dev.to/carlosortet/robotstxt-is-a-sign-not-a-fence-8-technical-vectors-through-which-ai-still-reads-your-website-37c6</guid>
      <description>&lt;p&gt;You configure &lt;code&gt;robots.txt&lt;/code&gt; like this:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight conf"&gt;&lt;code&gt;&lt;span class="n"&gt;User&lt;/span&gt;-&lt;span class="n"&gt;agent&lt;/span&gt;: &lt;span class="n"&gt;GPTBot&lt;/span&gt;
&lt;span class="n"&gt;Disallow&lt;/span&gt;: /

&lt;span class="n"&gt;User&lt;/span&gt;-&lt;span class="n"&gt;agent&lt;/span&gt;: &lt;span class="n"&gt;CCBot&lt;/span&gt;
&lt;span class="n"&gt;Disallow&lt;/span&gt;: /

&lt;span class="n"&gt;User&lt;/span&gt;-&lt;span class="n"&gt;agent&lt;/span&gt;: &lt;span class="n"&gt;anthropic&lt;/span&gt;-&lt;span class="n"&gt;ai&lt;/span&gt;
&lt;span class="n"&gt;Disallow&lt;/span&gt;: /

&lt;span class="n"&gt;User&lt;/span&gt;-&lt;span class="n"&gt;agent&lt;/span&gt;: &lt;span class="n"&gt;PerplexityBot&lt;/span&gt;
&lt;span class="n"&gt;Disallow&lt;/span&gt;: /

&lt;span class="n"&gt;User&lt;/span&gt;-&lt;span class="n"&gt;agent&lt;/span&gt;: *
&lt;span class="n"&gt;Disallow&lt;/span&gt;: /
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;You enable Cloudflare Bot Management. You set up Akamai. Maybe even a server-side paywall.&lt;/p&gt;

&lt;p&gt;And then you query ChatGPT about your product and it cites your website as a source.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;How?&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I work on GEO (Generative Engine Optimization) projects where we audit how LLMs represent brands. We routinely analyze thousands of prompt-response pairs. Across multiple projects, we consistently find that &lt;strong&gt;10–20% of LLM responses&lt;/strong&gt; cite the brand's own website as a source — even when every known bot is blocked.&lt;/p&gt;

&lt;p&gt;Here are the 8 technical vectors we documented, with academic sources and industry data.&lt;/p&gt;

&lt;h2&gt;
  
  
  1. Historical crawl data (Common Crawl)
&lt;/h2&gt;

&lt;p&gt;This is the biggest one and the least understood.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://commoncrawl.org/" rel="noopener noreferrer"&gt;Common Crawl&lt;/a&gt; is a nonprofit that has been archiving the web since 2007. The numbers:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;9.5+ petabytes&lt;/strong&gt;, 300+ billion documents&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;~2/3 of the 47 LLMs&lt;/strong&gt; published between 2019–2023 use it as training data&lt;/li&gt;
&lt;li&gt;GPT-3, LLaMA, T5, Red Pajama all trained on it&lt;/li&gt;
&lt;li&gt;Google's C4 dataset: 750 GB filtered from Common Crawl&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Blocking crawlers today &lt;strong&gt;does not retroactively remove&lt;/strong&gt; content already captured. Those snapshots are permanent, public resources.&lt;/p&gt;

&lt;p&gt;Source: &lt;a href="https://dl.acm.org/doi/10.1145/3630106.3659033" rel="noopener noreferrer"&gt;ACM FAccT 2024 — "A Critical Analysis of Common Crawl"&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  2. Client-side paywall bypass
&lt;/h2&gt;

&lt;p&gt;Common Crawl does not execute JavaScript. If your paywall depends on client-side JS:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight html"&gt;&lt;code&gt;&lt;span class="c"&gt;&amp;lt;!-- Your paywall loads after DOM ready --&amp;gt;&lt;/span&gt;
&lt;span class="nt"&gt;&amp;lt;script&amp;gt;&lt;/span&gt;
  &lt;span class="nb"&gt;document&lt;/span&gt;&lt;span class="p"&gt;.&lt;/span&gt;&lt;span class="nf"&gt;addEventListener&lt;/span&gt;&lt;span class="p"&gt;(&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="s1"&gt;DOMContentLoaded&lt;/span&gt;&lt;span class="dl"&gt;'&lt;/span&gt;&lt;span class="p"&gt;,&lt;/span&gt; &lt;span class="p"&gt;()&lt;/span&gt; &lt;span class="o"&gt;=&amp;gt;&lt;/span&gt; &lt;span class="p"&gt;{&lt;/span&gt;
    &lt;span class="nf"&gt;showPaywall&lt;/span&gt;&lt;span class="p"&gt;();&lt;/span&gt;
  &lt;span class="p"&gt;});&lt;/span&gt;
&lt;span class="nt"&gt;&amp;lt;/script&amp;gt;&lt;/span&gt;

&lt;span class="c"&gt;&amp;lt;!-- But the crawler already captured the full HTML --&amp;gt;&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;The crawler gets the complete article before JS even runs.&lt;/p&gt;

&lt;p&gt;Alex Reisner documented this for The Atlantic (Nov 2025): Common Crawl was capturing full articles from NYT, WSJ, The Economist and The Atlantic itself.&lt;/p&gt;

&lt;h2&gt;
  
  
  3. User-agent spoofing
&lt;/h2&gt;

&lt;p&gt;Some AI bots change their identity when blocked.&lt;/p&gt;

&lt;p&gt;Cloudflare documented (Aug 2024) that Perplexity was using:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight http"&gt;&lt;code&gt;&lt;span class="err"&gt;# Declared user-agent
PerplexityBot/1.0

# What they actually sent
Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 Chrome/120.0.0.0
&lt;/span&gt;&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Plus ASN rotation to evade IP-based blocking. The evasion ecosystem includes FlareSolverr (Selenium + undetected-chromedriver), Scrapfly (94–98% bypass rates), and residential proxy rotation.&lt;/p&gt;

&lt;h2&gt;
  
  
  4. Syndication redistribution
&lt;/h2&gt;

&lt;p&gt;Once your content leaves your domain through any syndication channel, your &lt;code&gt;robots.txt&lt;/code&gt; is irrelevant:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Original domain (robots.txt: Disallow)
  → RSS feed (no robots.txt)
  → Apple News (different domain)
  → Email newsletter (archived on web)
  → Cross-posted to social (scraped by bots)
  → API aggregators (reformatted downstream)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Each channel creates a copy outside your control.&lt;/p&gt;

&lt;h2&gt;
  
  
  5. Web archives (Wayback Machine)
&lt;/h2&gt;

&lt;p&gt;Internet Archive: 1+ billion pages, 99+ petabytes. &lt;code&gt;web.archive.org&lt;/code&gt; is domain #187 in Google's C4 dataset.&lt;/p&gt;

&lt;p&gt;Harvard's &lt;a href="https://lil.law.harvard.edu/" rel="noopener noreferrer"&gt;WARC-GPT&lt;/a&gt; lets you ingest WARC archives directly into RAG pipelines. As of Feb 2026, publishers like The Guardian and NYT started blocking Wayback Machine over AI concerns.&lt;/p&gt;

&lt;h2&gt;
  
  
  6. Real-time RAG access
&lt;/h2&gt;

&lt;p&gt;Modern LLMs don't just rely on training data. They fetch content in real time:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Bot&lt;/th&gt;
&lt;th&gt;Growth 2024–2025&lt;/th&gt;
&lt;th&gt;Mechanism&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;ChatGPT-User&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;+2,825%&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Fetch on user "search the web"&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;PerplexityBot&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;+157,490%&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Fetch on every query&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Meta-ExternalFetcher&lt;/td&gt;
&lt;td&gt;New in 2024&lt;/td&gt;
&lt;td&gt;Meta AI features&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;These bots claim the fetch is "user-initiated" (not autonomous crawling), trying to exempt themselves from &lt;code&gt;robots.txt&lt;/code&gt;.&lt;/p&gt;

&lt;p&gt;Cloudflare reported Anthropic's bots have crawl-to-refer ratios of &lt;strong&gt;38,000:1 to 70,000:1&lt;/strong&gt;. For every time they send traffic back, they crawl tens of thousands of times.&lt;/p&gt;

&lt;p&gt;Sources: &lt;a href="https://blog.cloudflare.com/" rel="noopener noreferrer"&gt;Cloudflare Blog 2025&lt;/a&gt;, &lt;a href="https://platform.openai.com/docs/bots" rel="noopener noreferrer"&gt;OpenAI Crawlers Overview&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  7. Content farms
&lt;/h2&gt;

&lt;p&gt;Content farms — human or AI-operated — rewrite your articles on unrestricted domains:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;1. Scrape/copy original article
2. Rewrite to avoid plagiarism detection
3. Publish on domain with no robots.txt restrictions
4. AI crawler indexes the rewrite
5. LLM absorbs the rewritten version
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;In &lt;em&gt;Bartz v. Anthropic PBC&lt;/em&gt;, the court ruled that training AI with content from "pirate sites" constituted fair use. This sets precedent for rewritten content too.&lt;/p&gt;

&lt;h2&gt;
  
  
  8. Direct non-compliance
&lt;/h2&gt;

&lt;p&gt;The simplest vector: bots just ignore &lt;code&gt;robots.txt&lt;/code&gt;.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;12.9% of bots&lt;/strong&gt; ignore it entirely (was 3.3%) — &lt;a href="https://paulcalvano.com/" rel="noopener noreferrer"&gt;Paul Calvano, Aug 2025&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Duke University (2025): "several categories of AI-related crawlers never request robots.txt"&lt;/li&gt;
&lt;li&gt;
&lt;a href="https://arxiv.org/abs/2505.21733" rel="noopener noreferrer"&gt;Kim &amp;amp; Bock (ACM IMC 2025)&lt;/a&gt;: scrapers are &lt;strong&gt;less likely&lt;/strong&gt; to comply with more restrictive directives&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The legal status is clear: in &lt;em&gt;Ziff Davis v. OpenAI&lt;/em&gt; (2025), the judge described &lt;code&gt;robots.txt&lt;/code&gt; as "more like a sign than a fence" — not a technological measure that "effectively controls access" under the DMCA.&lt;/p&gt;

&lt;h2&gt;
  
  
  The compliance stats
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Metric&lt;/th&gt;
&lt;th&gt;Value&lt;/th&gt;
&lt;th&gt;Source&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Bots ignoring robots.txt&lt;/td&gt;
&lt;td&gt;12.9%&lt;/td&gt;
&lt;td&gt;Paul Calvano, 2025&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Top 10K sites with AI bot rules&lt;/td&gt;
&lt;td&gt;Only 14%&lt;/td&gt;
&lt;td&gt;Market analysis 2025&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Sites with any robots.txt&lt;/td&gt;
&lt;td&gt;94% (12.2M sites)&lt;/td&gt;
&lt;td&gt;Global study 2025&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h2&gt;
  
  
  So what do you do?
&lt;/h2&gt;

&lt;p&gt;Blocking alone doesn't work. Defensive measures reduce direct crawling by 40–60% for compliant bots, but they can't touch historical data, syndicated copies, or content farm rewrites.&lt;/p&gt;

&lt;p&gt;The alternative is &lt;strong&gt;offensive&lt;/strong&gt;: control the narrative instead of trying to hide from it.&lt;/p&gt;

&lt;p&gt;At &lt;a href="https://498as.com" rel="noopener noreferrer"&gt;498 Advance&lt;/a&gt; we built tools for this: &lt;strong&gt;GEOdoctor&lt;/strong&gt; for technical auditing of brand visibility in LLMs, and &lt;strong&gt;S.A.M.&lt;/strong&gt; (Semantic Alignment Machine) for content alignment across owned media, UGC platforms (social GEO) and authority domains.&lt;/p&gt;




&lt;p&gt;Full analysis with all academic sources: &lt;a href="https://zoopa.es/en/digital-marketing-en/why-ai-keeps-reading-your-website-even-after-you-block-every-bot/" rel="noopener noreferrer"&gt;zoopa.es/en/blog&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Have you run into this paradox? Blocking everything but still appearing in LLM outputs? I'd love to hear what you've observed in your own infrastructure.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>security</category>
      <category>webdev</category>
      <category>devops</category>
    </item>
    <item>
      <title>How I built a proactive personal AI assistant based on Moltbot for 10€ a month</title>
      <dc:creator>carlosortet</dc:creator>
      <pubDate>Fri, 30 Jan 2026 07:18:51 +0000</pubDate>
      <link>https://dev.to/carlosortet/how-i-built-a-proactive-personal-ai-assistant-based-on-moltbot-for-10eu-a-month-34dp</link>
      <guid>https://dev.to/carlosortet/how-i-built-a-proactive-personal-ai-assistant-based-on-moltbot-for-10eu-a-month-34dp</guid>
      <description>&lt;p&gt;&lt;strong&gt;An open source agent with persistent memory, command execution, real proactivity, and WhatsApp as its interface. Full architecture, real costs, and lessons learned.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Why Build Your Own Assistant
&lt;/h2&gt;

&lt;p&gt;The idea of having an AI assistant that actually knows you is nothing new. Anyone who uses ChatGPT, Claude, or Gemini daily has felt the same frustration: every conversation starts from scratch, you have to re-explain your project context, and the assistant cannot do anything beyond the chat window.&lt;/p&gt;

&lt;p&gt;At 498AS, we have been experimenting for months with language models applied to real workflows. Not just text generation, but agents that execute tasks, access tools, and maintain long-term context.&lt;/p&gt;

&lt;p&gt;When we discovered Clawdbot (Moltbot), we saw an opportunity to close that gap. An open source project that lets you run your own AI agent with persistent memory, command execution, and the ability to message you before you message it.&lt;/p&gt;

&lt;p&gt;We set it up. It works. And it costs 10 euros a month.&lt;/p&gt;




&lt;h2&gt;
  
  
  What Is Clawdbot
&lt;/h2&gt;

&lt;p&gt;Clawdbot (Moltbot) is an open source project that provides the infrastructure for running your own AI agent. It is not a closed product or a subscription app. It is the technical skeleton on which you build your personalized assistant.&lt;/p&gt;

&lt;p&gt;Its core capabilities:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Multichannel communication.&lt;/strong&gt; It talks to you via WhatsApp, Telegram, Discord, or whatever platform you prefer.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Persistent memory.&lt;/strong&gt; It remembers previous conversations, your projects, your preferences. It does not start from zero.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Command execution.&lt;/strong&gt; It can run scripts, open applications, control your browser. Real actions on your computer.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Tool and API access.&lt;/strong&gt; You configure which tools it can use, and the agent integrates them into its responses and actions.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Full control.&lt;/strong&gt; It runs on your server, under your infrastructure. You decide what data it holds and who can access it.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The key difference from ChatGPT or Claude is that Clawdbot does not live in a third party's cloud. It lives wherever you decide.&lt;/p&gt;




&lt;h2&gt;
  
  
  Full System Architecture
&lt;/h2&gt;

&lt;h3&gt;
  
  
  The Central Server
&lt;/h3&gt;

&lt;p&gt;The heart of the system is a VPS on Hetzner. Minimum specs: 2 vCPUs, 4GB of RAM. Model CX22. Cost: approximately 4 euros per month.&lt;/p&gt;

&lt;p&gt;This server runs the Clawdbot gateway, which handles:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Receiving and sending messages&lt;/li&gt;
&lt;li&gt;Communication with the AI model&lt;/li&gt;
&lt;li&gt;Persistent memory management&lt;/li&gt;
&lt;li&gt;Tool and node coordination&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  The AI Model
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;Subscription option.&lt;/strong&gt; If you already pay for Claude Pro Max, ChatGPT Plus/Pro, or similar, Clawdbot can use that subscription directly. No separate API costs.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;API option.&lt;/strong&gt; If you prefer pay-per-use, you can connect the APIs from Claude, OpenAI, or any compatible model. For personal use it typically ranges between 10 and 30 euros per month.&lt;/p&gt;

&lt;h3&gt;
  
  
  WhatsApp as the Interface
&lt;/h3&gt;

&lt;p&gt;The setup requires a dedicated prepaid SIM. Cost: 6 euros per month with minimal data. The agent has its own phone number. You message it like any other contact and it replies.&lt;/p&gt;

&lt;h3&gt;
  
  
  Security with Tailscale
&lt;/h3&gt;

&lt;p&gt;The Clawdbot gateway is &lt;strong&gt;not exposed to the internet&lt;/strong&gt;. It only listens on the Tailscale network.&lt;/p&gt;

&lt;p&gt;Tailscale is a mesh VPN that creates a private network between your devices. End-to-end encryption, regardless of physical location. Free for personal use.&lt;/p&gt;

&lt;p&gt;Our specific configuration:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;The Hetzner VPS is on the Tailscale network&lt;/li&gt;
&lt;li&gt;Our phones and computers are too&lt;/li&gt;
&lt;li&gt;The gateway only accepts connections from Tailscale IP ranges&lt;/li&gt;
&lt;li&gt;The firewall is explicitly configured: only Tailscale can reach the gateway&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If someone scans the server from the internet, the gateway port does not even show up.&lt;/p&gt;

&lt;h3&gt;
  
  
  Distributed Nodes
&lt;/h3&gt;

&lt;p&gt;Clawdbot lets you connect "nodes": computers that the agent can control remotely. My MacBook is connected as a node. I can be out on the street, message the agent via WhatsApp saying "open Chrome and look up X" and it does it on my Mac.&lt;/p&gt;

&lt;p&gt;All traffic between nodes and the gateway goes through Tailscale.&lt;/p&gt;

&lt;h3&gt;
  
  
  Access Control
&lt;/h3&gt;

&lt;p&gt;The agent does not talk to just anyone. We have an allowlist of authorized phone numbers. Permissions are granular: who can interact, which commands it can execute, which tools are available, whether execution requires prior approval.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Proactive Agent: The Real Game Changer
&lt;/h2&gt;

&lt;p&gt;This is the feature that turns a chatbot into an actual assistant. The agent does not wait for you to write. It messages you first.&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Email alerts.&lt;/strong&gt; The agent monitors your email and notifies you via WhatsApp when something relevant arrives.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Calendar reminders.&lt;/strong&gt; It reads your calendar and alerts you ahead of meetings.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Custom alerts.&lt;/strong&gt; Weather updates, price monitoring, GitHub repository activity.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Scheduled tasks.&lt;/strong&gt; "Every Monday at 9, review my pending tasks and tell me what I need to do this week."&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Proactivity completely transforms the relationship with the assistant.&lt;/p&gt;




&lt;h2&gt;
  
  
  Real Cost Breakdown
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Item&lt;/th&gt;
&lt;th&gt;Monthly Cost&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Hetzner CX22 VPS&lt;/td&gt;
&lt;td&gt;~4 EUR&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Prepaid SIM with data&lt;/td&gt;
&lt;td&gt;~6 EUR&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Tailscale&lt;/td&gt;
&lt;td&gt;0 EUR (free tier)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Total infrastructure&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;~10 EUR/month&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;The AI model is separate. If you already have a subscription to Claude Pro, ChatGPT Plus, or similar, you pay nothing extra.&lt;/p&gt;




&lt;h2&gt;
  
  
  Lessons Learned After Weeks of Use
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;More useful than expected.&lt;/strong&gt; What started as a technical experiment became a daily tool.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Proactivity is the differentiator.&lt;/strong&gt; Having the agent message you when something important happens is a paradigm shift.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Security is non-negotiable.&lt;/strong&gt; Tailscale, firewall, allowlist, granular permissions. All of it.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;WhatsApp as an interface reduces friction.&lt;/strong&gt; It is where you already are.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  How to Get Started
&lt;/h2&gt;

&lt;p&gt;Clawdbot is open source and available on GitHub with comprehensive documentation and an active Discord community.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Get a cheap VPS (Hetzner, DigitalOcean, or similar)&lt;/li&gt;
&lt;li&gt;Install Clawdbot following the documentation&lt;/li&gt;
&lt;li&gt;Connect WhatsApp with a dedicated SIM&lt;/li&gt;
&lt;li&gt;Set up Tailscale on the server and your devices&lt;/li&gt;
&lt;li&gt;Define an allowlist of authorized numbers&lt;/li&gt;
&lt;li&gt;Configure tools and permissions&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;&lt;strong&gt;Security recommendations:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Do not expose the gateway to the internet. Use Tailscale.&lt;/li&gt;
&lt;li&gt;Do not run as root.&lt;/li&gt;
&lt;li&gt;Configure user allowlists.&lt;/li&gt;
&lt;li&gt;Enable approval for command execution.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;We wrote a detailed blog post with the full architecture and step-by-step setup: &lt;a href="https://zoopa.es/es/blog/como-montar-un-asistente-de-ia-personal-por-10e-al-mes-con-clawdbot/" rel="noopener noreferrer"&gt;Read the full article on our blog&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Article published by the 498AS team. Questions about the setup? Drop a comment or get in touch.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>selfhosted</category>
      <category>ai</category>
      <category>opensource</category>
      <category>showdev</category>
    </item>
    <item>
      <title>Clawdbot: The AI Assistant Siri Promised But Never Delivered — Complete 2026 Guide</title>
      <dc:creator>carlosortet</dc:creator>
      <pubDate>Sun, 25 Jan 2026 18:32:18 +0000</pubDate>
      <link>https://dev.to/carlosortet/clawdbot-the-ai-assistant-siri-promised-but-never-delivered-complete-2026-guide-4la0</link>
      <guid>https://dev.to/carlosortet/clawdbot-the-ai-assistant-siri-promised-but-never-delivered-complete-2026-guide-4la0</guid>
      <description>&lt;p&gt;&lt;strong&gt;Clawdbot is an open-source personal AI assistant that radically transforms how we interact with artificial intelligence.&lt;/strong&gt; Unlike ChatGPT or Claude web, Clawdbot lives inside your everyday messaging apps — WhatsApp, Telegram, Slack, Discord — and can proactively message you, remember conversations from weeks ago, and execute tasks on your computer.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Problem Clawdbot Solves
&lt;/h2&gt;

&lt;p&gt;For over a decade, big tech companies promised us intelligent assistants that would transform our productivity. Siri arrived in 2011. Google Assistant in 2016. Alexa conquered millions of homes. And yet, in 2026, most users remain frustrated with these tools.&lt;/p&gt;

&lt;p&gt;The fundamental problem is that these traditional assistants:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Wait passively&lt;/strong&gt; for you to speak to them&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Forget everything&lt;/strong&gt; when the session closes&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Can't execute&lt;/strong&gt; complex tasks&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Live in closed ecosystems&lt;/strong&gt; that limit their utility&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Prioritize data collection&lt;/strong&gt; over real functionality&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Clawdbot represents a paradigm shift. It's what Siri should have been and never was.&lt;/p&gt;




&lt;h2&gt;
  
  
  Clawdbot vs Siri vs Google Assistant vs Alexa: 2026 Comparison
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Feature&lt;/th&gt;
&lt;th&gt;Clawdbot&lt;/th&gt;
&lt;th&gt;Siri&lt;/th&gt;
&lt;th&gt;Google Assistant&lt;/th&gt;
&lt;th&gt;Alexa&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Proactivity&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Writes to you when there's news&lt;/td&gt;
&lt;td&gt;Only responds&lt;/td&gt;
&lt;td&gt;Limited to routines&lt;/td&gt;
&lt;td&gt;Only basic notifications&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Persistent memory&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Remembers weeks/months&lt;/td&gt;
&lt;td&gt;Forgets on close&lt;/td&gt;
&lt;td&gt;Very limited&lt;/td&gt;
&lt;td&gt;Session only&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Executes PC tasks&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Terminal, browser, files&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Open source&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;MIT License, auditable&lt;/td&gt;
&lt;td&gt;Closed&lt;/td&gt;
&lt;td&gt;Closed&lt;/td&gt;
&lt;td&gt;Closed&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Your data on your machine&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Local-first&lt;/td&gt;
&lt;td&gt;Apple servers&lt;/td&gt;
&lt;td&gt;Google servers&lt;/td&gt;
&lt;td&gt;Amazon servers&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;WhatsApp integration&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Native&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Monthly cost&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;~$27 (VPS+Claude)&lt;/td&gt;
&lt;td&gt;Free (with iPhone)&lt;/td&gt;
&lt;td&gt;Free&lt;/td&gt;
&lt;td&gt;Free (with Echo)&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;strong&gt;Verdict&lt;/strong&gt;: Clawdbot is the only assistant that combines real proactivity, persistent memory, and total user control. The trade-off is it requires initial technical setup.&lt;/p&gt;




&lt;h2&gt;
  
  
  What is Clawdbot and Why Does It Matter
&lt;/h2&gt;

&lt;p&gt;Clawdbot is an open-source personal AI assistant that runs locally on your machine and responds through the messaging apps you already use: WhatsApp, Telegram, Slack, Discord, Signal, iMessage, and 50+ additional integrations.&lt;/p&gt;

&lt;p&gt;The fundamental difference with ChatGPT or Claude web is that Clawdbot &lt;strong&gt;lives inside your messaging apps&lt;/strong&gt;. You don't go to a website. You message it like any other contact. And it can:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Proactively send you messages&lt;/li&gt;
&lt;li&gt;Remember past conversations&lt;/li&gt;
&lt;li&gt;Execute tasks on your computer&lt;/li&gt;
&lt;li&gt;Run 24/7&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  The Local-First Philosophy
&lt;/h3&gt;

&lt;p&gt;One of the most relevant aspects of Clawdbot is its local-first architecture:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Your data stays on your machine by default&lt;/li&gt;
&lt;li&gt;No dependency on external servers (except the AI model you choose)&lt;/li&gt;
&lt;li&gt;You can audit all the code since it's open-source&lt;/li&gt;
&lt;li&gt;You have total control over what it can and cannot do&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  The Creator: Peter Steinberger
&lt;/h2&gt;

&lt;p&gt;Peter Steinberger (@steipete) is the creator of Clawdbot. He's not an anonymous developer or a weekend project funded by VCs promising to revolutionize the world.&lt;/p&gt;

&lt;p&gt;His profile:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Ex-founder of PSPDFKit&lt;/strong&gt;, a PDF software company with global presence&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;13+ years of experience&lt;/strong&gt; in native iOS development&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;Based in Vienna and London&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Builds in public&lt;/strong&gt;, sharing both successes and failures&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Technical Architecture
&lt;/h2&gt;

&lt;p&gt;Clawdbot uses a hub-and-spoke architecture with a centralized Gateway that acts as the control plane. The Gateway manages:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Session routing and isolation&lt;/strong&gt;: Each conversation stays separate&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Channel connections and events&lt;/strong&gt;: WhatsApp, Telegram, Slack, etc.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Tool execution and streaming&lt;/strong&gt;: Real-time responses&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Canvas/A2UI hosting&lt;/strong&gt;: Visual interface when needed&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Device node orchestration&lt;/strong&gt;: Multiple synchronized devices&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Supported AI Models
&lt;/h3&gt;

&lt;p&gt;Clawdbot isn't tied to a single provider:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Provider&lt;/th&gt;
&lt;th&gt;Models&lt;/th&gt;
&lt;th&gt;Notes&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Anthropic&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Claude Pro/Max&lt;/td&gt;
&lt;td&gt;Recommended: Claude Opus 4.5&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;OpenAI&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;ChatGPT/Codex&lt;/td&gt;
&lt;td&gt;GPT-4 available&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Local&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Ollama, LM Studio&lt;/td&gt;
&lt;td&gt;For maximum privacy&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;




&lt;h2&gt;
  
  
  Installation and Requirements
&lt;/h2&gt;

&lt;h3&gt;
  
  
  System Requirements
&lt;/h3&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Component&lt;/th&gt;
&lt;th&gt;Requirement&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Node.js&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&amp;gt;= 22&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Package Manager&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;npm, pnpm (preferred), or bun&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Operating System&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;macOS, Linux, Windows (via WSL2)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Minimum RAM&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;2GB (4GB for web automation)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Optional&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Docker (for sandboxing)&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h3&gt;
  
  
  Quick Installation
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;npm &lt;span class="nb"&gt;install&lt;/span&gt; &lt;span class="nt"&gt;-g&lt;/span&gt; clawdbot@latest
clawdbot onboard &lt;span class="nt"&gt;--install-daemon&lt;/span&gt;
clawdbot gateway &lt;span class="nt"&gt;--port&lt;/span&gt; 18789 &lt;span class="nt"&gt;--verbose&lt;/span&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Or the one-liner:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;curl &lt;span class="nt"&gt;-fsSL&lt;/span&gt; https://clawd.bot/install.sh | bash
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Estimated setup time&lt;/strong&gt;: 30 minutes for basic configuration. 2-4 hours for complete customization with skills.&lt;/p&gt;




&lt;h2&gt;
  
  
  Differentiating Features
&lt;/h2&gt;

&lt;h3&gt;
  
  
  1. Persistent Memory
&lt;/h3&gt;

&lt;p&gt;Unlike traditional chatbots that forget everything when the session closes, Clawdbot maintains continuous 24/7 context, learned user preferences, history of past conversations, and structured memory files. The workspace can be a Git repository — if the bot "learns" something incorrect, you can git revert and return to a previous state.&lt;/p&gt;

&lt;h3&gt;
  
  
  2. Real Proactivity
&lt;/h3&gt;

&lt;p&gt;This is perhaps the most differentiating feature. Clawdbot doesn't wait passively:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Morning briefings&lt;/strong&gt;: Summary of emails, calendar, relevant news&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Contextual alerts&lt;/strong&gt;: "Your meeting starts in 20 minutes"&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Topic tracking&lt;/strong&gt;: "That project you mentioned has updates"&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Smart reminders&lt;/strong&gt;: Based on usage patterns&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  3. System Control
&lt;/h3&gt;

&lt;p&gt;Clawdbot can interact with your computer:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Browser&lt;/strong&gt;: Automation with Chrome/Chromium (CDP)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Files&lt;/strong&gt;: Reading and writing&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Terminal&lt;/strong&gt;: Shell command execution&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Scripts&lt;/strong&gt;: On-the-fly creation and execution&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  4. Extensibility
&lt;/h3&gt;

&lt;p&gt;The skills system allows extending capabilities. Most interesting: Clawdbot can write its own plugins to gain new capabilities.&lt;/p&gt;




&lt;h2&gt;
  
  
  Real-World Use Cases
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Personal Productivity
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Automatic Inbox Zero&lt;/strong&gt; — Email cleanup, mass unsubscription, organization&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Smart Calendar&lt;/strong&gt; — Contextual reminders based on location/time&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Flight Check-in&lt;/strong&gt; — Complete automation without intervention&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Morning Briefings&lt;/strong&gt; — Personalized daily summary&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Advanced Automation
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;WhatsApp Memory Vault&lt;/strong&gt; — A user automatically transcribed over 1000 voice messages with integrated semantic search.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Grocery Autopilot&lt;/strong&gt; — From recipe photo to completed shopping cart in under 5 minutes.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Complete Website Migration&lt;/strong&gt; — &lt;a class="mentioned-user" href="https://dev.to/thekitze"&gt;@thekitze&lt;/a&gt;: "Rebuilt my entire site via Telegram while watching Netflix in bed. Notion to Astro, 18 posts migrated, DNS moved to Cloudflare. Never opened my laptop."&lt;/p&gt;




&lt;h2&gt;
  
  
  Community Testimonials
&lt;/h2&gt;

&lt;blockquote&gt;
&lt;p&gt;"I got up and running today with @clawdbot and it's been nothing short of an iPhone moment for me." — @dajaset&lt;/p&gt;

&lt;p&gt;"Using @clawdbot for a week and it genuinely feels like early AGI." — &lt;a class="mentioned-user" href="https://dev.to/davekiss"&gt;@davekiss&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;"Not enterprise. Not hosted. Infrastructure you control. This is what personal AI should feel like." — @karpathy&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  Real Costs
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Component&lt;/th&gt;
&lt;th&gt;Cost&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Basic VPS&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;~5 EUR/month (Hetzner, 2GB RAM)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Recommended VPS&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;~10 EUR/month (4GB RAM)&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Claude Pro&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;$20/month&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Claude Max&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;$100-200/month&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;strong&gt;Minimum functional configuration&lt;/strong&gt;: ~$27/month (VPS + Claude Pro)&lt;/p&gt;

&lt;p&gt;Clawdbot software is 100% free and open-source under MIT license.&lt;/p&gt;




&lt;h2&gt;
  
  
  Security and Privacy
&lt;/h2&gt;

&lt;p&gt;Clawdbot follows a local-first architecture: your data stays on your machine by default. Being open-source, you can audit all the code. However, the agent has broad access to your system (files, terminal, browser), so you should carefully configure permissions and use Docker containers to limit access in group contexts.&lt;/p&gt;

&lt;h3&gt;
  
  
  Security Recommendations
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;Use restrictive pairing&lt;/li&gt;
&lt;li&gt;SSH tunneling for remote access&lt;/li&gt;
&lt;li&gt;Dedicated number for WhatsApp (ban risk)&lt;/li&gt;
&lt;li&gt;Carefully evaluate skill permissions&lt;/li&gt;
&lt;li&gt;Run &lt;code&gt;clawdbot doctor&lt;/code&gt; regularly&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Who is Clawdbot For
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Ideal Profiles
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Developers&lt;/strong&gt; — Infinite extensibility, hackable&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Technical Power Users&lt;/strong&gt; — Total control, privacy&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Founders/CEOs&lt;/strong&gt; — 24/7 automation&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Content Creators&lt;/strong&gt; — Multi-platform, community management&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  Who Should NOT Use It (Yet)
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Non-technical users&lt;/strong&gt; — Requires terminal, VPS knowledge&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Companies with strict compliance&lt;/strong&gt; — Unaudited security&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Error intolerant users&lt;/strong&gt; — Still early stage software&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Conclusions
&lt;/h2&gt;

&lt;p&gt;Clawdbot represents a milestone in the evolution of personal AI assistants. It's not perfect — onboarding is still rough, there are occasional bugs, and it requires technical knowledge — but it's genuinely useful in ways Siri never achieved.&lt;/p&gt;

&lt;p&gt;For developers and technical power users, it's a transformative tool. For the average user, it's probably better to wait for more polished versions.&lt;/p&gt;

&lt;p&gt;What's clear is that the model of "assistant that lives in your infrastructure and contacts you proactively" is the future. Big Tech will eventually arrive. The open-source community got there first.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;GitHub&lt;/strong&gt;: &lt;a href="https://github.com/clawdbot/clawdbot" rel="noopener noreferrer"&gt;github.com/clawdbot/clawdbot&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Analysis by ZOOPA Research | 498AS Innovation Lab | January 2026&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>productivity</category>
      <category>selfhosted</category>
    </item>
  </channel>
</rss>
