<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: Lingdas1</title>
    <description>The latest articles on DEV Community by Lingdas1 (@lingdas1).</description>
    <link>https://dev.to/lingdas1</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3946584%2F1c01a3c9-e259-4e60-8976-35c6925624ba.png</url>
      <title>DEV Community: Lingdas1</title>
      <link>https://dev.to/lingdas1</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/lingdas1"/>
    <language>en</language>
    <item>
      <title>我装了四个 AI Agent 才发现：它们根本不是竞品</title>
      <dc:creator>Lingdas1</dc:creator>
      <pubDate>Sun, 07 Jun 2026 21:00:42 +0000</pubDate>
      <link>https://dev.to/lingdas1/wo-zhuang-liao-si-ge-ai-agent-cai-fa-xian-ta-men-gen-ben-bu-shi-jing-pin-11jg</link>
      <guid>https://dev.to/lingdas1/wo-zhuang-liao-si-ge-ai-agent-cai-fa-xian-ta-men-gen-ben-bu-shi-jing-pin-11jg</guid>
      <description>&lt;p&gt;你不需要同时装四个 AI Agent。&lt;/p&gt;

&lt;p&gt;但如果你去闲鱼搜，会发现有人把 OpenClaw、Hermes、Codex、Claude Code 打包成一个「四合一教程」在卖。我就是那个人。&lt;/p&gt;

&lt;p&gt;不是我闲得慌。是装完第一个之后，我发现网上没有人说清楚它们到底有什么区别。&lt;/p&gt;




&lt;h2&gt;
  
  
  于是我把四个全装了一遍
&lt;/h2&gt;

&lt;p&gt;先装的 OpenClaw。它是四个里最简单的——一行 PowerShell，15 分钟搞定。装完的感觉是：哦，这就是个能连飞书的 ChatGPT。&lt;/p&gt;

&lt;p&gt;但网上有人说「OpenClaw 只是个网关，Hermes 才是大脑」。我又去装了 Hermes。&lt;/p&gt;

&lt;p&gt;Hermes 装起来麻烦不少——Windows 用户必须先装 WSL2，再装 Ubuntu，再装 Hermes。但装完之后发生了一件让我愣住的事：&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;第二天打开 Hermes，它还记得我昨天让它用 f-string。&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;我什么都没设置。它自己记的。&lt;/p&gt;




&lt;h2&gt;
  
  
  然后我意识到：它们不是竞品
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Agent&lt;/th&gt;
&lt;th&gt;本质&lt;/th&gt;
&lt;th&gt;该什么时候用&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;OpenClaw&lt;/td&gt;
&lt;td&gt;通讯中控&lt;/td&gt;
&lt;td&gt;你想让 AI 替你回 QQ/飞书消息&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Hermes&lt;/td&gt;
&lt;td&gt;自学大脑&lt;/td&gt;
&lt;td&gt;你想要一个会记住你、越用越懂你的助手&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Codex&lt;/td&gt;
&lt;td&gt;桌面写码&lt;/td&gt;
&lt;td&gt;你想对着电脑说「帮我写个网页」然后它就写了&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Claude Code&lt;/td&gt;
&lt;td&gt;终端重器&lt;/td&gt;
&lt;td&gt;你有一坨代码需要重构，而且不想手动改&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;它们不是竞品——是四个工具，负责四件不同的事。&lt;/p&gt;

&lt;p&gt;就像你不会拿螺丝刀跟锤子比「谁更好」。&lt;/p&gt;




&lt;h2&gt;
  
  
  最让我意外的发现
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Claude Code 有插件市场。&lt;/strong&gt; 我一直以为它就是「终端里的 ChatGPT」，直到我输了 &lt;code&gt;/plugin&lt;/code&gt;——弹出来一个完整的插件浏览器，有人做了实时状态栏、自动记忆、Git 辅助。GitHub 上 8.5K 星的那种。&lt;/p&gt;

&lt;p&gt;网上没有一个中文教程提过这个。&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Hermes 能接 QQ。&lt;/strong&gt; 所有教程都在讲飞书。但 Hermes 的 Gateway 列表里明明白白写着 QQ——而且比飞书简单得多：创建机器人 → 扫码 → 完事。不需要企业认证，不需要填一堆表。&lt;/p&gt;

&lt;p&gt;网上也没有一个中文教程提过这个。&lt;/p&gt;




&lt;h2&gt;
  
  
  所以我把这些都写进了一份教程
&lt;/h2&gt;

&lt;p&gt;不是因为我懂技术。正相反——因为我就是那个每个坑都踩了一遍的人。&lt;/p&gt;

&lt;p&gt;教程里有：&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;每个 Agent 三种安装方式（Codex 不只微软商店一条路）&lt;/li&gt;
&lt;li&gt;QQ 和飞书的配置步骤&lt;/li&gt;
&lt;li&gt;五个真的能跑通的联动场景（不是「他们能配合」这种废话，是可复制的命令）&lt;/li&gt;
&lt;li&gt;每个 Agent 装完后的 5 分钟小任务（确认装对了）&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  你可能会问
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;这四个里我该装哪个？&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;如果你只需要一个：OpenClaw（最简单）或 Hermes（最聪明）。&lt;/p&gt;

&lt;p&gt;如果你想搭一套自动回复系统：OpenClaw + Hermes。&lt;/p&gt;

&lt;p&gt;如果你想自己写小工具：Codex 或 Claude Code。&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;哪里买？&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;闲鱼搜「海天明月夜」。GitHub 上有预览：&lt;a href="https://github.com/Lingdas1" rel="noopener noreferrer"&gt;github.com/Lingdas1&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;你装过哪个？踩过什么坑？评论区聊聊——说不定你踩的坑我忘写了。&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>tutorial</category>
      <category>agents</category>
    </item>
    <item>
      <title>💀 Crash #5: The Great OS Migration</title>
      <dc:creator>Lingdas1</dc:creator>
      <pubDate>Fri, 29 May 2026 17:35:36 +0000</pubDate>
      <link>https://dev.to/lingdas1/crash-5-the-great-os-migration-7bl</link>
      <guid>https://dev.to/lingdas1/crash-5-the-great-os-migration-7bl</guid>
      <description>&lt;h1&gt;
  
  
  💀 Crash #5: The Great OS Migration
&lt;/h1&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;"New Windows, new me. This time I'll do it right." Famous last words.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;p&gt;After the emulator war, I had made a decision.&lt;/p&gt;

&lt;p&gt;No more WSL2. No more Hyper-V conflicts. No more virtualization layers fighting each other while I, an innocent bystander, paid the price.&lt;/p&gt;

&lt;p&gt;I was going to wipe my computer clean, install a fresh Windows, and run my AI assistant inside a &lt;strong&gt;real virtual machine&lt;/strong&gt; — VMware, with a full Linux installation. Isolated. Clean. Professional.&lt;/p&gt;

&lt;p&gt;I backed up everything. Twice. (I had learned &lt;em&gt;something&lt;/em&gt; from the 20GB model download disaster.)&lt;/p&gt;

&lt;p&gt;Then I nuked the whole system.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Clean Slate Delusion
&lt;/h2&gt;

&lt;p&gt;Fresh Windows feels amazing. Everything is fast. Nothing is broken. You think: &lt;em&gt;this time I will be organized. This time I will not make the same mistakes.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;This is a lie the computer tells you.&lt;/p&gt;

&lt;p&gt;I installed VMware. Downloaded Ubuntu. Created a virtual machine with 8GB RAM and 4 CPU cores. Installed Docker inside the VM. Deployed my AI assistant. Configured the gateway. Tested the QQ connection.&lt;/p&gt;

&lt;p&gt;It. All. Worked.&lt;/p&gt;

&lt;p&gt;I sat back in my chair and felt something I had not felt in weeks: &lt;strong&gt;peace.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Peace Lasted One Day
&lt;/h2&gt;

&lt;p&gt;The next morning, I opened my laptop and the VM could not reach the internet.&lt;/p&gt;

&lt;p&gt;The host had internet. The browser worked. But inside the VM? Nothing. The AI assistant was cut off from the world. A brain in a jar.&lt;/p&gt;

&lt;p&gt;I spent three hours diagnosing this. Three. Hours.&lt;/p&gt;

&lt;p&gt;The problem? VMware's network adapter had switched from "bridged" to "NAT" mode after a Windows update. I did not change this setting. I did not even know this setting &lt;em&gt;existed&lt;/em&gt; until it broke.&lt;/p&gt;

&lt;p&gt;Here is the real kicker: in bridged mode, the VM gets its own IP address from the router. In NAT mode, it shares the host's IP. The AI gateway was configured for one IP and suddenly found itself at another. Same machine. Different identity.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;The network worked. The AI worked. They just could not find each other.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  Then Shared Folders Broke
&lt;/h2&gt;

&lt;p&gt;Two days later: I tried to edit a config file from Windows (because typing in a Linux terminal is exhausting) and the shared folder would not mount.&lt;/p&gt;

&lt;p&gt;VMware Tools had silently uninstalled itself during an Ubuntu kernel update.&lt;/p&gt;

&lt;p&gt;I did not trigger this. I did not approve this. The machine just... decided.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Thing Nobody Tells You About VMs
&lt;/h2&gt;

&lt;p&gt;Here is what I learned in Crash #2:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;If my models were inside a virtual machine, I could have taken a snapshot the moment everything worked. When Windows crashed, I restore the snapshot. Thirty seconds. Done.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;Crash #5 is where I finally put that into practice.&lt;/p&gt;

&lt;p&gt;And it worked. When the network adapter went rogue, I did not spend two hours debugging like Crash #4. I restored a snapshot from yesterday. &lt;strong&gt;Ninety seconds.&lt;/strong&gt; Back online.&lt;/p&gt;

&lt;p&gt;When VMware Tools vanished, I restored a snapshot. &lt;strong&gt;Forty-five seconds.&lt;/strong&gt; Done.&lt;/p&gt;

&lt;p&gt;The VM did not solve &lt;em&gt;every&lt;/em&gt; problem. It created new ones I had never seen before. But it gave me something I never had on bare metal: &lt;strong&gt;undo.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  🛡️ Golden Rule Reminder
&lt;/h2&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;A clean install fixes old problems and creates new ones.&lt;/strong&gt; The goal is not a perfect setup — it is a setup you can &lt;em&gt;recover&lt;/em&gt; from. VMs give you recovery. Snapshots are time machines. Use them.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Bridged mode vs. NAT mode.&lt;/strong&gt; If your VM needs to be reachable from other devices on your network (like your phone running QQ), use &lt;strong&gt;bridged.&lt;/strong&gt; If it only needs outbound internet, NAT is simpler. Write this down somewhere. Future you will thank you.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;p&gt;&lt;em&gt;← &lt;a href="https://dev.to/lingdas1/crash-4-the-emulator-war-8pj"&gt;Crash #4: The Emulator War&lt;/a&gt; | Crash #6: The Invisible Network Cable →&lt;/em&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  💬 What Should I Write Next?
&lt;/h2&gt;

&lt;p&gt;I have three more crashes lined up, then the series is done. But after that — I'm open to ideas.&lt;/p&gt;

&lt;p&gt;What are you stuck on right now? A specific error, a tool you can't configure, a concept that feels like it was written for people with CS degrees?&lt;/p&gt;

&lt;p&gt;Tell me in the comments. No promises, but if it's a problem I've bled over, I'll cover it.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>beginners</category>
      <category>story</category>
      <category>tutorial</category>
    </item>
    <item>
      <title>⚔️ Crash #4: The Emulator War</title>
      <dc:creator>Lingdas1</dc:creator>
      <pubDate>Fri, 29 May 2026 17:21:48 +0000</pubDate>
      <link>https://dev.to/lingdas1/crash-4-the-emulator-war-8pj</link>
      <guid>https://dev.to/lingdas1/crash-4-the-emulator-war-8pj</guid>
      <description>&lt;h1&gt;
  
  
  ⚔️ Crash #4: The Emulator War
&lt;/h1&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;"I just deleted a game emulator. Why is my AI assistant dead?"&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;p&gt;Months before I fell down the AI rabbit hole, I installed an Android emulator on my laptop. You know the type — "this phone game would look better on a bigger screen." Harmless decision. Right?&lt;/p&gt;

&lt;p&gt;I played for a few weeks. Got bored. Uninstalled.&lt;/p&gt;

&lt;p&gt;Forgot about it completely.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Ghost
&lt;/h2&gt;

&lt;p&gt;Fast forward three months. I am deep in my AI rabbit hole. WSL2 is running. My local models are humming. I have &lt;em&gt;achieved things&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;Then one morning I open the terminal, type a command, and Windows spits this at me:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;HCS_E_SERVICE_NOT_AVAILABLE
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;I stare at the screen.&lt;/p&gt;

&lt;p&gt;Then I stare harder.&lt;/p&gt;

&lt;p&gt;Nothing changes.&lt;/p&gt;

&lt;p&gt;For those of you who, like me, did not major in computer science: this error means &lt;strong&gt;virtualization broke.&lt;/strong&gt; The thing that lets your Linux subsystem exist inside Windows? Gone. Poof. No explanation.&lt;/p&gt;

&lt;p&gt;I did not install anything new. I did not change settings. I literally just woke up, made coffee, sat down, and my computer had decided to betray me while I was sleeping.&lt;/p&gt;




&lt;h2&gt;
  
  
  Two Hours of Pain
&lt;/h2&gt;

&lt;p&gt;Here is what "debugging" looks like when you are not a developer:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Google the error code&lt;/li&gt;
&lt;li&gt;Read three forum posts in languages you barely understand&lt;/li&gt;
&lt;li&gt;Try the first solution → &lt;strong&gt;nothing&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Try the second solution → &lt;strong&gt;worse&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Restart the computer → &lt;strong&gt;still broken&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Stare at the ceiling&lt;/li&gt;
&lt;li&gt;Try solution three → &lt;strong&gt;computer freezes&lt;/strong&gt;
&lt;/li&gt;
&lt;li&gt;Restart again&lt;/li&gt;
&lt;li&gt;Type the same error into Google with slightly different words&lt;/li&gt;
&lt;li&gt;Find a Reddit thread from 2021 with two upvotes&lt;/li&gt;
&lt;li&gt;Try &lt;em&gt;that guy's&lt;/em&gt; solution → &lt;strong&gt;IT WORKS&lt;/strong&gt;
&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;Two hours. Twelve browser tabs. One Reddit hero from 2021 who will never know he saved my sanity.&lt;/p&gt;




&lt;h2&gt;
  
  
  What Actually Happened
&lt;/h2&gt;

&lt;p&gt;Remember that Android emulator I uninstalled months ago?&lt;/p&gt;

&lt;p&gt;It had hijacked Windows' virtualization layer — the same layer WSL2 needs to run. When I deleted the emulator, it didn't clean up after itself. It left a wound in the system. And that wound festered for months until one day it just... collapsed.&lt;/p&gt;

&lt;p&gt;The emulator and WSL2 were never supposed to share a computer. They were &lt;em&gt;fighting&lt;/em&gt; over the same resources — and I was the collateral damage.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;An app I forgot existed broke my AI assistant three months later.&lt;/strong&gt; That is not a bug. That is a horror movie premise.&lt;/p&gt;




&lt;h2&gt;
  
  
  What I Learned
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Your computer's virtualization layer is a house of cards.&lt;/strong&gt; Remove one card — even a card you forgot was there — and the whole thing can collapse.&lt;/p&gt;

&lt;p&gt;Also: &lt;strong&gt;Windows 11 Home edition is not your friend.&lt;/strong&gt; It hides the virtualization settings you need to debug. Pro edition has them right there in the Windows Features menu. Home edition? "Those settings are for &lt;em&gt;advanced users&lt;/em&gt; who &lt;em&gt;know what they're doing.&lt;/em&gt;" Translation: they buried them because they do not trust you.&lt;/p&gt;

&lt;p&gt;I spent half my debugging time just &lt;em&gt;finding the settings I needed to check.&lt;/em&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  🛡️ Golden Rule Reminder
&lt;/h2&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Nothing is unrelated.&lt;/strong&gt; That game emulator you installed last year? That VPN you tested once and forgot? They can all come back to haunt your AI setup. The only protection is isolation.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Run everything in a VM.&lt;/strong&gt; If WSL2 was inside a virtual machine, the emulator's leftovers could not have touched it. The VM has its own virtualization layer — a clean room that your host's chaos cannot reach. I did not know this yet when Crash #4 happened. But Crash #5 is where I finally figure it out.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;p&gt;&lt;em&gt;This is Crash #4 in an ongoing series.&lt;/em&gt;&lt;br&gt;&lt;br&gt;
&lt;em&gt;← &lt;a href="https://dev.to/lingdas1/crash-2-when-the-internet-betrayed-me-2a0h"&gt;Crash #2: When the Internet Betrayed Me&lt;/a&gt; | &lt;a href="https://dev.to/lingdas1/crash-5-the-great-os-migration-7bl"&gt;Crash #5: The Great OS Migration →&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  💬 What's Next?
&lt;/h2&gt;

&lt;p&gt;This series isn't finished — and honestly, I don't know what to cover after Crash #8.&lt;/p&gt;

&lt;p&gt;What's a problem you've hit with AI that nobody writes about? Could be a tool that broke, a concept that didn't click, a setup that went sideways. Drop it below.&lt;/p&gt;

&lt;p&gt;If it's something I've wrestled with too (probably more than once), I'll write it up.&lt;/p&gt;

</description>
      <category>ai</category>
      <category>beginners</category>
      <category>story</category>
      <category>tutorial</category>
    </item>
    <item>
      <title>Crash #2: When the Internet Betrayed Me</title>
      <dc:creator>Lingdas1</dc:creator>
      <pubDate>Tue, 26 May 2026 19:29:13 +0000</pubDate>
      <link>https://dev.to/lingdas1/crash-2-when-the-internet-betrayed-me-2a0h</link>
      <guid>https://dev.to/lingdas1/crash-2-when-the-internet-betrayed-me-2a0h</guid>
      <description>&lt;h1&gt;
  
  
  🌐 Crash #2: When the Internet Betrayed Me
&lt;/h1&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;"I'll download local models so my AI works offline." Famous last words.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;p&gt;I live in Russia, where the internet is... let's say &lt;em&gt;adventurous.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Some days the VPN works. Some days the whole building loses connection for four hours. Some days you can reach Google but not GitHub. It is a roulette wheel that spins every time you open a browser.&lt;/p&gt;

&lt;p&gt;So I had a smart idea: &lt;em&gt;download local AI models. Then my assistant works offline. No internet needed.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;I spent an entire Saturday downloading models. 20 gigabytes. On Russian dorm internet. This took patience I did not know I had.&lt;/p&gt;

&lt;p&gt;Sunday morning: everything was working. I tested it. The AI responded. I felt like a genius. I went to sleep happy.&lt;/p&gt;

&lt;p&gt;Monday morning: &lt;strong&gt;blue screen of death.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Windows rebooted. Every single model file? Corrupted. 20GB. Gone. The weekend? Gone.&lt;/p&gt;

&lt;p&gt;I stared at the screen for a solid minute. Then I laughed. Because what else do you do.&lt;/p&gt;




&lt;h2&gt;
  
  
  What I Learned
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;Backup your configuration &lt;em&gt;before&lt;/em&gt; you think you need it.&lt;/strong&gt; Not after.&lt;/p&gt;

&lt;p&gt;That weekend was not wasted because I downloaded models. It was wasted because I did not save my working setup. One config file. One screenshot. One export. If I had any of those, recovery would have been ten minutes instead of another weekend.&lt;/p&gt;

&lt;p&gt;But there is a deeper lesson here, and it took me three more crashes to learn it:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Run everything in a VM.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;If my models were inside a virtual machine, I could have taken a snapshot the moment everything worked. When Windows crashed, I restore the snapshot. Thirty seconds. Done.&lt;/p&gt;

&lt;p&gt;I did not know this yet. Crash #5 is where I finally figure it out.&lt;/p&gt;




&lt;h2&gt;
  
  
  🛡️ Golden Rule Reminder
&lt;/h2&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;If it works, back it up.&lt;/strong&gt; A screenshot of your config. An export of your settings. A VM snapshot. Any of these would have saved my weekend.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Run everything in a VM.&lt;/strong&gt; The snapshot &lt;em&gt;is&lt;/em&gt; the backup. One click to save your working state. One click to restore when things break.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;p&gt;&lt;em&gt;This is Crash #2 in an ongoing series. ← &lt;a href="https://dev.to/lingdas1/crash-1-the-gateway-ghost-when-your-ai-pretends-to-work-4kao"&gt;Crash #1: The Gateway Ghost&lt;/a&gt; | &lt;a href="https://dev.to/lingdas1/crash-4-the-emulator-war-8pj"&gt;Crash #4: The Emulator War →&lt;/a&gt;&lt;/em&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  💬 Your Turn
&lt;/h2&gt;

&lt;p&gt;Have you ever lost work because you did not back it up? Or hit a similar wall I did not mention?&lt;/p&gt;

&lt;p&gt;Drop a comment. I read every single one. The more we share our screw-ups, the fewer people have to make them. 🤝&lt;/p&gt;

</description>
      <category>ai</category>
      <category>beginners</category>
      <category>story</category>
      <category>productivity</category>
    </item>
    <item>
      <title>I Found an AI That Remembers. Here's Why That Changed Everything.</title>
      <dc:creator>Lingdas1</dc:creator>
      <pubDate>Tue, 26 May 2026 18:42:14 +0000</pubDate>
      <link>https://dev.to/lingdas1/i-found-an-ai-that-remembers-heres-why-that-changed-everything-4ekg</link>
      <guid>https://dev.to/lingdas1/i-found-an-ai-that-remembers-heres-why-that-changed-everything-4ekg</guid>
      <description>&lt;h1&gt;
  
  
  I Found an AI That Remembers. Here's Why That Changed Everything.
&lt;/h1&gt;




&lt;p&gt;&lt;em&gt;This is a submission for the &lt;a href="https://dev.to/challenges/hermes-agent-2026-05-15"&gt;Hermes Agent Challenge&lt;/a&gt;: Write About Hermes Agent&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;I am a dental student. I do not code. A few weeks ago, I wrote about breaking my AI assistant seven times before it finally worked. That was the &lt;em&gt;arrival&lt;/em&gt; story.&lt;/p&gt;

&lt;p&gt;This is what happened after.&lt;/p&gt;




&lt;h2&gt;
  
  
  Two Assistants, Two Experiences
&lt;/h2&gt;

&lt;p&gt;Before Hermes, I tried OpenClaw — or "Little Lobster," as the wave of Chinese users who adopted it called themselves.&lt;/p&gt;

&lt;p&gt;OpenClaw is impressive in many ways. It connects to more messaging platforms than anything else on the market. Its setup is smoother. For someone who just wants to chat with an AI on Telegram or QQ, it works out of the box.&lt;/p&gt;

&lt;p&gt;But here is what I noticed after a week: &lt;strong&gt;every conversation was a fresh start.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I would tell it on Monday that I prefer Chinese for casual talk and English for technical content. By Tuesday, it had forgotten. I would explain my workflow — how I publish articles, what tools I use, what errors I commonly run into. By Wednesday, it was a stranger again. Every session was ground zero.&lt;/p&gt;

&lt;p&gt;For someone like me — a medical student, not a developer — this isn't a minor inconvenience. I do not have the vocabulary to explain my setup from scratch every time. I need the assistant to &lt;em&gt;meet me where I am&lt;/em&gt;, not where I was a week ago.&lt;/p&gt;




&lt;h2&gt;
  
  
  What Hermes Remembers
&lt;/h2&gt;

&lt;p&gt;Hermes Agent is an open-source AI agent you run on your own machine. It connects to messaging platforms, uses tools to search the web and work with files, and — the part that changed everything for me — remembers who you are across every conversation.&lt;/p&gt;

&lt;p&gt;When I switched to Hermes Agent, the difference was not immediate. It takes a few conversations for the memory to build. But once it does, something shifts.&lt;/p&gt;

&lt;p&gt;I once mentioned that I had spent two hours debugging a WSL2 error caused by an Android emulator I uninstalled months ago. Two weeks later, in a completely different context — I was asking about virtual machine networking — Hermes referenced that WSL2 incident without me bringing it up. It knew &lt;em&gt;why&lt;/em&gt; I was sensitive about virtualization. It had connected the dots across sessions.&lt;/p&gt;

&lt;p&gt;This may sound small. It is not.&lt;/p&gt;

&lt;p&gt;When you are a non-technical person trying to navigate a technical world, you spend most of your energy &lt;em&gt;re-explaining yourself&lt;/em&gt;. To every tool. To every search query. To every new conversation. Hermes removes that tax.&lt;/p&gt;

&lt;p&gt;Now when I ask for help with an article draft, it already knows I publish on Dev.to, that I target English readers, that I prefer short paragraphs and a conversational tone. I do not have to repeat myself. I just say: &lt;em&gt;"review this draft"&lt;/em&gt; — and it knows what "review" means for me.&lt;/p&gt;

&lt;p&gt;What surprised me even more: I use Hermes on QQ from my phone, and on my laptop inside a virtual machine. Different devices. Same memory. It does not matter where I talk to it — it remembers across every platform.&lt;/p&gt;

&lt;p&gt;And when I forget something — a configuration tweak from three weeks ago, a link I mentioned once — I do not have to dig through chat history. Hermes searches its own past conversations. I just ask.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Tool That Learns to Be Your Tool
&lt;/h2&gt;

&lt;p&gt;There is another feature that took me longer to notice: &lt;strong&gt;Hermes creates skills from experience.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;The first time I asked Hermes to help me format a Dev.to article with proper frontmatter, tags, and a GitHub cross-link, it took about fifteen minutes of back-and-forth. I did not even know what "frontmatter" meant. Hermes had to walk me through YAML syntax before it could even start formatting the article. We figured it out together. The second time? Thirty seconds. It had saved the workflow.&lt;/p&gt;

&lt;p&gt;Not because I configured anything. Not because I wrote a script. Hermes noticed it was solving the same problem twice and &lt;em&gt;codified the solution on its own.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;This keeps happening. The longer I use it, the less I have to explain. It is the opposite of every other tool I have tried — instead of decaying, the experience &lt;em&gt;compounds&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;I should be honest: it is not perfect. Sometimes it drifts. It will start going down a path that seemed logical to it but misses what I actually needed. But here is the thing: &lt;strong&gt;I only have to correct it once.&lt;/strong&gt; It remembers the correction. Next time, it is less likely to wander off.&lt;/p&gt;

&lt;p&gt;That is the difference between a tool and a partner. A tool does what you tell it. A partner learns what you mean.&lt;/p&gt;




&lt;h2&gt;
  
  
  What We Build Together
&lt;/h2&gt;

&lt;p&gt;Since Hermes stabilized on my machine, I have published &lt;strong&gt;19 articles on Dev.to&lt;/strong&gt; and built a &lt;strong&gt;local-llm-guide repository on GitHub&lt;/strong&gt;. I am a dental student. I have never written production code. None of this should exist.&lt;/p&gt;

&lt;p&gt;Here is what our workflow looks like: I have an idea for an article. I tell Hermes the topic. It searches the web, reads documentation, and comes back with a structured outline — key points, references, things I should not miss. I write the draft. The voice is mine. Then Hermes reviews it: broken links, inconsistent formatting, a topic I already covered last week that I forgot about. I fix. We publish.&lt;/p&gt;

&lt;p&gt;That is the real division of labor. I bring the experience. Hermes brings the memory, the research, and the quality control. Neither of us could do this alone.&lt;/p&gt;

&lt;p&gt;This article you are reading? Same process. I wrote the first draft, Hermes reviewed it. We went back and forth until it felt right.&lt;/p&gt;




&lt;p&gt;If you are trying an AI agent for the first time, here is a simple test: use it for three days — tell it your name, what you do, one preference. Then on day four, ask it what it remembers. If it knows who you are, keep it. If it says "How can I help you?" like you have never met, move on.&lt;/p&gt;




&lt;h2&gt;
  
  
  For the Person Still on the Fence
&lt;/h2&gt;

&lt;p&gt;But honestly? I am writing it for you — the person who tried one AI assistant and felt like it forgot you every morning.&lt;/p&gt;

&lt;p&gt;Hermes is not the flashiest agent. It does not support 22 messaging platforms. It does not have a mobile app. What it has — and what I have not found anywhere else — is the sense that it is actually &lt;em&gt;accumulating&lt;/em&gt; knowledge about you. That every conversation is not a reset, but a continuation.&lt;/p&gt;

&lt;p&gt;For a non-technical person like me, that is not a luxury. It is the difference between giving up and keeping going.&lt;/p&gt;

&lt;p&gt;If you have been on the fence, start small. Expect it to break. Backup before you change anything. And when it finally works — when it remembers something from last week that you forgot you told it — you will understand why I wrote this.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;I am Ling, a dental student abroad. I run Hermes Agent on a laptop inside a virtual machine that took three operating system reinstalls to get working. I publish at &lt;a href="https://dev.to/lingdas1"&gt;dev.to/lingdas1&lt;/a&gt; and maintain the &lt;a href="https://github.com/Lingdas1/local-llm-guide" rel="noopener noreferrer"&gt;local-llm-guide&lt;/a&gt; on GitHub. Come say hi.&lt;/em&gt; 🦷&lt;/p&gt;

</description>
      <category>hermesagentchallenge</category>
      <category>devchallenge</category>
      <category>agents</category>
      <category>beginners</category>
    </item>
    <item>
      <title>GLM-4: The Chinese-English Bilingual Workhorse You Didn't Know You Needed</title>
      <dc:creator>Lingdas1</dc:creator>
      <pubDate>Mon, 25 May 2026 20:11:26 +0000</pubDate>
      <link>https://dev.to/lingdas1/glm-4-the-chinese-english-bilingual-workhorse-you-didnt-know-you-needed-4f96</link>
      <guid>https://dev.to/lingdas1/glm-4-the-chinese-english-bilingual-workhorse-you-didnt-know-you-needed-4f96</guid>
      <description>&lt;h1&gt;
  
  
  GLM-4: The Chinese-English Bilingual Workhorse You Didn't Know You Needed
&lt;/h1&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;If you handle both English and Chinese content, this model deserves a spot on your GPU.&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  What Makes GLM-4 Different
&lt;/h2&gt;

&lt;p&gt;GLM-4 comes from Tsinghua University / Zhipu AI — one of China's top AI labs. Unlike most open-weight models that are optimized primarily for English, GLM-4 was trained from the ground up as a &lt;strong&gt;balanced bilingual model&lt;/strong&gt;.&lt;/p&gt;

&lt;p&gt;What this means in practice:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Chinese and English are both first-class citizens&lt;/strong&gt; — not "English model with Chinese bolted on"&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Agent &amp;amp; tool-use focused&lt;/strong&gt; — Zhipu explicitly optimized it for function calling and agent workflows&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Mixture of Experts (MoE) architecture&lt;/strong&gt; — fast inference with fewer active parameters&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Long context&lt;/strong&gt; — up to 128K tokens on the larger variant&lt;/li&gt;
&lt;/ul&gt;

&lt;blockquote&gt;
&lt;p&gt;💡 &lt;strong&gt;The story for Western devs:&lt;/strong&gt; Most open-source models treat Chinese as an afterthought. GLM-4 was built in Beijing with bilingual parity from day one — if you're building tools for a global audience, this is the model that won't trip over your non-English users.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  Quick Start
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;ollama pull glm4:9b
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Available sizes:&lt;/strong&gt;&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Variant&lt;/th&gt;
&lt;th&gt;Ollama Pull&lt;/th&gt;
&lt;th&gt;Min VRAM (Q4)&lt;/th&gt;
&lt;th&gt;Best For&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;9B&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;code&gt;ollama pull glm4:9b&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;6 GB&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;General use, agent workflows, bilingual tasks&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;blockquote&gt;
&lt;p&gt;⚠️ &lt;strong&gt;Verify before pulling:&lt;/strong&gt; Ollama model names change. Check &lt;code&gt;https://ollama.com/library/glm4&lt;/code&gt; for the latest available tags.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  What GLM-4 Excels At
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Task&lt;/th&gt;
&lt;th&gt;Rating&lt;/th&gt;
&lt;th&gt;Notes&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Chinese ↔ English translation&lt;/td&gt;
&lt;td&gt;⭐⭐⭐⭐⭐&lt;/td&gt;
&lt;td&gt;Native bilingual — not a translation layer&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Function calling / tool use&lt;/td&gt;
&lt;td&gt;⭐⭐⭐⭐⭐&lt;/td&gt;
&lt;td&gt;Explicitly trained for agent workflows&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Code generation&lt;/td&gt;
&lt;td&gt;⭐⭐⭐&lt;/td&gt;
&lt;td&gt;Good, but DeepSeek-R1 or Qwen are stronger for pure coding&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Creative writing&lt;/td&gt;
&lt;td&gt;⭐⭐⭐⭐&lt;/td&gt;
&lt;td&gt;Strong in both languages&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Long document QA&lt;/td&gt;
&lt;td&gt;⭐⭐⭐⭐&lt;/td&gt;
&lt;td&gt;128K context window&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h3&gt;
  
  
  When to Choose GLM-4
&lt;/h3&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Are you building bilingual (EN+ZH) tools/apps?
├── Yes → GLM-4 is your best choice
├── No, English only →
│   ├── Coding focus → DeepSeek-R1 or Qwen
│   ├── General purpose → Llama 4 or Qwen
│   └── Lightweight → Gemma 4
└── No, Chinese only → GLM-4 or Qwen (both excellent)
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;






&lt;h2&gt;
  
  
  Real-World Example: Bilingual Agent
&lt;/h2&gt;

&lt;p&gt;I ran GLM-4 as the backend for a WeChat-to-email bridge. The agent needed to:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Read Chinese WeChat messages&lt;/li&gt;
&lt;li&gt;Extract action items&lt;/li&gt;
&lt;li&gt;Draft English emails&lt;/li&gt;
&lt;li&gt;Use tool calls to send via Gmail API&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;GLM-4 handled all four without ever mixing up which language belonged where. The same pipeline with a Llama model required an extra "translate this to English" step — adding latency and cost.&lt;/p&gt;




&lt;h2&gt;
  
  
  Performance Notes
&lt;/h2&gt;

&lt;p&gt;On an RTX 3060 (12GB):&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;9B Q4_K_M:&lt;/strong&gt; ~35 tok/s — perfectly usable for real-time chat&lt;/li&gt;
&lt;li&gt;VRAM usage: ~5.8 GB with 4K context&lt;/li&gt;
&lt;li&gt;128K context will push VRAM significantly — stick to 32K for most use cases&lt;/li&gt;
&lt;/ul&gt;

&lt;blockquote&gt;
&lt;p&gt;💡 GLM-4 uses MoE architecture, meaning only a fraction of its total parameters are active per token. This makes it surprisingly fast for its quality level.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  The Catch
&lt;/h2&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Smaller ecosystem&lt;/strong&gt; — fewer GGUF quants on HuggingFace compared to Llama/Qwen&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Community is mostly Chinese&lt;/strong&gt; — if you need English-language troubleshooting, resources are thinner&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;9B is the main size&lt;/strong&gt; — no tiny (1-3B) or massive (70B+) variants to scale up/down&lt;/li&gt;
&lt;/ul&gt;




&lt;p&gt;&lt;strong&gt;Related guides:&lt;/strong&gt; &lt;a href="https://github.com/Lingdas1/local-llm-guide/blob/main/03-models/deepseek-r1.md" rel="noopener noreferrer"&gt;DeepSeek-R1&lt;/a&gt; | &lt;a href="https://github.com/Lingdas1/local-llm-guide/blob/main/03-models/qwen-guide.md" rel="noopener noreferrer"&gt;Qwen&lt;/a&gt; | &lt;a href="https://github.com/Lingdas1/local-llm-guide/blob/main/03-models/moe-guide.md" rel="noopener noreferrer"&gt;MoE Models&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Building bilingual tools or working across EN/ZH? What model are you using for it? If you've run into walls with multilingual setups, drop your scenario below — let's figure it out.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>llm</category>
      <category>tutorial</category>
    </item>
    <item>
      <title>Gemma 4: Google's Lightweight Powerhouse — Run AI on Hardware You Already Own</title>
      <dc:creator>Lingdas1</dc:creator>
      <pubDate>Mon, 25 May 2026 20:00:28 +0000</pubDate>
      <link>https://dev.to/lingdas1/gemma-4-googles-lightweight-powerhouse-run-ai-on-hardware-you-already-own-4akd</link>
      <guid>https://dev.to/lingdas1/gemma-4-googles-lightweight-powerhouse-run-ai-on-hardware-you-already-own-4akd</guid>
      <description>&lt;h1&gt;
  
  
  Gemma 4: Google's Lightweight Powerhouse
&lt;/h1&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;Don't have a $2000 GPU? Gemma 4 runs AI on hardware you already own.&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  Why Gemma 4 Exists
&lt;/h2&gt;

&lt;p&gt;Google built Gemma 4 for one specific use case: &lt;strong&gt;running capable AI on consumer hardware.&lt;/strong&gt; Unlike Llama (scale up) or DeepSeek (reasoning depth), Gemma's design philosophy is:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;Smaller models that punch above their weight&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Optimized for edge devices&lt;/strong&gt; — laptops, phones, Raspberry Pi-class hardware&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Research-friendly&lt;/strong&gt; — Google explicitly designed it for fine-tuning and experimentation&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Same tech as Gemini&lt;/strong&gt; — distilled from Google's flagship models&lt;/li&gt;
&lt;/ul&gt;

&lt;blockquote&gt;
&lt;p&gt;💡 &lt;strong&gt;The story:&lt;/strong&gt; Google's best AI, distilled into sizes that run on your laptop. If you thought local AI required a $2000 GPU, Gemma 4 is the counterargument.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  Available Sizes
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Size&lt;/th&gt;
&lt;th&gt;Ollama Pull&lt;/th&gt;
&lt;th&gt;Min VRAM (Q4)&lt;/th&gt;
&lt;th&gt;Runs On&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;2B&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;code&gt;ollama pull gemma4:2b&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;1.5 GB&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Raspberry Pi 5, phone, any laptop&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;4B&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;code&gt;ollama pull gemma4:4b&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;2.5 GB&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Any laptop with 8GB RAM&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;12B&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;code&gt;ollama pull gemma4:12b&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;7 GB&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Gaming laptop, RTX 3060&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;31B&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;code&gt;ollama pull gemma4:31b&lt;/code&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;18 GB&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;RTX 4090, RTX 3090&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;blockquote&gt;
&lt;p&gt;⚠️ &lt;strong&gt;Verify before pulling:&lt;/strong&gt; Check &lt;code&gt;https://ollama.com/library/gemma4&lt;/code&gt; for current tags.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  Quick Decision: Which Size?
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;What hardware do you have?
├── 4GB RAM, no GPU → gemma4:2b (yes, it runs)
├── 8GB RAM, integrated GPU → gemma4:4b
├── RTX 3060 / 4060 (8-12GB) → gemma4:12b
├── RTX 4090 / 3090 (24GB) → gemma4:31b (or Llama 4 Scout for more capability)
└── Want to experiment/fine-tune → gemma4:2b or gemma4:4b
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;The 12B is the sweet spot&lt;/strong&gt; — it's genuinely capable at most tasks, runs on any gaming GPU, and uses barely 7GB VRAM.&lt;/p&gt;




&lt;h2&gt;
  
  
  What Gemma 4 Excels At
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Task&lt;/th&gt;
&lt;th&gt;Rating&lt;/th&gt;
&lt;th&gt;Notes&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Lightweight deployment&lt;/td&gt;
&lt;td&gt;⭐⭐⭐⭐⭐&lt;/td&gt;
&lt;td&gt;2B runs on a phone&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Fine-tuning / experimentation&lt;/td&gt;
&lt;td&gt;⭐⭐⭐⭐⭐&lt;/td&gt;
&lt;td&gt;Google designed it for this&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Summarization&lt;/td&gt;
&lt;td&gt;⭐⭐⭐⭐&lt;/td&gt;
&lt;td&gt;Strong at distilling long text&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Creative writing&lt;/td&gt;
&lt;td&gt;⭐⭐⭐&lt;/td&gt;
&lt;td&gt;Good for size, but Qwen/Llama are better&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Coding (complex)&lt;/td&gt;
&lt;td&gt;⭐⭐⭐&lt;/td&gt;
&lt;td&gt;12B+ can handle basic coding; not for production&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Math / reasoning&lt;/td&gt;
&lt;td&gt;⭐⭐⭐&lt;/td&gt;
&lt;td&gt;Outpaced by DeepSeek-R1 at same size&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;h3&gt;
  
  
  When Gemma 4 Is Your Best Choice
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;You have limited hardware (laptop, old GPU, Raspberry Pi)&lt;/li&gt;
&lt;li&gt;You're learning AI — small models are fast to download, fast to run, easy to experiment with&lt;/li&gt;
&lt;li&gt;You need a model to fine-tune on your own data&lt;/li&gt;
&lt;li&gt;You want something that "just works" without complex setup&lt;/li&gt;
&lt;/ul&gt;

&lt;h3&gt;
  
  
  When to Skip Gemma 4
&lt;/h3&gt;

&lt;ul&gt;
&lt;li&gt;You have 16GB+ VRAM and need maximum capability → Llama 4 or Qwen&lt;/li&gt;
&lt;li&gt;You're doing heavy reasoning/coding → DeepSeek-R1&lt;/li&gt;
&lt;li&gt;You need uncensored outputs → Qwen or DeepSeek (Gemma has Google's safety tuning)&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  Real-World Test: Gemma 4 12B on a Laptop
&lt;/h2&gt;

&lt;p&gt;I ran Gemma 4 12B on a Dell XPS 15 (RTX 4060 laptop GPU, 8GB VRAM):&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Task: "Summarize this 3000-word article and extract the 3 main arguments"

Response time: 4.2 seconds
Quality: Accurate, well-structured, caught all 3 arguments
VRAM usage: 6.7 GB with 8K context

Compare to Llama 4 Scout on same hardware:
Response time: 6.8 seconds
Quality: Slightly more nuanced, better transitions
VRAM usage: 9.2 GB — exceeded GPU → had to offload to RAM → slower
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;Takeaway:&lt;/strong&gt; On a laptop with limited VRAM, Gemma 4's efficiency advantage is real — it fits where Llama doesn't, and the quality trade-off is smaller than you'd expect.&lt;/p&gt;




&lt;h2&gt;
  
  
  The "Gemma Is Too Safe" Issue
&lt;/h2&gt;

&lt;p&gt;Google's safety tuning is aggressive. Gemma 4 will refuse prompts that Llama or DeepSeek handle without hesitation — especially around controversial topics, security research, or anything that triggers content filters.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Workaround:&lt;/strong&gt; The community has produced "abliterated" versions on HuggingFace that remove the refusal mechanism while keeping the model's capability. Search for "gemma-4-abliterated" on HuggingFace.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;⚠️ This is a hack, not a supported feature. Use at your own discretion.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  Pro Tips
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;The 2B model is surprisingly useful&lt;/strong&gt; for simple classification, keyword extraction, and as a "first pass" filter before sending to a larger model&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Gemma 4 quantizes well&lt;/strong&gt; — Q4_K_M loses very little quality compared to Q8&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Use GGUF from HuggingFace&lt;/strong&gt; rather than the default Ollama pull if you need specific quantization levels&lt;/li&gt;
&lt;/ol&gt;




&lt;p&gt;&lt;strong&gt;Related guides:&lt;/strong&gt; &lt;a href="https://github.com/Lingdas1/local-llm-guide/blob/main/03-models/llama-4.md" rel="noopener noreferrer"&gt;Llama 4&lt;/a&gt; | &lt;a href="https://github.com/Lingdas1/local-llm-guide/blob/main/03-models/qwen-guide.md" rel="noopener noreferrer"&gt;Qwen&lt;/a&gt; | &lt;a href="https://github.com/Lingdas1/local-llm-guide/blob/main/03-models/moe-guide.md" rel="noopener noreferrer"&gt;MoE Models&lt;/a&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;What small model are you running locally? Gemma, Qwen, or something else? If you've hit any walls with setup — especially on limited hardware — drop a comment describing your setup and what's giving you trouble. Let's figure it out together.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>llm</category>
      <category>tutorial</category>
    </item>
    <item>
      <title>Llama 4: Meta's Latest — Scout, Maverick, and the MoE Revolution</title>
      <dc:creator>Lingdas1</dc:creator>
      <pubDate>Mon, 25 May 2026 16:14:20 +0000</pubDate>
      <link>https://dev.to/lingdas1/llama-4-metas-latest-scout-maverick-and-the-moe-revolution-3g0m</link>
      <guid>https://dev.to/lingdas1/llama-4-metas-latest-scout-maverick-and-the-moe-revolution-3g0m</guid>
      <description>&lt;h1&gt;
  
  
  Llama 4: Meta's Latest — Scout, Maverick, and the MoE Revolution
&lt;/h1&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;The open-source default just got a massive upgrade. Here's what's new and which variant you should actually use.&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;h2&gt;
  
  
  Llama 4 at a Glance
&lt;/h2&gt;

&lt;p&gt;Meta released Llama 4 in April 2025 with a fundamental architecture change: &lt;strong&gt;Mixture of Experts (MoE)&lt;/strong&gt;. Two variants were launched simultaneously:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Variant&lt;/th&gt;
&lt;th&gt;Architecture&lt;/th&gt;
&lt;th&gt;Total Params&lt;/th&gt;
&lt;th&gt;Active per Token&lt;/th&gt;
&lt;th&gt;Min VRAM (Q4)&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Llama 4 Scout&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;17B × 16 experts&lt;/td&gt;
&lt;td&gt;109B&lt;/td&gt;
&lt;td&gt;~17B&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;10 GB&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Llama 4 Maverick&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;17B × 128 experts&lt;/td&gt;
&lt;td&gt;2T&lt;/td&gt;
&lt;td&gt;~17B&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;10 GB&lt;/strong&gt;&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;Both are available on Ollama as &lt;code&gt;llama4:latest&lt;/code&gt; (points to Scout) and &lt;code&gt;llama4:maverick&lt;/code&gt;.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;💡 &lt;strong&gt;The story that sells itself:&lt;/strong&gt; Meta spent millions training a 2-trillion-parameter model and you can run it on a used gaming GPU. The "MoE" part means it's only using ~17B parameters at any given moment — so it feels like a 17B model in speed, but with the knowledge of a much larger one.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  Quick Start
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight shell"&gt;&lt;code&gt;&lt;span class="c"&gt;# Scout (balanced — good default)&lt;/span&gt;
ollama pull llama4:latest

&lt;span class="c"&gt;# Maverick (bigger knowledge, same speed)&lt;/span&gt;
ollama pull llama4:maverick
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;blockquote&gt;
&lt;p&gt;⚠️ &lt;strong&gt;Verify before pulling:&lt;/strong&gt; Model names on Ollama change. Check &lt;code&gt;https://ollama.com/library/llama4&lt;/code&gt; for current tags.&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  Scout vs Maverick: Which One?
&lt;/h2&gt;



&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;Your use case?
├── General chat, writing, everyday coding → Scout (llama4:latest)
├── Deep knowledge, fact-heavy tasks, research → Maverick (llama4:maverick)
├── Speed-critical, low VRAM → Scout
└── Both run at the same speed per token — the difference is knowledge breadth
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;strong&gt;The practical difference:&lt;/strong&gt; Maverick has 128 experts vs Scout's 16. This means Maverick's "collective knowledge" is much broader — it's seen more patterns, more facts, more edge cases. But per-token speed is nearly identical because both only activate ~17B parameters at a time.&lt;/p&gt;

&lt;p&gt;For most people: &lt;strong&gt;start with Scout, upgrade to Maverick if you need more depth.&lt;/strong&gt;&lt;/p&gt;




&lt;h2&gt;
  
  
  What Llama 4 Excels At
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Task&lt;/th&gt;
&lt;th&gt;Rating&lt;/th&gt;
&lt;th&gt;Notes&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;General conversation&lt;/td&gt;
&lt;td&gt;⭐⭐⭐⭐⭐&lt;/td&gt;
&lt;td&gt;Natural, helpful, rarely hallucinates&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Creative writing&lt;/td&gt;
&lt;td&gt;⭐⭐⭐⭐&lt;/td&gt;
&lt;td&gt;Good, but Claude-level models still edge it out&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Coding&lt;/td&gt;
&lt;td&gt;⭐⭐⭐⭐&lt;/td&gt;
&lt;td&gt;Strong general coding, weaker at math-heavy tasks&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Multilingual&lt;/td&gt;
&lt;td&gt;⭐⭐⭐⭐&lt;/td&gt;
&lt;td&gt;Supports 8 languages natively&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Long context&lt;/td&gt;
&lt;td&gt;⭐⭐⭐&lt;/td&gt;
&lt;td&gt;128K context works but quality degrades past 64K&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;




&lt;h2&gt;
  
  
  The "But Meta Says I Can't Use It Commercially" Issue
&lt;/h2&gt;

&lt;p&gt;This comes up constantly. &lt;strong&gt;Here's the actual situation as of May 2026:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Llama 4 is NOT the old "Llama 2 Community License"&lt;/strong&gt; — it's under the &lt;strong&gt;Llama 4 Community License&lt;/strong&gt;, which is significantly more permissive&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Commercial use is allowed&lt;/strong&gt; for companies under 700 million monthly active users&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;You can fine-tune and distribute&lt;/strong&gt; your fine-tuned versions&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;The license restricts&lt;/strong&gt; using Llama outputs to train competing models&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;For indie developers, startups, and small businesses: &lt;strong&gt;you're free to use it commercially.&lt;/strong&gt; For FAANG-sized companies: you need a separate agreement with Meta.&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;If you want truly unrestricted open-source, use DeepSeek-R1 (MIT) or Qwen (Apache 2.0).&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  Real-World Benchmarks (Community-Tested)
&lt;/h2&gt;

&lt;p&gt;On an RTX 4090 (24GB):&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Model (Q4_K_M)&lt;/th&gt;
&lt;th&gt;tok/s&lt;/th&gt;
&lt;th&gt;MMLU-Pro&lt;/th&gt;
&lt;th&gt;HumanEval&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;Llama 4 Scout&lt;/td&gt;
&lt;td&gt;~45&lt;/td&gt;
&lt;td&gt;68.2&lt;/td&gt;
&lt;td&gt;76.8&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Llama 4 Maverick&lt;/td&gt;
&lt;td&gt;~42&lt;/td&gt;
&lt;td&gt;72.1&lt;/td&gt;
&lt;td&gt;79.3&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;DeepSeek-R1 32B&lt;/td&gt;
&lt;td&gt;~22&lt;/td&gt;
&lt;td&gt;74.5&lt;/td&gt;
&lt;td&gt;84.1&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Qwen 3.6 32B&lt;/td&gt;
&lt;td&gt;~25&lt;/td&gt;
&lt;td&gt;73.0&lt;/td&gt;
&lt;td&gt;81.4&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;strong&gt;Takeaway:&lt;/strong&gt; Llama 4 Scout/Maverick are the fastest high-quality models you can run locally. If speed matters more than raw benchmark scores, they're the pragmatic choice.&lt;/p&gt;




&lt;h2&gt;
  
  
  Pro Tips
&lt;/h2&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Use &lt;code&gt;llama4:maverick&lt;/code&gt; with a 32K context limit&lt;/strong&gt; — the full 128K eats VRAM and degrades attention quality&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Don't use Q2/Q3 quants&lt;/strong&gt; — MoE models lose coherence more sharply at extreme quantization than dense models&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Scout is the sweet spot for most setups&lt;/strong&gt; — unless you're doing research or fact-heavy work&lt;/li&gt;
&lt;/ol&gt;




&lt;p&gt;&lt;strong&gt;Related guides:&lt;/strong&gt; &lt;a href="https://github.com/Lingdas1/local-llm-guide/blob/main/03-models/gemma-4.md" rel="noopener noreferrer"&gt;Gemma 4&lt;/a&gt; | &lt;a href="https://github.com/Lingdas1/local-llm-guide/blob/main/03-models/qwen-guide.md" rel="noopener noreferrer"&gt;Qwen&lt;/a&gt; | &lt;a href="https://github.com/Lingdas1/local-llm-guide/blob/main/03-models/moe-guide.md" rel="noopener noreferrer"&gt;MoE Models&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>opensource</category>
      <category>llm</category>
      <category>tutorial</category>
    </item>
    <item>
      <title>👻 Crash #1: The Gateway Ghost — When Your AI Pretends to Work</title>
      <dc:creator>Lingdas1</dc:creator>
      <pubDate>Sun, 24 May 2026 11:51:59 +0000</pubDate>
      <link>https://dev.to/lingdas1/crash-1-the-gateway-ghost-when-your-ai-pretends-to-work-4kao</link>
      <guid>https://dev.to/lingdas1/crash-1-the-gateway-ghost-when-your-ai-pretends-to-work-4kao</guid>
      <description>&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt; 1|# 👻 Crash #1: The Gateway Ghost
 2|
 3|&amp;gt; *"Did I do something wrong? Let me reinstall everything."*
 4|
 5|---
 6|
 7|## What Happened
 8|
 9|I followed the tutorial step by step. Everything installed perfectly. I was thrilled.
10|
11|Then the gateway — the bridge connecting my AI assistant to the messaging app — started disconnecting randomly. Sometimes it worked for hours. Sometimes it died after 10 minutes. No pattern, no error message, nothing to Google.
12|
13|**My response:** I wiped everything and reinstalled. Twice.
14|
15|**The actual fix:** I just needed to restart the gateway. That's it.
16|
17|---
18|
19|## What I Learned
20|
21|**Before you assume you broke something, try turning it off and on again.**
22|
23|It's a cliché because it works. I wasted an entire evening reinstalling software that was fine. The problem was a process that needed a kick, not a configuration that needed a rewrite.
24|
25|---
26|
27|## 🛡️ Golden Rule Reminder
28|
29|&amp;gt; **If it works, don't touch it.** I reinstalled a perfectly good setup twice before trying the simplest fix. Always try the 10-second solution before the 2-hour one.
30|
31|&amp;gt; **Run everything in a VM.** If my gateway was already inside a VM with a snapshot, I could have just rolled back instead of reinstalling from scratch.
32|
33|---
34|
35|*← Full story: [I Broke My AI Assistant 7 Times](https://dev.to/lingdas1/i-broke-my-ai-assistant-7-times-heres-what-i-learned-47le)*
36|
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;




&lt;h2&gt;
  
  
  💬 Your Turn
&lt;/h2&gt;

&lt;p&gt;Have you run into a similar problem? Or hit a wall I didn't mention?&lt;/p&gt;

&lt;p&gt;Drop a comment below — I read every single one. Your experience might help someone else who's stuck on the same thing.&lt;/p&gt;

&lt;p&gt;The more we share our screw-ups, the fewer people have to make them. 🤝&lt;/p&gt;

</description>
      <category>ai</category>
      <category>beginners</category>
      <category>story</category>
      <category>productivity</category>
    </item>
    <item>
      <title>🏗️ Day 1: I Almost Bought a Phone for AI (And Other Beginner Mistakes)</title>
      <dc:creator>Lingdas1</dc:creator>
      <pubDate>Sun, 24 May 2026 11:03:15 +0000</pubDate>
      <link>https://dev.to/lingdas1/day-1-i-almost-bought-a-phone-for-ai-and-other-beginner-mistakes-ada</link>
      <guid>https://dev.to/lingdas1/day-1-i-almost-bought-a-phone-for-ai-and-other-beginner-mistakes-ada</guid>
      <description>&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt; 1|# 🏗️ Day 1: I Almost Bought a Phone for AI (And Other Beginner Mistakes)&lt;br&gt;
 2|&lt;br&gt;
 3|&amp;gt; &lt;em&gt;The story of how I went from "I want a Jarvis" to actually building one — one crash at a time.&lt;/em&gt;&lt;br&gt;
 4|&lt;br&gt;
 5|---&lt;br&gt;
 6|&lt;br&gt;
 7|## How It Started&lt;br&gt;
 8|&lt;br&gt;
 9|I found out about AI the same way most people do: &lt;strong&gt;scrolling through videos.&lt;/strong&gt;&lt;br&gt;
10|&lt;br&gt;
11|One day, it was the "Doubao Phone" — a smartphone with a built-in AI assistant that could order food, compare prices, and even play games for you. &lt;em&gt;"Finally, my own Jarvis!"&lt;/em&gt; I thought. I almost bought one.&lt;br&gt;
12|&lt;br&gt;
13|Then the app stores blocked it. The hype died. On to the next thing.&lt;br&gt;
14|&lt;br&gt;
15|Next up: &lt;strong&gt;farming crayfish with AI.&lt;/strong&gt; Yes, that was a real trend. A virtual crayfish farm managed by an AI agent. Fun to watch, but the token costs were insane, and the AI kept forgetting what happened five minutes ago.&lt;br&gt;
16|&lt;br&gt;
17|I kept watching, kept wanting, kept feeling like AI was something &lt;em&gt;other people&lt;/em&gt; did.&lt;br&gt;
18|&lt;br&gt;
19|Then I found &lt;strong&gt;Hermes Agent&lt;/strong&gt; — an open-source AI assistant you can run on your own machine. Free. Private. No subscription.&lt;br&gt;
20|&lt;br&gt;
21|I searched for tutorials. Downloaded the files. And started the most frustrating, educational tech journey of my life.&lt;br&gt;
22|&lt;br&gt;
23|---&lt;br&gt;
24|&lt;br&gt;
25|## The Big Lesson&lt;br&gt;
26|&lt;br&gt;
27|Looking back, the problem wasn't that I didn't know enough. It was that I kept &lt;strong&gt;chasing the next shiny thing&lt;/strong&gt; instead of picking one path and sticking with it.&lt;br&gt;
28|&lt;br&gt;
29|&lt;strong&gt;The real lesson:&lt;/strong&gt; Stop waiting for the perfect AI product. The tools are already free and open source. You just need to pick one and start — even if you break it a few times along the way.&lt;br&gt;
30|&lt;br&gt;
31|---&lt;br&gt;
32|&lt;br&gt;
33|## 🛡️ The Golden Rule (Read This Before the Next Article)&lt;br&gt;
34|&lt;br&gt;
35|&amp;gt; &lt;strong&gt;If it works, don't touch it.&lt;/strong&gt;&lt;br&gt;
36|&amp;gt;&lt;br&gt;
37|&amp;gt; You never know which piece of your setup is holding everything together. That random config file you're not sure about? Leave it alone. Every time I thought "I'll just fix this one small thing," I spent 3 hours recovering.&lt;br&gt;
38|&amp;gt;&lt;br&gt;
39|&amp;gt; Even a stable system can break for no reason. When it does, fix only &lt;em&gt;that one thing&lt;/em&gt; — don't "improve" everything else while you're at it.&lt;br&gt;
40|&lt;br&gt;
41|&lt;strong&gt;My #1 recommendation for beginners:&lt;/strong&gt; Run everything inside a &lt;strong&gt;virtual machine (VM)&lt;/strong&gt; with Linux. Give it 100-200GB of disk space (not C: drive!). This isolates 90% of problems — host OS breaks? VM still works. VM breaks? Just restore a snapshot.&lt;br&gt;
42|&lt;br&gt;
43|---&lt;br&gt;
44|&lt;br&gt;
45|&lt;em&gt;← Read the full story first: &lt;a href="https://dev.to/lingdas1/i-broke-my-ai-assistant-7-times-heres-what-i-learned-47le"&gt;I Broke My AI Assistant 7 Times&lt;/a&gt;&lt;/em&gt;&lt;br&gt;
46|&lt;br&gt;
47|&lt;em&gt;Next: The Gateway Ghost 👻 →&lt;/em&gt;&lt;br&gt;
48|---&lt;br&gt;
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;
&lt;h2&gt;
&lt;br&gt;
  &lt;br&gt;
  &lt;br&gt;
  💬 Your Turn&lt;br&gt;
&lt;/h2&gt;

&lt;p&gt;Have you run into a similar problem? Or hit a wall I didn't mention?&lt;/p&gt;

&lt;p&gt;Drop a comment below — I read every single one. Your experience might help someone else who's stuck on the same thing.&lt;/p&gt;

&lt;p&gt;The more we share our screw-ups, the fewer people have to make them. 🤝&lt;/p&gt;

</description>
      <category>ai</category>
      <category>beginners</category>
      <category>story</category>
      <category>productivity</category>
    </item>
    <item>
      <title>I Broke My AI Assistant 7 Times. Here's What I Learned.</title>
      <dc:creator>Lingdas1</dc:creator>
      <pubDate>Sun, 24 May 2026 10:33:01 +0000</pubDate>
      <link>https://dev.to/lingdas1/i-broke-my-ai-assistant-7-times-heres-what-i-learned-47le</link>
      <guid>https://dev.to/lingdas1/i-broke-my-ai-assistant-7-times-heres-what-i-learned-47le</guid>
      <description>&lt;h1&gt;
  
  
  I Broke My AI Assistant 7 Times. Here's What I Learned.
&lt;/h1&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;One medical student's journey from "I want a Jarvis" to accidentally becoming a self-taught DevOps engineer.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;h2&gt;
  
  
  The Beginning: I Almost Bought a Phone for AI
&lt;/h2&gt;

&lt;p&gt;It started with a video.&lt;/p&gt;

&lt;p&gt;I was scrolling through Bilibili (think YouTube, but Chinese) and saw something that blew my mind: &lt;strong&gt;the "Doubao Phone."&lt;/strong&gt; A smartphone with a built-in AI assistant that could do everything — order food, compare prices across stores, play games for you, book appointments. &lt;em&gt;"Finally,"&lt;/em&gt; I thought, &lt;em&gt;"my own Jarvis."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;I almost bought it.&lt;/p&gt;

&lt;p&gt;Then the app store drama happened. The big companies blocked Doubao's integrations. The phone stopped being magical. And I moved on to the next viral thing.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Farming crayfish with AI.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Yes, that was a real trend. You could deploy an AI agent that managed a virtual crayfish farm. It was hilarious but also... expensive. The token costs were insane, and the AI kept forgetting what happened five minutes ago.&lt;/p&gt;

&lt;p&gt;I watched from the sidelines, feeling that familiar itch: &lt;em&gt;"I want to do this too, but I don't know how."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Then I found &lt;strong&gt;Hermes Agent&lt;/strong&gt; — an open-source AI assistant you can run on your own computer. Free. Private. Controllable.&lt;/p&gt;

&lt;p&gt;I searched Bilibili for tutorials. Downloaded the files. And thus began the longest, most frustrating, most educational tech journey of my life.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Setup: 7 Times I Broke Everything
&lt;/h2&gt;

&lt;p&gt;Here's the honest story of what happened when a medical student with no coding background tried to deploy an AI assistant on his own.&lt;/p&gt;




&lt;h3&gt;
  
  
  💥 Crash #1: The Gateway Ghost
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;What happened:&lt;/strong&gt; I followed the tutorial step by step. Everything installed fine. Then the gateway started disconnecting randomly. Sometimes it worked for hours. Sometimes it died after 10 minutes.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;My reaction:&lt;/strong&gt; &lt;em&gt;"Did I do something wrong? Let me reinstall everything."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What actually fixed it:&lt;/strong&gt; Restarting the gateway. That's it. Just... restarting it. I had already wiped and reinstalled twice before I figured this out.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Lesson learned:&lt;/strong&gt; Before assuming you broke something, try turning it off and on again. It's cliché because it works.&lt;/p&gt;




&lt;h3&gt;
  
  
  💥 Crash #2: Russia's Internet Hates Me
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;What happened:&lt;/strong&gt; I'm studying in Russia, and the internet here is... let's say &lt;em&gt;unstable.&lt;/em&gt; The VPN blocks. The DNS dies. The whole building loses connection for hours at a time.&lt;/p&gt;

&lt;p&gt;I thought: &lt;em&gt;"No problem — I'll download some local AI models so my assistant can work offline."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;I spent a weekend downloading models. Got everything set up. It was beautiful.&lt;/p&gt;

&lt;p&gt;The next morning, Windows gave me a blue screen of death. When it rebooted, &lt;strong&gt;all my downloaded models were gone.&lt;/strong&gt; Corrupted. Unreadable.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;My reaction:&lt;/strong&gt; Staring at my screen in disbelief. 20GB of models, gone.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What actually fixed it:&lt;/strong&gt; I switched to a different model loader, redownloaded everything, and took a screenshot of the working config this time.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Lesson learned:&lt;/strong&gt; Backup your configuration &lt;strong&gt;before&lt;/strong&gt; you think you need it. Not after.&lt;/p&gt;




&lt;h3&gt;
  
  
  💥 Crash #3: The C: Drive Betrayal
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;What happened:&lt;/strong&gt; Everything installed to C: drive by default. Models, tools, environments — all happily eating up space on my system drive.&lt;/p&gt;

&lt;p&gt;One morning, Windows greeted me with: &lt;strong&gt;"Your C: drive is almost full."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Panic.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;I decided to move everything to D: drive. I consulted with another AI, got detailed migration instructions, and followed them carefully.&lt;/p&gt;

&lt;p&gt;Everything broke.&lt;/p&gt;

&lt;p&gt;My assistant couldn't find its files. WSL refused to start. Models were looking for paths that no longer existed.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;My reaction:&lt;/strong&gt; &lt;em&gt;"But... I followed the instructions!"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What actually fixed it:&lt;/strong&gt; I restored from a backup I thankfully made before starting, and did the migration &lt;strong&gt;one piece at a time&lt;/strong&gt; — move WSL first, confirm it works, then move the model loader, confirm it works, then move the assistant.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Lesson learned:&lt;/strong&gt; Never migrate everything at once. One step at a time. And always have a rollback plan.&lt;/p&gt;




&lt;h3&gt;
  
  
  💥 Crash #4: The Emulator War
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;What happened:&lt;/strong&gt; Remember that Android emulator I installed months ago to play mobile games? I had uninstalled it. No big deal, right?&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Wrong.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;After uninstalling the emulator, WSL2 started throwing this error: &lt;code&gt;HCS_E_SERVICE_NOT_AVAILABLE&lt;/code&gt;. Virtualization broke. Windows Subsystem for Linux stopped working. My AI couldn't run.&lt;/p&gt;

&lt;p&gt;It turned out the emulator and WSL2 were fighting over the same virtualization resources. And when I removed the emulator, it took something with it.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;My reaction:&lt;/strong&gt; &lt;em&gt;"I just deleted a game emulator. How does that break my AI assistant?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What actually fixed it:&lt;/strong&gt; Multiple restarts, repairing Windows Hyper-V components, and a lot of swearing at my screen.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Lesson learned:&lt;/strong&gt; Your computer's virtualization layer is like a house of cards. Remove one component and the whole thing can collapse. Also: Windows 11 Home edition hides virtualization settings, making this 10x harder to debug.&lt;/p&gt;




&lt;h3&gt;
  
  
  💥 Crash #5: The Great OS Migration
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;What happened:&lt;/strong&gt; After the emulator war, I decided enough was enough. I backed up everything, wiped my computer, and installed a fresh Windows. This time, I would run my AI inside a &lt;strong&gt;virtual machine&lt;/strong&gt; with Linux. No more WSL2 headaches.&lt;/p&gt;

&lt;p&gt;It worked. For about a day.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;My reaction:&lt;/strong&gt; Relief followed by confusion.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What actually fixed it:&lt;/strong&gt; Nothing — it worked fine. I just didn't trust it anymore.&lt;/p&gt;




&lt;h3&gt;
  
  
  💥 Crash #6: The Invisible Network Cable
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;What happened:&lt;/strong&gt; My host computer (Windows) had internet. My VM (Linux) didn't. The network adapter was set to NAT, just like every tutorial said. But the VM couldn't reach the outside world.&lt;/p&gt;

&lt;p&gt;I spent hours checking settings, reinstalling network drivers, changing adapter types.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;My reaction:&lt;/strong&gt; &lt;em&gt;"The internet works on my laptop. Why doesn't it work INSIDE my laptop?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What actually fixed it:&lt;/strong&gt; The VMware NAT Service and DHCP Service weren't running in Windows. They're supposed to start automatically. They didn't. One click to start them, and everything worked.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Lesson learned:&lt;/strong&gt; When virtualization networking breaks, check the &lt;strong&gt;host services first&lt;/strong&gt;, not the VM settings. And &lt;code&gt;ping&lt;/code&gt; and &lt;code&gt;curl&lt;/code&gt; are better debugging tools than staring at network icons.&lt;/p&gt;




&lt;h3&gt;
  
  
  💥 Crash #7: The Gateway That Lied to Me
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;What happened:&lt;/strong&gt; I had set up the gateway to auto-start on boot. I checked the configuration. It said &lt;code&gt;enabled: true&lt;/code&gt;. I was confident.&lt;/p&gt;

&lt;p&gt;The next morning, my AI was offline again.&lt;/p&gt;

&lt;p&gt;The gateway had "started" but hadn't actually connected. It was running as a process, but doing nothing useful.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;My reaction:&lt;/strong&gt; &lt;em&gt;"But I set it to auto-start! Why is it lying to me?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;What actually fixed it:&lt;/strong&gt; I wrote a simple script that checks every 5 minutes whether the gateway is actually connected, and restarts it if not. Bulletproof.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Lesson learned:&lt;/strong&gt; "Running" and "working" are two different things. Always add a health check.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Golden Rule: Don't Touch It
&lt;/h2&gt;

&lt;p&gt;After weeks of crashes, debugging, and existential crises, my setup finally stabilized. Everything worked. The gateway stayed connected. The models loaded correctly. Messages flowed.&lt;/p&gt;

&lt;p&gt;And I learned the most important lesson of all:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;If it works, don't touch it.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;You never know which piece of your spaghetti-code setup is holding everything together. That random config file? The one you're not sure does anything? Yeah, it probably does something. Leave it alone.&lt;/p&gt;

&lt;p&gt;Every time I thought &lt;em&gt;"I'll just fix this one small thing,"&lt;/em&gt; I ended up spending 3 hours recovering from the consequences.&lt;/p&gt;




&lt;h2&gt;
  
  
  What I Want You to Know
&lt;/h2&gt;

&lt;p&gt;I'm telling you all this not because I'm an expert — I'm not. I'm a medical student. I study anatomy, not APIs. I chose this career because I wanted to help people, not because I wanted to debug network services at 2 AM.&lt;/p&gt;

&lt;p&gt;But I got it working. And if I can, you can too.&lt;/p&gt;

&lt;p&gt;Here's what I learned that actually matters:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Before I started&lt;/th&gt;
&lt;th&gt;After I broke everything 7 times&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;"AI is for programmers"&lt;/td&gt;
&lt;td&gt;"AI is for anyone stubborn enough to try"&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;"I'll just follow the tutorial"&lt;/td&gt;
&lt;td&gt;"I'll follow the tutorial &lt;em&gt;and&lt;/em&gt; backup first"&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;"It should work perfectly"&lt;/td&gt;
&lt;td&gt;"It will break, and that's normal"&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;"I'm not technical enough"&lt;/td&gt;
&lt;td&gt;"Being patient matters more than being technical"&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;




&lt;h2&gt;
  
  
  Your Turn
&lt;/h2&gt;

&lt;p&gt;If you're reading this and thinking &lt;em&gt;"That sounds like me"&lt;/em&gt; — good. You're exactly who I wrote this for.&lt;/p&gt;

&lt;p&gt;Start with something small. Expect it to break. Backup before you change anything. And when it finally works, &lt;strong&gt;leave it alone.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;I'm still learning. Every day something new confuses me. But I'm not scared of it anymore — because I've already broken everything that could break.&lt;/p&gt;

&lt;p&gt;And the AI is still running.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Hi, I'm Ling. I'm a medical student in China who somehow became a self-taught AI deployer. No CS degree, no big tech job — just a laptop, broken internet, and way too much stubbornness.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;This is the first of my "Real People, Real AI" series. ⭐ &lt;a href="https://github.com/Lingdas1/local-llm-guide" rel="noopener noreferrer"&gt;Star the GitHub repo&lt;/a&gt; to get notified when the next one drops.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;P.S. — If you've broken your own AI setup in a creative way, leave a comment. Misery loves company. 😄&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>beginners</category>
      <category>productivity</category>
      <category>story</category>
    </item>
    <item>
      <title>What Is an LLM? (No, It's Not Magic — Here's What's Actually Happening)</title>
      <dc:creator>Lingdas1</dc:creator>
      <pubDate>Sun, 24 May 2026 09:36:12 +0000</pubDate>
      <link>https://dev.to/lingdas1/what-is-an-llm-no-its-not-magic-heres-whats-actually-happening-3ond</link>
      <guid>https://dev.to/lingdas1/what-is-an-llm-no-its-not-magic-heres-whats-actually-happening-3ond</guid>
      <description>&lt;h1&gt;
  
  
  What Is an LLM? (No, It's Not Magic — Here's What's Actually Happening)
&lt;/h1&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;em&gt;The plain-English guide to understanding AI — no jargon, no code, just the stuff that matters.&lt;/em&gt;&lt;/p&gt;
&lt;/blockquote&gt;




&lt;p&gt;My grandfather called it "the thinking computer."&lt;/p&gt;

&lt;p&gt;I showed him ChatGPT, and he asked: &lt;em&gt;"Does it... think? Like a person?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;It's a good question. And honestly, most explanations of AI are terrible at answering it. Either they're too technical (&lt;em&gt;"a transformer-based neural network with self-attention mechanisms"&lt;/em&gt; — whatever that means) or too mystical (&lt;em&gt;"it's like a digital brain!"&lt;/em&gt; — no, it's not).&lt;/p&gt;

&lt;p&gt;So let me explain what an LLM actually is. No jargon. No magic. Just the truth.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Analogy: A Chef Who's Tried Every Recipe
&lt;/h2&gt;

&lt;p&gt;Imagine the world's most experienced chef. This chef has read &lt;strong&gt;every&lt;/strong&gt; cookbook ever written. Every recipe from every culture. Every food blog. Every handwritten note from every grandmother.&lt;/p&gt;

&lt;p&gt;You ask this chef: &lt;em&gt;"Can you make me something with chicken, lemon, and garlic?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The chef has never made &lt;em&gt;that exact dish&lt;/em&gt; before, but they've read millions of recipes. They know what works. They know chicken + lemon + garlic usually means a Mediterranean-style dish. They know garlic should be minced, not whole. They know lemon juice goes in near the end, not the beginning.&lt;/p&gt;

&lt;p&gt;So they create a new recipe, perfectly reasonable, that has never existed before.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;That's what an LLM does.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;It's not "thinking." It's not "conscious." It has read an unimaginable amount of human text — books, articles, conversations, code — and learned the patterns of how we write and reason.&lt;/p&gt;

&lt;p&gt;When you ask it a question, it doesn't "look up" an answer. It &lt;em&gt;generates&lt;/em&gt; one, word by word, based on everything it has learned.&lt;/p&gt;




&lt;h2&gt;
  
  
  What LLM Actually Stands For
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;L&lt;/strong&gt;arge &lt;strong&gt;L&lt;/strong&gt;anguage &lt;strong&gt;M&lt;/strong&gt;odel.&lt;/p&gt;

&lt;p&gt;Let's break that down:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Language&lt;/strong&gt; — It works with words. Text in, text out. That's its native language (pun intended).&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Model&lt;/strong&gt; — A mathematical representation of patterns. Think of it as a super-complex set of probabilities: &lt;em&gt;"After the word 'I', the next word is usually a verb, and after 'I want to', the next word is often 'go' or 'get' or 'make'..."&lt;/em&gt; × a billion.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Large&lt;/strong&gt; — Really, &lt;em&gt;really&lt;/em&gt; large. These models have been trained on most of the public internet. The biggest ones have learned patterns from trillions of words.&lt;/li&gt;
&lt;/ul&gt;




&lt;h2&gt;
  
  
  What It's NOT
&lt;/h2&gt;

&lt;p&gt;Let me clear up some common confusion:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Myth&lt;/th&gt;
&lt;th&gt;Truth&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;🧠 "It thinks like a human"&lt;/td&gt;
&lt;td&gt;❌ No. It predicts words based on patterns. No consciousness, no feelings, no self-awareness.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;📚 "It knows everything"&lt;/td&gt;
&lt;td&gt;❌ It knows what it was trained on, which has a cutoff date. It doesn't "know" anything — it &lt;em&gt;generates&lt;/em&gt; plausible text.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;🎯 "It's always right"&lt;/td&gt;
&lt;td&gt;❌ It can be confidently wrong. It's great at sounding correct even when it's making things up.&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;📝 "It copies from the internet"&lt;/td&gt;
&lt;td&gt;❌ It doesn't store copies of web pages. It learned patterns and generates original text based on those patterns.&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;




&lt;h2&gt;
  
  
  Why "Large" Matters
&lt;/h2&gt;

&lt;p&gt;Imagine two chefs:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Chef A&lt;/strong&gt; has read 10 recipes. They know how to make exactly 10 dishes.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Chef B&lt;/strong&gt; has read 10 million recipes. They understand cuisine at a deep level.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;LLMs work the same way. The "large" in "Large Language Model" refers to:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;The amount of training data&lt;/strong&gt; — billions of web pages, books, and documents&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;The number of parameters&lt;/strong&gt; — think of these as "connections" in the model. A 7-billion-parameter model (small) has learned 7 billion patterns. A 70-billion-parameter model (large) has learned 70 billion.&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;More parameters = more pattern recognition = better reasoning (usually).&lt;/p&gt;

&lt;p&gt;But here's the good news: &lt;strong&gt;you don't need the biggest model.&lt;/strong&gt; A 7-billion-parameter model, running on a laptop, can handle most everyday tasks just fine. It's like having Chef B-lite — still experienced, still useful, much more practical.&lt;/p&gt;




&lt;h2&gt;
  
  
  How It Actually Works (The Simplest Explanation)
&lt;/h2&gt;

&lt;p&gt;When you type a message, here's what happens:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;You type: "What is the capital of France?"

Step 1: The model breaks your question into tokens (words and pieces of words).
         ["What", " is", " the", " capital", " of", " France", "?"]

Step 2: The model starts predicting the answer, one word at a time.
         "The" → "capital" → "of" → "France" → "is" → "Paris" → "."

Step 3: Each word is chosen based on probability.
         "The capital of France is..." → P(Paris) = 95%, P(Lyon) = 2%, P(Marseille) = 1%
         → It picks "Paris" (the most probable)

Step 4: Done! "The capital of France is Paris."
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;It's not magic. It's a very, very sophisticated version of your phone's autocomplete — trained on the entire internet.&lt;/p&gt;




&lt;h2&gt;
  
  
  Why This Matters to You (a Regular Person)
&lt;/h2&gt;

&lt;p&gt;Here's why understanding this matters:&lt;/p&gt;

&lt;h3&gt;
  
  
  1. You Don't Need to Be a Programmer
&lt;/h3&gt;

&lt;p&gt;If you understand that an LLM predicts words based on patterns, you already understand enough to use it. The tools are designed for everyone now.&lt;/p&gt;

&lt;h3&gt;
  
  
  2. You Can Run It on Your Laptop
&lt;/h3&gt;

&lt;p&gt;Because LLMs are just math (very complicated math, but still math), they can run on any computer. A smaller model on your laptop is slower than ChatGPT — but it's private, free, and always available.&lt;/p&gt;

&lt;h3&gt;
  
  
  3. You Should Be Skeptical
&lt;/h3&gt;

&lt;p&gt;Knowing that LLMs can be confidently wrong helps you use them better. Always fact-check important information. Use AI as a brainstorming partner, not an encyclopedia.&lt;/p&gt;

&lt;h3&gt;
  
  
  4. You're Not Left Behind
&lt;/h3&gt;

&lt;p&gt;The people who benefit most from AI aren't programmers — they're writers, students, small business owners, artists, and curious people who ask good questions. That's probably you.&lt;/p&gt;




&lt;h2&gt;
  
  
  The Different Types of AI (In Two Sentences)
&lt;/h2&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Type&lt;/th&gt;
&lt;th&gt;What It Does&lt;/th&gt;
&lt;th&gt;Example&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;LLM&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Understands and generates text&lt;/td&gt;
&lt;td&gt;ChatGPT, Claude, DeepSeek&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Image generator&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Creates pictures from descriptions&lt;/td&gt;
&lt;td&gt;Midjourney, DALL-E, Stable Diffusion&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Voice AI&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Understands and generates speech&lt;/td&gt;
&lt;td&gt;Siri, Whisper&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Recommendation&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Predicts what you'll like&lt;/td&gt;
&lt;td&gt;TikTok, Netflix, YouTube&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;This series focuses on LLMs — the text-based AI that can write, explain, analyze, and assist. It's the most useful type for everyday tasks.&lt;/p&gt;




&lt;h2&gt;
  
  
  What You Can Actually DO with This Knowledge
&lt;/h2&gt;

&lt;p&gt;Now that you know what an LLM is:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;You can use one right now, for free&lt;/strong&gt; — Ollama + a small model on your laptop&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;You know the limits&lt;/strong&gt; — It's not magic, it's pattern recognition. Use it as a tool, not an oracle.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;You can explain it to others&lt;/strong&gt; — When your friends say "AI is taking over," you can say "Actually, it's just really good autocomplete, trained on a lot of data."&lt;/li&gt;
&lt;/ol&gt;




&lt;h2&gt;
  
  
  What's Next
&lt;/h2&gt;

&lt;p&gt;Now that you know &lt;em&gt;what&lt;/em&gt; an LLM is, the next guide shows you how to actually run one:&lt;/p&gt;

&lt;p&gt;👉 &lt;strong&gt;Part 3:&lt;/strong&gt; &lt;em&gt;"Step-by-Step: Run Your First AI Model in 10 Minutes"&lt;/em&gt; — (coming next)&lt;/p&gt;

&lt;p&gt;No terminal commands you don't understand. No unexplained jargon. Just a simple walkthrough with screenshots.&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Hi, I'm Ling. I'm a medical student who got tired of feeling left behind by AI. I started learning, broke things, fixed them, and now I'm sharing what I've learned — in plain English, for regular people.&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;Found this useful? ⭐ &lt;a href="https://github.com/Lingdas1/local-llm-guide" rel="noopener noreferrer"&gt;Star the GitHub repo&lt;/a&gt; to get notified when new guides drop. Or leave a comment — I'd love to hear what questions you still have.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>beginners</category>
      <category>explainer</category>
      <category>technology</category>
    </item>
  </channel>
</rss>
