<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: zhide liu</title>
    <description>The latest articles on DEV Community by zhide liu (@zhide_liu_263b10ae585aed3).</description>
    <link>https://dev.to/zhide_liu_263b10ae585aed3</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3998276%2Fc0ac72fd-b379-40cc-a9be-5b27dd57a878.jpg</url>
      <title>DEV Community: zhide liu</title>
      <link>https://dev.to/zhide_liu_263b10ae585aed3</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/zhide_liu_263b10ae585aed3"/>
    <language>en</language>
    <item>
      <title>We Installed a Hippocampus in AI, Modeled After the Human Brain</title>
      <dc:creator>zhide liu</dc:creator>
      <pubDate>Tue, 23 Jun 2026 09:16:34 +0000</pubDate>
      <link>https://dev.to/zhide_liu_263b10ae585aed3/we-installed-a-hippocampus-in-ai-modeled-after-the-human-brain-500k</link>
      <guid>https://dev.to/zhide_liu_263b10ae585aed3/we-installed-a-hippocampus-in-ai-modeled-after-the-human-brain-500k</guid>
      <description>&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Ferwvf05cgs8iv9em568g.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Ferwvf05cgs8iv9em568g.jpg" alt="A glowing cyan neural-network brain on the left dissolves into a circuit board on the right — microchips, memory modules, and a padlocked shield icon — set against a midnight-indigo starfield, symbolizing a brain-inspired, privacy-first memory system for AI" width="800" height="446"&gt;&lt;/a&gt;&lt;br&gt;
The AntGroup AFX team is open-sourcing &lt;strong&gt;Hebb Mind&lt;/strong&gt;, a memory system for AI agents. Instead of just "store and retrieve," it brings the brain's full memory loop — &lt;strong&gt;encode → consolidate → activate → forget&lt;/strong&gt; — into engineering.&lt;/p&gt;

&lt;blockquote&gt;
&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;GitHub&lt;/strong&gt;: &lt;a href="https://github.com/afx-team/hebb-mind" rel="noopener noreferrer"&gt;https://github.com/afx-team/hebb-mind&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Docs&lt;/strong&gt;: &lt;a href="https://afx-team.github.io/hebb-mind/quick-start.html" rel="noopener noreferrer"&gt;https://afx-team.github.io/hebb-mind/quick-start.html&lt;/a&gt;
&lt;/li&gt;
&lt;/ul&gt;
&lt;/blockquote&gt;
&lt;h2&gt;
  
  
  How do we make AI remember context today?
&lt;/h2&gt;

&lt;p&gt;The most common approach is a handful of &lt;strong&gt;&lt;code&gt;MEMORY.md&lt;/code&gt;&lt;/strong&gt;, &lt;strong&gt;&lt;code&gt;AGENTS.md&lt;/code&gt;&lt;/strong&gt;, and &lt;strong&gt;&lt;code&gt;CLAUDE.md&lt;/code&gt;&lt;/strong&gt; files paired with &lt;code&gt;grep&lt;/code&gt; (keyword search over text). Whatever you need to remember gets appended line by line; when you need it, you search. The appeal is that it's dead simple — &lt;strong&gt;zero install, zero dependencies, zero cost&lt;/strong&gt; — and you can start right now.&lt;/p&gt;

&lt;p&gt;But the limits show up fast: there's no structure. The longer you use it, the more your memories pile up flat inside one ever-growing file. Phrase your query differently and you miss the hit; scale it up and "memory" degrades into "a tangled mess."&lt;/p&gt;

&lt;p&gt;A step up is the more systematic option: a vector store plus semantic retrieval (i.e., RAG). After each turn you make a model call to extract the key points, write them into a store indexed "by semantic similarity," and next time you ask, you recall "the few entries closest in meaning." This solves grep's rephrasing problem and gives you genuine semantic search.&lt;/p&gt;

&lt;p&gt;But hold these up against a real brain and you'll notice &lt;strong&gt;a few missing dimensions&lt;/strong&gt;:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Neither one &lt;strong&gt;organizes and settles&lt;/strong&gt; scattered experiences into structured knowledge during idle time (a &lt;code&gt;.md&lt;/code&gt; is a puddle of text, a vector store is a heap of vectors — nothing gets filed away);&lt;/li&gt;
&lt;li&gt;Neither one &lt;strong&gt;forgets on purpose&lt;/strong&gt; — an offhand remark from three years ago carries the same weight as a preference you stressed yesterday;&lt;/li&gt;
&lt;li&gt;Recall is just "pull up the closest few entries." It can't give you the brain's feeling of &lt;strong&gt;one cue lighting up a whole region&lt;/strong&gt;.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In short: &lt;code&gt;.md&lt;/code&gt; + grep is "good enough," and a vector store gets you "search by meaning" — but even if model vendors push the context window from 200k to 1m, it still doesn't solve the real problem. Because what our brains do has never been simple storage plus retrieval. It's a complete loop — &lt;strong&gt;encode → consolidate → activate → forget&lt;/strong&gt;.&lt;/p&gt;
&lt;h2&gt;
  
  
  We rewrote it the way the brain takes notes
&lt;/h2&gt;

&lt;p&gt;We tried a range of off-the-shelf options, and on intelligence they all came up short. So we decided to look hard at the structure and the memory machinery of the human brain, and see if it held any clues. Below is the standard map of major brain regions that researchers broadly agree on — and the secret of memory is hidden in there.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fo9x2afjki75yx67kq2nn.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fo9x2afjki75yx67kq2nn.png" width="800" id="HrFtq" alt="▲ The hippocampus sits deep in the medial temporal lobe and handles fast encoding; long-term memories are slowly &amp;quot;consolidated&amp;quot; into the neocortex" height="446"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Step one, encoding.&lt;/strong&gt; What you just experienced isn't immediately carved into the deep brain. The hippocampus first stashes it away quickly and cheaply — like jotting it on a sticky note.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Step two, consolidation — and it happens while you sleep.&lt;/strong&gt; This is the step everyone overlooks, and also the most important one. During sleep, the hippocampus &lt;strong&gt;replays the day's experiences at high speed&lt;/strong&gt; to the cortex, weaving the new information into your existing web of knowledge, pass after pass. That's why after a good night's sleep you often feel like you've "figured it out" or "locked it in" — memory gets cemented while you rest. And the stuff that's important, the stuff that fits what you already know, cements fastest and firmest.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Step three, retrieval — a "spreading activation."&lt;/strong&gt; Remembering isn't an exact table lookup. One cue ("that café") travels along the association network and lights up "the project we discussed that day," "the person sitting across from me." Even when the cue is only half there, the brain fills in the rest.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Step four, forgetting — an active ability.&lt;/strong&gt; What you use often and used recently sticks around; what you haven't touched in a long time fades exponentially on its own. This isn't "the disk filled up and overflowed" — it's deliberate. The brain wants your head clear when you're thinking, not stuffed with years of stale noise.&lt;/p&gt;

&lt;p&gt;By contrast: today's &lt;strong&gt;mainstream AI memory approaches drop exactly the three steps that matter most — consolidation, spreading activation, and active forgetting — and keep only "store" and "retrieve."&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Ft6xw2cqs6e4vsrvpgkm1.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Ft6xw2cqs6e4vsrvpgkm1.png" width="800" id="HrFtq" alt="_▲ Retrieval isn" height="447"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;Following this machinery, we built Hebb Mind — and it's now open source. "Hebb" comes from the neuroscientist Donald Hebb, who in 1949 proposed what's now known as Hebbian theory, usually boiled down to one line: &lt;strong&gt;"neurons that fire together, wire together."&lt;/strong&gt; That's the underlying mechanism by which the brain forms associations and leaps from one point to a whole region — and it's the starting point for Hebb Mind's design: memories that often appear together get linked, and memories that get used often stick around longer.&lt;/p&gt;

&lt;p&gt;So the core of Hebb Mind isn't "yet another RAG framework." It's that biological loop, &lt;strong&gt;ported one-to-one into engineering&lt;/strong&gt;:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;&lt;strong&gt;New things land in the "hippocampus" first; during "sleep" they're moved to the "cortex"; when you think, multiple paths light up and one point leads to a whole region; what goes unused slowly fades.&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F5qsmspk12f2otg6ovw4q.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F5qsmspk12f2otg6ovw4q.jpg" width="800" title="" id="zPOSw" height="537"&gt;&lt;/a&gt; &lt;em&gt;▲ The loop as a system: integration layer → four-stage memory loop → local storage&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Concretely:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Encode&lt;/strong&gt;: the first stop for every new memory isn't long-term storage — it's a working-memory buffer that maps to the "hippocampus." Fast, cheap, format-agnostic, no extra processing, readable and writable verbatim. This step is &lt;strong&gt;fully local and needs no external cloud service&lt;/strong&gt;.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Consolidate&lt;/strong&gt;: a "sleep-like" background job runs on a schedule every day (the system defaults to evening, as if drifting off), replaying that day's scattered notes in the buffer to a large model, session by session, as whole transcripts — the engineering version of the brain's "high-speed replay" during sleep. In a single pass the model decides: which category this memory belongs to, how to distill and rewrite it, how important it is, and how to resolve conflicts with existing memories. The cleaned-up result is written into long-term memory, then the buffer's drafts are cleared — emptying the hippocampus to make room for new experiences.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Activate&lt;/strong&gt;: when looking something up, three paths light up at once — "semantically close," "literal match," and "tag-related" are all recalled together, then merged and ranked. After a hit, it follows the context of the same conversation and its related tags to pull in "that whole region" too. This is the engineering version of "one point leads to a whole region, and partial cues still get completed."&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Forget&lt;/strong&gt;: how long a memory survives is decided dynamically by "how often it's used × how important it is × how long since it was last used." High-frequency, important memories are practically nailed into long-term storage; obscure, idle ones quickly collapse and get reclaimed. Literally &lt;strong&gt;use it or lose it&lt;/strong&gt;.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These four steps map, respectively, onto several well-known theories in cognitive science — Complementary Learning Systems (CLS), Tulving's multiple memory systems, sharp-wave-ripple replay during sleep, ACT-R activation, and the Ebbinghaus forgetting curve. We studied this research as the starting point for the design, and wired it into the code one piece at a time.&lt;/p&gt;
&lt;h2&gt;
  
  
  So does it actually work? Straight to the numbers
&lt;/h2&gt;

&lt;p&gt;However nice the mechanism sounds, it has to prove itself. The numbers below all come from public datasets, and they were produced by &lt;strong&gt;the same code that ships in the real product&lt;/strong&gt; — not some "lab-only special." What you install is the same build, and all of it is reproducible.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;LongMemEval (500 long-horizon memory questions, a widely used industry benchmark)&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Finding it&lt;/strong&gt; (retrieval recall): Top-5 hit &lt;strong&gt;99.0%&lt;/strong&gt;, Top-10 hit &lt;strong&gt;99.4%&lt;/strong&gt;; four of the six question types hit &lt;strong&gt;100%&lt;/strong&gt; outright.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Getting it right&lt;/strong&gt; (end-to-end answer accuracy): &lt;strong&gt;79.0%&lt;/strong&gt; — using the dataset's official neutral prompt with no task-specific tuning, already closing in on the theoretical ceiling (about 82%) you'd get &lt;em&gt;assuming&lt;/em&gt; retrieval were 100% perfect.&lt;/li&gt;
&lt;li&gt;On the same 500 questions under the same official protocol, Zep's published numbers are Top-10 retrieval hit &lt;strong&gt;95.5%&lt;/strong&gt; and answer accuracy &lt;strong&gt;71.2%&lt;/strong&gt; — same questions, same scoring; on both "finding it" and "getting it right," Hebb Mind comes out ahead.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F0sact0tekl8xu7ivfiwk.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2F0sact0tekl8xu7ivfiwk.png" width="800" title="" id="ZPVCo" height="507"&gt;&lt;/a&gt; &lt;em&gt;▲ Ability to find the evidence — ahead at every depth&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fa32ox68856005h55fnul.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fa32ox68856005h55fnul.png" width="800" title="" id="dBB2D" height="507"&gt;&lt;/a&gt; &lt;em&gt;▲ Accuracy when actually answering&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;LoCoMo (multi-turn long conversations spanning dozens of sessions, built to test "do you still remember what was said several sessions ago")&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;On what this dataset measures best — "can you find it" — Top-10 hit &lt;strong&gt;95.75%&lt;/strong&gt; (94.14% even with rerank off). Under the same model and the same config, it edges out the published MemPalace numbers.&lt;/p&gt;

&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fg0mwmz3vkyfrqjigy8e1.png" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fg0mwmz3vkyfrqjigy8e1.png" width="800" title="" id="Ih7Wo" height="507"&gt;&lt;/a&gt; &lt;em&gt;▲ LoCoMo recall, same-protocol comparison&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;These are just the two most representative datasets. If you want the full scores on every dataset (more benchmarks, with each table carrying its protocol, k value, version, embedding/rerank config, and the exact rerun command), the complete benchmark report is open — you're welcome to check it out:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;📊 &lt;strong&gt;Full benchmark report&lt;/strong&gt;: afx-team.github.io/hebb-mind/benchmarks/&lt;/p&gt;
&lt;/blockquote&gt;
&lt;h2&gt;
  
  
  Beyond the scores — how to choose
&lt;/h2&gt;

&lt;p&gt;Scores alone aren't enough. There are plenty of memory frameworks, each with its own approach. Put a few of the commonly compared ones side by side and it roughly looks like this:&lt;/p&gt;

&lt;div class="table-wrapper-paragraph"&gt;&lt;table&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th&gt;Framework&lt;/th&gt;
&lt;th&gt;Starts as&lt;/th&gt;
&lt;th&gt;Write needs an LLM?&lt;/th&gt;
&lt;th&gt;Active forgetting&lt;/th&gt;
&lt;th&gt;License&lt;/th&gt;
&lt;th&gt;Strength / when to pick it&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&lt;strong&gt;Hebb Mind&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;Local-first, zero external services&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;
&lt;strong&gt;No&lt;/strong&gt; — only consolidation does&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;Yes — staged, dynamic TTL&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;&lt;strong&gt;MIT&lt;/strong&gt;&lt;/td&gt;
&lt;td&gt;Local &amp;amp; private + faithful mechanism + full neuroscience loop&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;mem0&lt;/td&gt;
&lt;td&gt;Self-hosted / cloud, needs a vector store&lt;/td&gt;
&lt;td&gt;Yes&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;Apache core + paid cloud&lt;/td&gt;
&lt;td&gt;Earliest mover, widest ecosystem and integrations&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Zep / Graphiti&lt;/td&gt;
&lt;td&gt;Cloud-first; self-host needs a graph DB&lt;/td&gt;
&lt;td&gt;Yes — builds a graph on write&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;Apache-2.0 (Graphiti)&lt;/td&gt;
&lt;td&gt;Temporal reasoning + bitemporal knowledge graph&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Letta (MemGPT)&lt;/td&gt;
&lt;td&gt;Self-hosted / cloud, needs Postgres&lt;/td&gt;
&lt;td&gt;Yes — agent runtime&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;Apache-2.0&lt;/td&gt;
&lt;td&gt;Stateful-agent runtime + self-editing memory&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;Cognee&lt;/td&gt;
&lt;td&gt;Self-hosted (graph + vector backend)&lt;/td&gt;
&lt;td&gt;Yes — LLM extraction&lt;/td&gt;
&lt;td&gt;No&lt;/td&gt;
&lt;td&gt;Apache core&lt;/td&gt;
&lt;td&gt;Vector + graph hybrid, pluggable backends&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;&lt;/div&gt;

&lt;p&gt;&lt;em&gt;▲ A factual comparison, not a benchmark — each tool has its place&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;mem0&lt;/strong&gt; got out early and is widely adopted; &lt;strong&gt;Zep&lt;/strong&gt;'s temporal knowledge graph is genuinely strong on "when did it happen, what came before what" questions; &lt;strong&gt;Letta&lt;/strong&gt; turned the "stateful agent" into a complete runtime; &lt;strong&gt;MemPalace&lt;/strong&gt; is easy to integrate. Most of them are cloud-shaped, and a write requires a model call up front — if you want something managed with a ready-made ecosystem, any of them is a reasonable pick.&lt;/p&gt;

&lt;p&gt;Hebb Mind, by contrast, aims to be more biologically faithful in mechanism and more elegant in engineering: &lt;strong&gt;local-first, multi-agent compatible, a complete memory loop, MIT-licensed&lt;/strong&gt; — and right now, the only row that checks all four boxes at once is Hebb Mind.&lt;/p&gt;
&lt;h2&gt;
  
  
  Three commands, running locally
&lt;/h2&gt;

&lt;p&gt;It's not just the numbers that hold up — getting from install to running is just as simple:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;pipx install hebb-mind
hebb setup
hebb service install
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Once installed, it runs as an OS-level background service (macOS / Linux / Windows all supported, user-level by default, no admin rights needed), bringing up a local API and a web console — open it in your browser to manage memories by hand.&lt;/p&gt;

&lt;p&gt;More importantly, Hebb Mind has two core strengths:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Local-first, zero external services&lt;/strong&gt;: writes and retrieval are fully offline, running on the bundled local embedding model. Only when the "consolidation" step actually needs to do its work do you need to configure a large-model key (skip it and nothing errors out — that step just silently no-ops). At the model layer it supports common LLMs including Claude / GPT / Llama / Qwen / GLM / Kimi.&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Built for agents, not for chat&lt;/strong&gt;: native integration with Claude Code and Codex — one command to install, and your coding assistant remembers your project, your preferences, and the traps you've hit across sessions, with no need to re-explain every time.&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Once installed, you have a local service: a memory loop that organizes itself and also forgets, running without any cloud.&lt;/strong&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Wiring it into your coding assistant: Claude Code / Codex
&lt;/h2&gt;

&lt;h3&gt;
  
  
  Claude Code integration
&lt;/h3&gt;

&lt;p&gt;&lt;strong&gt;Claude Code is the deepest integration, giving you two layers at once.&lt;/strong&gt; One is the &lt;strong&gt;MCP tools&lt;/strong&gt; — letting Claude actively call &lt;code&gt;write_memory&lt;/code&gt; / &lt;code&gt;search_memory&lt;/code&gt; / &lt;code&gt;consolidate&lt;/code&gt; inside a conversation, deciding for itself what to record and what to look up. The other is the &lt;strong&gt;Hooks auto-memory layer&lt;/strong&gt;, the invisible one: when a session opens, cross-session memories are recalled and injected into context automatically; every message you send is automatically de-noised, deduplicated, and written to memory; and when the session ends, consolidation is triggered automatically. No intervention required.&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;hebb claude-code install --scope user    # injects hooks + MCP; restart Claude Code to apply
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;Restart Claude Code after installing and it takes effect. From then on it remembers your project, preferences, and the traps you've hit across sessions — no need to re-explain every time.&lt;/p&gt;

&lt;h3&gt;
  
  
  Codex integration
&lt;/h3&gt;

&lt;p&gt;Codex goes through MCP, and the command Hebb Mind provides installs it automatically:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;hebb codex install --scope user
codex mcp list                          # verify the install succeeded
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;After installation, Codex can call tools like &lt;code&gt;write_memory&lt;/code&gt; / &lt;code&gt;search_memory&lt;/code&gt; / &lt;code&gt;consolidate&lt;/code&gt; / &lt;code&gt;ingest_conversation&lt;/code&gt;.&lt;/p&gt;

&lt;h3&gt;
  
  
  Web console
&lt;/h3&gt;

&lt;p&gt;To make management easier, we also ship a web console. Run the following command to open it:&lt;br&gt;
&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;hebb console
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;



&lt;p&gt;&lt;a href="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fchg5582a839shiewipwe.jpg" class="article-body-image-wrapper"&gt;&lt;img src="https://media2.dev.to/dynamic/image/width=800%2Cheight=%2Cfit=scale-down%2Cgravity=auto%2Cformat=auto/https%3A%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Farticles%2Fchg5582a839shiewipwe.jpg" width="800" title="" id="ucedbd30d" height="499"&gt;&lt;/a&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  Closing thoughts
&lt;/h2&gt;

&lt;p&gt;&lt;strong&gt;An agent's memory shouldn't be just "storage + retrieval."&lt;/strong&gt; What makes the brain truly remarkable is that it organizes during sleep, forgets on purpose, and leaps from one point to a whole region. That's the loop we've open-sourced — it's still early, but you've seen the numbers above, and we're walking this direction on steady ground.&lt;/p&gt;

&lt;p&gt;If you also think "AI memory" matters, there's a lot left for us to polish — come kick the tires, open PRs, and drop a star 🌟:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;GitHub&lt;/strong&gt;: &lt;a href="https://github.com/afx-team/hebb-mind" rel="noopener noreferrer"&gt;https://github.com/afx-team/hebb-mind&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Docs&lt;/strong&gt;: &lt;a href="https://afx-team.github.io/hebb-mind/quick-start.html" rel="noopener noreferrer"&gt;https://afx-team.github.io/hebb-mind/quick-start.html&lt;/a&gt;
&lt;/li&gt;
&lt;li&gt;Install: &lt;code&gt;pip install hebb-mind&lt;/code&gt; or &lt;code&gt;pipx install hebb-mind&lt;/code&gt;
&lt;/li&gt;
&lt;/ul&gt;

</description>
      <category>ai</category>
      <category>memory</category>
      <category>agents</category>
    </item>
  </channel>
</rss>
