<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: blaze</title>
    <description>The latest articles on DEV Community by blaze (@rongrong).</description>
    <link>https://dev.to/rongrong</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.us-east-2.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F4006484%2F3be9f5b4-4a23-405d-ace3-274f5cf46473.png</url>
      <title>DEV Community: blaze</title>
      <link>https://dev.to/rongrong</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/rongrong"/>
    <language>en</language>
    <item>
      <title>A Four-Type Framework for LLM Wiki by karpathy</title>
      <dc:creator>blaze</dc:creator>
      <pubDate>Sun, 28 Jun 2026 12:32:33 +0000</pubDate>
      <link>https://dev.to/rongrong/a-four-type-framework-for-llm-wiki-by-karpathy-5f1n</link>
      <guid>https://dev.to/rongrong/a-four-type-framework-for-llm-wiki-by-karpathy-5f1n</guid>
      <description>&lt;h1&gt;
  
  
  Why Knowledge Alone Doesn't Create Judgment
&lt;/h1&gt;

&lt;p&gt;Karpathy's LLM Wiki is brilliant. You dump raw material in, an LLM extracts concepts and links them together, and you get a personal knowledge base that actually works.&lt;/p&gt;

&lt;p&gt;I built one. 100+ pages. It's great.&lt;/p&gt;

&lt;p&gt;But I hit a wall that made me rethink everything.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Wall
&lt;/h2&gt;

&lt;p&gt;I asked my AI to act as a programming tutor. It could recite every concept perfectly.&lt;/p&gt;

&lt;p&gt;Student: &lt;em&gt;"I don't understand Promises."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;AI: &lt;em&gt;"A Promise is an object representing the eventual completion or failure of an asynchronous operation..."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;Wrong answer. The right answer was: &lt;em&gt;"Do you understand callbacks first? What about synchronous execution? What have you tried so far?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The AI had knowledge. It had zero judgment.&lt;/p&gt;

&lt;p&gt;And then I realized why: &lt;strong&gt;every single page in my wiki was the same type of knowledge.&lt;/strong&gt;&lt;/p&gt;

&lt;h2&gt;
  
  
  One Type vs Four
&lt;/h2&gt;

&lt;p&gt;LLM Wiki 1.0 stores declarative knowledge — facts, definitions, summaries. Things that answer "What is this?"&lt;/p&gt;

&lt;p&gt;But think about what makes a human expert different from a textbook:&lt;/p&gt;

&lt;p&gt;A great programming mentor doesn't just know what Promises are. They know &lt;strong&gt;why you teach callback → Promise → async/await in that exact order&lt;/strong&gt; — and never the reverse. That's not a fact. It's a reasoning path.&lt;/p&gt;

&lt;p&gt;A master astrologer doesn't just know what each star represents. They know &lt;strong&gt;why you check 命宮 first, then 三方四正, when to prioritize 格局, when a palace is a consequence rather than a cause.&lt;/strong&gt; That's not a fact either. It's a decision sequence.&lt;/p&gt;

&lt;p&gt;And here's the kicker: &lt;strong&gt;even knowing the reasoning path isn't enough.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;We annotated Anderson's (1972) Socratic tutoring dialogues — full 41-turn and 30-turn conversations, labeling every decision point. Knowing the 23 Socratic rules (the reasoning path) is one thing. Reading a complete dialogue — watching the expert set a trap, wait 15 seconds in silence, break their own rules when the student gets frustrated — is something else entirely.&lt;/p&gt;

&lt;p&gt;Read the 《Complete Book of Psychology》 ≠ know how to use or teach.&lt;/p&gt;

&lt;p&gt;And there's still one more type.&lt;/p&gt;

&lt;p&gt;Student says: &lt;em&gt;"I have no motivation lately."&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;A knowledge-based response: &lt;em&gt;"Here are the top 5 causes of low motivation..."&lt;/em&gt;&lt;br&gt;
but....it's useless, LLM don't know how to resolve the problem.&lt;br&gt;
it just explaining a concept.&lt;br&gt;
So,a more suitable response for this scenario is: &lt;em&gt;"When was the first time you noticed this? What makes you think so?"&lt;/em&gt;&lt;/p&gt;

&lt;p&gt;The expert isn't answering. They're diagnosing. They know that "no motivation" is a surface symptom — the real problem could be burnout, unclear goals, a specific failure, or something else. Until you know which, any advice is a guess.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;That's four distinct types of knowledge:&lt;/strong&gt;&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;
&lt;strong&gt;Declarative&lt;/strong&gt; — What is true (facts, concepts, definitions)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Procedural&lt;/strong&gt; — How to reason (expert decision sequences, why X before Y)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Experiential&lt;/strong&gt; — How it's actually done (complete worked examples with mistakes visible)&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Interaction&lt;/strong&gt; — How to guide (what to ask next, when to tell vs wait)&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;LLM Wiki 1.0 only stores type 1.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Evidence Is Brutal
&lt;/h2&gt;

&lt;p&gt;WashU researchers analyzed 98 real CS TA sessions — 17 hours, 8,203 utterances.&lt;/p&gt;

&lt;p&gt;Socratic questioning (guided reasoning, diagnostic probes): &lt;strong&gt;0.6%.&lt;/strong&gt;&lt;br&gt;
TAs directly giving the answer: &lt;strong&gt;75%.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;These TAs knew the method. They were trained. Under time pressure, they defaulted to giving answers anyway.&lt;/p&gt;

&lt;p&gt;Knowing the rules ≠ being able to execute them.&lt;/p&gt;

&lt;p&gt;That gap — between knowing and executing — is exactly where procedural, experiential, and interaction knowledge live. If you don't store those types, you can't train them. If you can't train them, you can't execute under pressure.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Missing Operation
&lt;/h2&gt;

&lt;p&gt;Karpathy's framework has one operation: &lt;strong&gt;ingest&lt;/strong&gt; — extract facts from raw material.&lt;/p&gt;

&lt;p&gt;That produces declarative knowledge beautifully. But you can't get reasoning paths, worked examples, or guidance strategies by looking for facts. You have to look for &lt;strong&gt;decisions&lt;/strong&gt; — what did the expert choose, when, and what followed?&lt;/p&gt;

&lt;p&gt;We added a second operation: &lt;strong&gt;mine.&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;ingest&lt;/strong&gt; looks for facts → Declarative Knowledge&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;mine&lt;/strong&gt; looks for decisions → Procedural, Experiential, Interaction Knowledge&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Same raw material. Completely different extraction target.&lt;/p&gt;

&lt;h2&gt;
  
  
  What This Looks Like In Practice
&lt;/h2&gt;

&lt;p&gt;Over two weeks, we mined five teaching case studies:&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Procedural frameworks extracted:&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Anderson's 23 Socratic Rules — complete tutoring cycle in 6 groups&lt;/li&gt;
&lt;li&gt;One-Minute Preceptor — clinical medicine's "diagnose before you teach" framework&lt;/li&gt;
&lt;li&gt;Socratic Debugging 7 Steps — "don't touch the keyboard, guide to cognitive dissonance"&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Experiential cases annotated (decision-point level, not summaries):&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;41-turn scientific reasoning dialogue — trap design, "don't say you're wrong"&lt;/li&gt;
&lt;li&gt;30-turn moral reasoning dialogue — counter-example strategy, breakthrough moment&lt;/li&gt;
&lt;li&gt;1-hour CMU math tutoring — "Tell Your Reader" metaphor, progressive correction&lt;/li&gt;
&lt;li&gt;WashU 98-session negative case — why Socratic method fails in practice&lt;/li&gt;
&lt;li&gt;MathDial 3,000-dialogue taxonomy — Focus / Probe / Tell / Generic decision model&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;strong&gt;Interaction pattern (emerging):&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;A decision tree: when the student is stuck → narrow the problem (Focus). When they answer but reasoning is unclear → deepen understanding (Probe). When Focus + Probe cycle fails twice → give a strategy hint, not the answer (Tell).&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  This Isn't Just About Teaching
&lt;/h2&gt;

&lt;p&gt;The four-type distinction applies wherever expertise exists:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;
&lt;strong&gt;Medical diagnosis:&lt;/strong&gt; Disease definitions → diagnostic reasoning sequence → grand rounds presentations → how to guide a resident&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Philosophy mentoring:&lt;/strong&gt; What Heidegger said → when to bring up Stoicism instead → full dialogue transcripts → when to stay silent&lt;/li&gt;
&lt;li&gt;
&lt;strong&gt;Growth coaching:&lt;/strong&gt; Motivation theories → when to probe vs reframe → full session transcripts → "When did you first notice this?"&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;In every domain, experts have all four types. Knowledge bases only capture the first.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Point
&lt;/h2&gt;

&lt;p&gt;The next generation of AI won't be defined by larger knowledge bases.&lt;/p&gt;

&lt;p&gt;It will be defined by better reasoning, better teaching, and better judgment.&lt;/p&gt;

&lt;p&gt;Those don't come from more declarative knowledge. They come from organizing knowledge differently.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Judgment isn't a knowledge problem. It's a knowledge-type problem.&lt;/strong&gt;&lt;/p&gt;




&lt;p&gt;&lt;em&gt;Built on @karpathy's LLM Wiki foundation. The idea of "mine" as a second operation is what's new here — ingest extracts facts, mine extracts decisions. If you're building an AI tutor, a knowledge system, or anything that needs judgment, the four-type checklist might save you months.&lt;/em&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>llm</category>
      <category>learning</category>
      <category>agents</category>
    </item>
  </channel>
</rss>
