<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/">
  <channel>
    <title>DEV Community: zeromathai</title>
    <description>The latest articles on DEV Community by zeromathai (@zeromathai).</description>
    <link>https://dev.to/zeromathai</link>
    <image>
      <url>https://media2.dev.to/dynamic/image/width=90,height=90,fit=cover,gravity=auto,format=auto/https:%2F%2Fdev-to-uploads.s3.amazonaws.com%2Fuploads%2Fuser%2Fprofile_image%2F3872570%2Fc7bba9ef-1a14-44b5-a02d-f6720ab48ab8.png</url>
      <title>DEV Community: zeromathai</title>
      <link>https://dev.to/zeromathai</link>
    </image>
    <atom:link rel="self" type="application/rss+xml" href="https://dev.to/feed/zeromathai"/>
    <language>en</language>
    <item>
      <title>What AI Really Is — From Turing Test to Deep Learning</title>
      <dc:creator>zeromathai</dc:creator>
      <pubDate>Mon, 18 May 2026 23:57:30 +0000</pubDate>
      <link>https://dev.to/zeromathai/what-ai-really-is-from-turing-test-to-deep-learning-39o2</link>
      <guid>https://dev.to/zeromathai/what-ai-really-is-from-turing-test-to-deep-learning-39o2</guid>
      <description>&lt;p&gt;AI is not just chatbots or neural networks.&lt;/p&gt;

&lt;p&gt;It is a long-running attempt to answer one question:&lt;/p&gt;

&lt;p&gt;Can a machine behave intelligently?&lt;/p&gt;

&lt;p&gt;That question shaped everything from symbolic AI to modern deep learning.&lt;/p&gt;

&lt;h2&gt;
  
  
  Core Idea
&lt;/h2&gt;

&lt;p&gt;Artificial Intelligence is the field of building systems that can perform tasks requiring intelligence.&lt;/p&gt;

&lt;p&gt;That can include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;reasoning&lt;/li&gt;
&lt;li&gt;learning&lt;/li&gt;
&lt;li&gt;planning&lt;/li&gt;
&lt;li&gt;perception&lt;/li&gt;
&lt;li&gt;language understanding&lt;/li&gt;
&lt;li&gt;decision-making&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;But AI is not one single technique.&lt;/p&gt;

&lt;p&gt;It is a collection of paradigms.&lt;/p&gt;

&lt;p&gt;Different eras of AI tried different answers to the same question.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Key Structure
&lt;/h2&gt;

&lt;p&gt;A simple map of AI looks like this:&lt;/p&gt;

&lt;p&gt;Turing Test → Symbolic AI → AI Winter → Neural Networks → Deep Learning → Modern AI&lt;/p&gt;

&lt;p&gt;More compactly:&lt;/p&gt;

&lt;p&gt;AI = reasoning systems + learning systems + decision systems&lt;/p&gt;

&lt;p&gt;The important shift is this:&lt;/p&gt;

&lt;p&gt;AI moved from hand-coded rules toward data-driven learning.&lt;/p&gt;

&lt;p&gt;That shift explains why modern AI looks so different from early AI.&lt;/p&gt;

&lt;h2&gt;
  
  
  Implementation View
&lt;/h2&gt;

&lt;p&gt;At a high level, an AI system often works like this:&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;receive input from the environment

represent the problem internally

apply rules, search, or learned patterns

make a prediction or decision

act or generate an output

improve through feedback or training
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;p&gt;This is why AI is broader than one model.&lt;/p&gt;

&lt;p&gt;A chatbot, a search algorithm, and a recommendation system may look different.&lt;/p&gt;

&lt;p&gt;But they all transform input into decisions or outputs.&lt;/p&gt;

&lt;h2&gt;
  
  
  Concrete Example
&lt;/h2&gt;

&lt;p&gt;Imagine a spam detection system.&lt;/p&gt;

&lt;p&gt;A symbolic AI approach might use explicit rules:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;if subject contains suspicious phrase, increase risk&lt;/li&gt;
&lt;li&gt;if sender is unknown, increase risk&lt;/li&gt;
&lt;li&gt;if many links exist, increase risk&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;A machine learning approach learns patterns from labeled examples.&lt;/p&gt;

&lt;p&gt;A deep learning approach may learn internal representations directly from text.&lt;/p&gt;

&lt;p&gt;Same task.&lt;/p&gt;

&lt;p&gt;Different AI paradigm.&lt;/p&gt;

&lt;h2&gt;
  
  
  Symbolic AI vs Connectionism
&lt;/h2&gt;

&lt;p&gt;This is one of the most important comparisons in AI history.&lt;/p&gt;

&lt;p&gt;Symbolic AI:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;uses explicit rules and logic&lt;/li&gt;
&lt;li&gt;represents knowledge with symbols&lt;/li&gt;
&lt;li&gt;is easier to inspect&lt;/li&gt;
&lt;li&gt;struggles with messy real-world data&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Connectionism:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;uses neural-network-style learning&lt;/li&gt;
&lt;li&gt;learns patterns from data&lt;/li&gt;
&lt;li&gt;handles complex inputs better&lt;/li&gt;
&lt;li&gt;can be harder to interpret&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Symbolic AI asks:&lt;/p&gt;

&lt;p&gt;“What rules should the system follow?”&lt;/p&gt;

&lt;p&gt;Connectionism asks:&lt;/p&gt;

&lt;p&gt;“What patterns can the system learn from data?”&lt;/p&gt;

&lt;p&gt;Modern AI is strongly shaped by the second question.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why AI Winter Happened
&lt;/h2&gt;

&lt;p&gt;AI did not grow in a straight line.&lt;/p&gt;

&lt;p&gt;Early expectations were extremely high.&lt;/p&gt;

&lt;p&gt;But hardware, data, algorithms, and practical results could not always keep up.&lt;/p&gt;

&lt;p&gt;This led to periods known as AI winters.&lt;/p&gt;

&lt;p&gt;The important lesson is simple:&lt;/p&gt;

&lt;p&gt;AI progress depends on more than ideas.&lt;/p&gt;

&lt;p&gt;It also depends on compute, data, algorithms, and realistic expectations.&lt;/p&gt;

&lt;p&gt;That is why modern AI surged when those conditions improved together.&lt;/p&gt;

&lt;h2&gt;
  
  
  Where Current AI Stands
&lt;/h2&gt;

&lt;p&gt;Most current AI systems are narrow AI.&lt;/p&gt;

&lt;p&gt;They perform specific tasks well.&lt;/p&gt;

&lt;p&gt;Examples:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;image recognition&lt;/li&gt;
&lt;li&gt;translation&lt;/li&gt;
&lt;li&gt;recommendation&lt;/li&gt;
&lt;li&gt;text generation&lt;/li&gt;
&lt;li&gt;code assistance&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;They are not general human-level intelligence.&lt;/p&gt;

&lt;p&gt;That distinction matters.&lt;/p&gt;

&lt;p&gt;Narrow AI solves defined tasks.&lt;/p&gt;

&lt;p&gt;AGI would be able to generalize across many domains more like a human.&lt;/p&gt;

&lt;p&gt;Superintelligence would go beyond human-level cognitive ability.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Modern AI Became So Powerful
&lt;/h2&gt;

&lt;p&gt;Modern AI grew because several ideas converged:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;neural networks&lt;/li&gt;
&lt;li&gt;deep learning&lt;/li&gt;
&lt;li&gt;large datasets&lt;/li&gt;
&lt;li&gt;GPUs and accelerators&lt;/li&gt;
&lt;li&gt;representation learning&lt;/li&gt;
&lt;li&gt;Transformer architectures&lt;/li&gt;
&lt;li&gt;large language models&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The big change was representation learning.&lt;/p&gt;

&lt;p&gt;Instead of manually defining every feature, models learned useful internal structures from data.&lt;/p&gt;

&lt;p&gt;That made AI much more flexible.&lt;/p&gt;

&lt;h2&gt;
  
  
  Technical vs Philosophical AI
&lt;/h2&gt;

&lt;p&gt;AI also raises deeper questions.&lt;/p&gt;

&lt;p&gt;Can a system follow rules without understanding?&lt;/p&gt;

&lt;p&gt;Does producing intelligent behavior mean it has intelligence?&lt;/p&gt;

&lt;p&gt;Where do choice, intention, and consciousness fit?&lt;/p&gt;

&lt;p&gt;These questions appear in debates like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Chinese Room Argument&lt;/li&gt;
&lt;li&gt;strong AI vs weak AI&lt;/li&gt;
&lt;li&gt;free will discussions&lt;/li&gt;
&lt;li&gt;AGI and superintelligence&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;You do not need to answer them first.&lt;/p&gt;

&lt;p&gt;But they explain why AI is not only an engineering topic.&lt;/p&gt;

&lt;p&gt;It is also a question about mind and intelligence.&lt;/p&gt;

&lt;h2&gt;
  
  
  Recommended Learning Order
&lt;/h2&gt;

&lt;p&gt;If AI feels too broad, learn it in this order:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Turing Test&lt;/li&gt;
&lt;li&gt;AI Paradigms&lt;/li&gt;
&lt;li&gt;Symbolic AI&lt;/li&gt;
&lt;li&gt;Connectionism&lt;/li&gt;
&lt;li&gt;AI Winter&lt;/li&gt;
&lt;li&gt;Neural Networks&lt;/li&gt;
&lt;li&gt;Deep Learning&lt;/li&gt;
&lt;li&gt;Narrow AI vs Broad AI&lt;/li&gt;
&lt;li&gt;AGI&lt;/li&gt;
&lt;li&gt;Singularity&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This order works because you first understand the question.&lt;/p&gt;

&lt;p&gt;Then you understand the paradigm shift.&lt;/p&gt;

&lt;p&gt;Then you connect it to modern AI.&lt;/p&gt;

&lt;h2&gt;
  
  
  Takeaway
&lt;/h2&gt;

&lt;p&gt;AI is not one algorithm.&lt;/p&gt;

&lt;p&gt;It is a field built around machines that reason, learn, decide, or act intelligently.&lt;/p&gt;

&lt;p&gt;The shortest version is:&lt;/p&gt;

&lt;p&gt;AI = systems that turn information into intelligent behavior&lt;/p&gt;

&lt;p&gt;Symbolic AI uses rules.&lt;/p&gt;

&lt;p&gt;Connectionist AI learns patterns.&lt;/p&gt;

&lt;p&gt;Modern AI is largely powered by neural networks, deep learning, and large-scale data.&lt;/p&gt;

&lt;p&gt;If you remember one idea, remember this:&lt;/p&gt;

&lt;p&gt;AI evolved from asking machines to follow rules into training machines to learn patterns from data.&lt;/p&gt;

&lt;h2&gt;
  
  
  Discussion
&lt;/h2&gt;

&lt;p&gt;When explaining AI to beginners, do you start from the Turing Test and history, or from modern examples like neural networks and LLMs?&lt;/p&gt;

&lt;p&gt;Originally published at zeromathai.com.&lt;br&gt;
Original article: &lt;a href="https://zeromathai.com/en/ai-overview-hub-en/" rel="noopener noreferrer"&gt;https://zeromathai.com/en/ai-overview-hub-en/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;GitHub Resources&lt;br&gt;
AI diagrams, study notes, and visual guides:&lt;br&gt;
&lt;a href="https://github.com/zeromathai/zeromathai-ai" rel="noopener noreferrer"&gt;https://github.com/zeromathai/zeromathai-ai&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>machinelearning</category>
      <category>deeplearning</category>
    </item>
    <item>
      <title>How RNNs Work — Remembering Previous States in Sequential Data</title>
      <dc:creator>zeromathai</dc:creator>
      <pubDate>Mon, 18 May 2026 23:57:02 +0000</pubDate>
      <link>https://dev.to/zeromathai/how-rnns-work-remembering-previous-states-in-sequential-data-560o</link>
      <guid>https://dev.to/zeromathai/how-rnns-work-remembering-previous-states-in-sequential-data-560o</guid>
      <description>&lt;p&gt;A normal neural network treats each input mostly as a fixed snapshot.&lt;/p&gt;

&lt;p&gt;But many problems are not snapshots.&lt;/p&gt;

&lt;p&gt;Text, speech, and time-series data depend on order.&lt;/p&gt;

&lt;p&gt;That is why RNNs exist.&lt;/p&gt;

&lt;h2&gt;
  
  
  Core Idea
&lt;/h2&gt;

&lt;p&gt;A Recurrent Neural Network is designed for sequential data.&lt;/p&gt;

&lt;p&gt;It does not only look at the current input.&lt;/p&gt;

&lt;p&gt;It also carries information from previous steps.&lt;/p&gt;

&lt;p&gt;That carried information is called the hidden state.&lt;/p&gt;

&lt;p&gt;So an RNN can process a sequence one step at a time while keeping memory of what came before.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Key Structure
&lt;/h2&gt;

&lt;p&gt;A simple RNN flow looks like this:&lt;/p&gt;

&lt;p&gt;Previous Hidden State + Current Input → New Hidden State → Output&lt;/p&gt;

&lt;p&gt;More compactly:&lt;/p&gt;

&lt;p&gt;RNN = current input + previous state&lt;/p&gt;

&lt;p&gt;At each time step:&lt;/p&gt;

&lt;p&gt;h_t = f(x_t, h_{t-1})&lt;/p&gt;

&lt;p&gt;Where:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;x_t = input at the current time step&lt;/li&gt;
&lt;li&gt;h_{t-1} = previous hidden state&lt;/li&gt;
&lt;li&gt;h_t = updated hidden state&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This recurrence is the core mechanism.&lt;/p&gt;

&lt;h2&gt;
  
  
  Implementation View
&lt;/h2&gt;

&lt;p&gt;At a high level, an RNN processes a sequence like this:&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;initialize hidden state

for each time step in the sequence:
    read current input

    combine it with previous hidden state

    update hidden state

    optionally produce output

return final output or all outputs
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;p&gt;This is why RNNs are useful for ordered data.&lt;/p&gt;

&lt;p&gt;The model can carry context forward.&lt;/p&gt;

&lt;p&gt;It does not restart from zero at every step.&lt;/p&gt;

&lt;h2&gt;
  
  
  Concrete Example
&lt;/h2&gt;

&lt;p&gt;Imagine a sentence:&lt;/p&gt;

&lt;p&gt;I love machine learning&lt;/p&gt;

&lt;p&gt;A basic feedforward network may process words as independent inputs.&lt;/p&gt;

&lt;p&gt;But an RNN reads them in order.&lt;/p&gt;

&lt;p&gt;Step 1:&lt;/p&gt;

&lt;p&gt;I&lt;/p&gt;

&lt;p&gt;Step 2:&lt;/p&gt;

&lt;p&gt;I love&lt;/p&gt;

&lt;p&gt;Step 3:&lt;/p&gt;

&lt;p&gt;I love machine&lt;/p&gt;

&lt;p&gt;Step 4:&lt;/p&gt;

&lt;p&gt;I love machine learning&lt;/p&gt;

&lt;p&gt;At each step, the hidden state carries previous context.&lt;/p&gt;

&lt;p&gt;That is how the model remembers earlier words while reading later ones.&lt;/p&gt;

&lt;h2&gt;
  
  
  Standard Neural Network vs RNN
&lt;/h2&gt;

&lt;p&gt;This comparison makes the difference clear.&lt;/p&gt;

&lt;p&gt;Standard neural network:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;processes fixed-size input&lt;/li&gt;
&lt;li&gt;has no built-in memory across time&lt;/li&gt;
&lt;li&gt;works well for static feature vectors&lt;/li&gt;
&lt;li&gt;does not naturally model order&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;RNN:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;processes sequences step by step&lt;/li&gt;
&lt;li&gt;carries hidden state forward&lt;/li&gt;
&lt;li&gt;models temporal or ordered dependence&lt;/li&gt;
&lt;li&gt;fits text, speech, and time-series tasks&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The key difference is state.&lt;/p&gt;

&lt;p&gt;A standard network transforms input.&lt;/p&gt;

&lt;p&gt;An RNN transforms input while remembering previous context.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Hidden State Matters
&lt;/h2&gt;

&lt;p&gt;The hidden state is the memory of the RNN.&lt;/p&gt;

&lt;p&gt;It is not memory in the human sense.&lt;/p&gt;

&lt;p&gt;It is a vector that summarizes previous information.&lt;/p&gt;

&lt;p&gt;At each step, the hidden state is updated.&lt;/p&gt;

&lt;p&gt;That updated state influences the next step.&lt;/p&gt;

&lt;p&gt;This lets the model capture patterns like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;word order&lt;/li&gt;
&lt;li&gt;temporal trends&lt;/li&gt;
&lt;li&gt;repeated signals&lt;/li&gt;
&lt;li&gt;dependency across earlier and later inputs&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Without hidden state, the sequence becomes just a list of disconnected inputs.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Deep RNNs Exist
&lt;/h2&gt;

&lt;p&gt;A basic RNN can model sequences.&lt;/p&gt;

&lt;p&gt;But some patterns are more complex.&lt;/p&gt;

&lt;p&gt;A Deep RNN stacks recurrent layers.&lt;/p&gt;

&lt;p&gt;That allows the model to build richer sequence representations.&lt;/p&gt;

&lt;p&gt;Basic RNN:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;one recurrent layer&lt;/li&gt;
&lt;li&gt;simpler sequence modeling&lt;/li&gt;
&lt;li&gt;easier to understand&lt;/li&gt;
&lt;li&gt;limited representational depth&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Deep RNN:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;multiple recurrent layers&lt;/li&gt;
&lt;li&gt;more expressive sequence modeling&lt;/li&gt;
&lt;li&gt;can capture more complex temporal patterns&lt;/li&gt;
&lt;li&gt;harder to train&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So Deep RNNs extend the same idea.&lt;/p&gt;

&lt;p&gt;They do not replace recurrence.&lt;/p&gt;

&lt;p&gt;They deepen it.&lt;/p&gt;

&lt;h2&gt;
  
  
  Where RNNs Fit in Deep Learning
&lt;/h2&gt;

&lt;p&gt;RNNs became important because different data types need different architectures.&lt;/p&gt;

&lt;p&gt;CNNs work well for images because images have spatial structure.&lt;/p&gt;

&lt;p&gt;RNNs work well for sequences because sequences have order.&lt;/p&gt;

&lt;p&gt;CNN:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;local spatial patterns&lt;/li&gt;
&lt;li&gt;image-centered tasks&lt;/li&gt;
&lt;li&gt;convolution kernels&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;RNN:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;temporal or sequential patterns&lt;/li&gt;
&lt;li&gt;text, speech, time series&lt;/li&gt;
&lt;li&gt;recurrent hidden state&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is why RNNs became one of the major deep learning architectures.&lt;/p&gt;

&lt;p&gt;They match the structure of sequential data.&lt;/p&gt;

&lt;h2&gt;
  
  
  Practical Limits
&lt;/h2&gt;

&lt;p&gt;RNNs are powerful, but they have limits.&lt;/p&gt;

&lt;p&gt;Long sequences are hard.&lt;/p&gt;

&lt;p&gt;Information from early steps can weaken over time.&lt;/p&gt;

&lt;p&gt;Training can become unstable because gradients must pass through many time steps.&lt;/p&gt;

&lt;p&gt;This is one reason later architectures became important.&lt;/p&gt;

&lt;p&gt;Attention mechanisms and Transformers changed the landscape by making long-range relationships easier to model.&lt;/p&gt;

&lt;p&gt;But RNNs remain the best starting point for understanding sequence modeling.&lt;/p&gt;

&lt;h2&gt;
  
  
  Recommended Learning Order
&lt;/h2&gt;

&lt;p&gt;If RNNs feel abstract, learn them in this order:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Neural Network&lt;/li&gt;
&lt;li&gt;Recurrent Neural Network&lt;/li&gt;
&lt;li&gt;Hidden State&lt;/li&gt;
&lt;li&gt;Deep RNN&lt;/li&gt;
&lt;li&gt;CNN vs RNN comparison&lt;/li&gt;
&lt;li&gt;Attention Mechanism&lt;/li&gt;
&lt;li&gt;Transformer&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This order works because you first understand normal neural networks.&lt;/p&gt;

&lt;p&gt;Then you see what changes when order matters.&lt;/p&gt;

&lt;p&gt;Then you understand why modern sequence models moved beyond basic recurrence.&lt;/p&gt;

&lt;h2&gt;
  
  
  Takeaway
&lt;/h2&gt;

&lt;p&gt;An RNN is a neural network designed for sequences.&lt;/p&gt;

&lt;p&gt;The shortest version is:&lt;/p&gt;

&lt;p&gt;RNN = current input + previous hidden state&lt;/p&gt;

&lt;p&gt;It reads data step by step.&lt;/p&gt;

&lt;p&gt;It carries context forward.&lt;/p&gt;

&lt;p&gt;It uses that context to make better predictions on ordered data.&lt;/p&gt;

&lt;p&gt;If you remember one idea, remember this:&lt;/p&gt;

&lt;p&gt;RNNs make neural networks sequence-aware by passing hidden state from one time step to the next.&lt;/p&gt;

&lt;h2&gt;
  
  
  Discussion
&lt;/h2&gt;

&lt;p&gt;When learning sequence models, do you find it easier to start from RNNs first, or jump directly to Attention and Transformers?&lt;/p&gt;

&lt;p&gt;Originally published at zeromathai.com.&lt;br&gt;
Original article: &lt;a href="https://zeromathai.com/en/rnn-complete-hub-en/" rel="noopener noreferrer"&gt;https://zeromathai.com/en/rnn-complete-hub-en/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;GitHub Resources&lt;br&gt;
AI diagrams, study notes, and visual guides:&lt;br&gt;
&lt;a href="https://github.com/zeromathai/zeromathai-ai" rel="noopener noreferrer"&gt;https://github.com/zeromathai/zeromathai-ai&lt;/a&gt;&lt;/p&gt;

</description>
      <category>machinelearning</category>
      <category>ai</category>
      <category>deeplearning</category>
      <category>neuralnetworks</category>
    </item>
    <item>
      <title>How Probabilistic Reasoning Works — From Evidence to Better Beliefs</title>
      <dc:creator>zeromathai</dc:creator>
      <pubDate>Mon, 18 May 2026 00:07:43 +0000</pubDate>
      <link>https://dev.to/zeromathai/how-probabilistic-reasoning-works-from-evidence-to-better-beliefs-2k19</link>
      <guid>https://dev.to/zeromathai/how-probabilistic-reasoning-works-from-evidence-to-better-beliefs-2k19</guid>
      <description>&lt;p&gt;AI often has to decide without complete information.&lt;/p&gt;

&lt;p&gt;The question is not always “What is true?”&lt;/p&gt;

&lt;p&gt;It is often:&lt;/p&gt;

&lt;p&gt;“What should we believe now that new evidence has arrived?”&lt;/p&gt;

&lt;p&gt;That is the core of probabilistic reasoning.&lt;/p&gt;

&lt;h2&gt;
  
  
  Core Idea
&lt;/h2&gt;

&lt;p&gt;Probabilistic reasoning is a way to make decisions under uncertainty.&lt;/p&gt;

&lt;p&gt;Instead of treating answers as simply true or false, it assigns probabilities.&lt;/p&gt;

&lt;p&gt;Then it updates those probabilities when new information appears.&lt;/p&gt;

&lt;p&gt;This makes AI systems more flexible in uncertain environments.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Key Structure
&lt;/h2&gt;

&lt;p&gt;The basic flow looks like this:&lt;/p&gt;

&lt;p&gt;Prior Belief → New Evidence → Updated Belief → Decision&lt;/p&gt;

&lt;p&gt;More compactly:&lt;/p&gt;

&lt;p&gt;Probabilistic Reasoning = uncertainty + evidence + updating&lt;/p&gt;

&lt;p&gt;Bayes’ theorem is the central rule behind this update.&lt;/p&gt;

&lt;p&gt;It explains how prior belief changes after evidence is observed.&lt;/p&gt;

&lt;h2&gt;
  
  
  Implementation View
&lt;/h2&gt;

&lt;p&gt;At a high level, probabilistic reasoning works like this:&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;define possible hypotheses

assign initial probabilities

observe new evidence

update probabilities using the evidence

compare updated beliefs

choose the most reasonable conclusion
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;p&gt;This is why probabilistic reasoning matters in AI.&lt;/p&gt;

&lt;p&gt;Real systems rarely have perfect information.&lt;/p&gt;

&lt;p&gt;They need a way to revise beliefs as evidence changes.&lt;/p&gt;

&lt;h2&gt;
  
  
  Concrete Example
&lt;/h2&gt;

&lt;p&gt;Imagine a medical diagnosis system.&lt;/p&gt;

&lt;p&gt;The possible hypothesis is:&lt;/p&gt;

&lt;p&gt;The patient has a disease.&lt;/p&gt;

&lt;p&gt;At first, the system has only a prior probability.&lt;/p&gt;

&lt;p&gt;Then new evidence appears:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;fever&lt;/li&gt;
&lt;li&gt;cough&lt;/li&gt;
&lt;li&gt;test result&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Each piece of evidence changes the probability.&lt;/p&gt;

&lt;p&gt;The system does not simply say “disease” or “no disease” immediately.&lt;/p&gt;

&lt;p&gt;It updates its belief step by step.&lt;/p&gt;

&lt;p&gt;That is probabilistic reasoning.&lt;/p&gt;

&lt;h2&gt;
  
  
  Probability vs Conditional Probability
&lt;/h2&gt;

&lt;p&gt;Probability gives a basic likelihood.&lt;/p&gt;

&lt;p&gt;Conditional probability changes that likelihood when information is known.&lt;/p&gt;

&lt;p&gt;Probability asks:&lt;/p&gt;

&lt;p&gt;How likely is A?&lt;/p&gt;

&lt;p&gt;Conditional probability asks:&lt;/p&gt;

&lt;p&gt;How likely is A given B?&lt;/p&gt;

&lt;p&gt;Written as:&lt;/p&gt;

&lt;p&gt;P(A | B)&lt;/p&gt;

&lt;p&gt;This distinction matters because real-world reasoning depends on context.&lt;/p&gt;

&lt;p&gt;The probability of a disease may be low in general.&lt;/p&gt;

&lt;p&gt;But the probability changes if symptoms or test results are observed.&lt;/p&gt;

&lt;h2&gt;
  
  
  Bayes’ Theorem
&lt;/h2&gt;

&lt;p&gt;Bayes’ theorem is the core mechanism for belief updating.&lt;/p&gt;

&lt;p&gt;In simple language:&lt;/p&gt;

&lt;p&gt;prior belief + evidence = updated belief&lt;/p&gt;

&lt;p&gt;More formally:&lt;/p&gt;

&lt;p&gt;Posterior = Prior × Likelihood / Evidence&lt;/p&gt;

&lt;p&gt;The important idea is not just the formula.&lt;/p&gt;

&lt;p&gt;The important idea is revision.&lt;/p&gt;

&lt;p&gt;You start with a belief.&lt;/p&gt;

&lt;p&gt;Then evidence changes it.&lt;/p&gt;

&lt;p&gt;That is the foundation of Bayesian reasoning.&lt;/p&gt;

&lt;h2&gt;
  
  
  Before vs After Evidence
&lt;/h2&gt;

&lt;p&gt;This is the most important comparison.&lt;/p&gt;

&lt;p&gt;Before evidence:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;the system only has a prior belief&lt;/li&gt;
&lt;li&gt;the conclusion is uncertain&lt;/li&gt;
&lt;li&gt;many hypotheses may still be plausible&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;After evidence:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;probabilities are updated&lt;/li&gt;
&lt;li&gt;some hypotheses become more likely&lt;/li&gt;
&lt;li&gt;some hypotheses become less likely&lt;/li&gt;
&lt;li&gt;the system can make a better decision&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So probabilistic reasoning is not static.&lt;/p&gt;

&lt;p&gt;It is adaptive.&lt;/p&gt;

&lt;h2&gt;
  
  
  From Reasoning to Systems
&lt;/h2&gt;

&lt;p&gt;A probabilistic reasoning system connects models and inference procedures.&lt;/p&gt;

&lt;p&gt;It may include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;probability theory&lt;/li&gt;
&lt;li&gt;conditional probability&lt;/li&gt;
&lt;li&gt;Bayes’ theorem&lt;/li&gt;
&lt;li&gt;Bayesian networks&lt;/li&gt;
&lt;li&gt;Markov networks&lt;/li&gt;
&lt;li&gt;conditional probability tables&lt;/li&gt;
&lt;li&gt;inference algorithms&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The goal is not only to calculate probabilities.&lt;/p&gt;

&lt;p&gt;The goal is to support decisions under uncertainty.&lt;/p&gt;

&lt;p&gt;That is why these ideas matter in AI architecture.&lt;/p&gt;

&lt;h2&gt;
  
  
  Graphical Models
&lt;/h2&gt;

&lt;p&gt;When many variables interact, plain probability formulas become hard to manage.&lt;/p&gt;

&lt;p&gt;Graphical models help by representing relationships as structure.&lt;/p&gt;

&lt;p&gt;Bayesian Networks use directed edges.&lt;/p&gt;

&lt;p&gt;Markov Networks use undirected edges.&lt;/p&gt;

&lt;p&gt;Conditional Probability Tables store probability values for different conditions.&lt;/p&gt;

&lt;p&gt;This makes probabilistic reasoning more scalable.&lt;/p&gt;

&lt;p&gt;Instead of reasoning over isolated formulas, the system reasons over a structured model.&lt;/p&gt;

&lt;h2&gt;
  
  
  Recommended Learning Order
&lt;/h2&gt;

&lt;p&gt;If probabilistic reasoning feels abstract, learn it in this order:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Probability Theory&lt;/li&gt;
&lt;li&gt;Conditional Probability&lt;/li&gt;
&lt;li&gt;Bayes’ Theorem&lt;/li&gt;
&lt;li&gt;Probabilistic Reasoning&lt;/li&gt;
&lt;li&gt;Probabilistic Reasoning Systems&lt;/li&gt;
&lt;li&gt;Bayesian Network&lt;/li&gt;
&lt;li&gt;Conditional Probability Table&lt;/li&gt;
&lt;li&gt;Markov Network&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This order works because you first understand uncertainty.&lt;/p&gt;

&lt;p&gt;Then you understand updating.&lt;/p&gt;

&lt;p&gt;Then you connect the idea to structured AI systems.&lt;/p&gt;

&lt;h2&gt;
  
  
  Takeaway
&lt;/h2&gt;

&lt;p&gt;Probabilistic reasoning is how AI handles uncertainty.&lt;/p&gt;

&lt;p&gt;The shortest version is:&lt;/p&gt;

&lt;p&gt;Probabilistic Reasoning = prior belief + evidence + update&lt;/p&gt;

&lt;p&gt;Probability represents uncertainty.&lt;/p&gt;

&lt;p&gt;Conditional probability adds context.&lt;/p&gt;

&lt;p&gt;Bayes’ theorem updates belief.&lt;/p&gt;

&lt;p&gt;Graphical models make the structure scalable.&lt;/p&gt;

&lt;p&gt;If you remember one idea, remember this:&lt;/p&gt;

&lt;p&gt;Probabilistic reasoning lets AI revise what it believes when new evidence appears.&lt;/p&gt;

&lt;h2&gt;
  
  
  Discussion
&lt;/h2&gt;

&lt;p&gt;When building AI systems under uncertainty, do you prefer starting from Bayes’ theorem directly, or from concrete examples like diagnosis, ranking, or risk prediction?&lt;/p&gt;

&lt;p&gt;Originally published at zeromathai.com.&lt;br&gt;
Original article: &lt;a href="https://zeromathai.com/en/probabilistic-reasoning-hub-en/" rel="noopener noreferrer"&gt;https://zeromathai.com/en/probabilistic-reasoning-hub-en/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;GitHub Resources&lt;br&gt;
AI diagrams, study notes, and visual guides:&lt;br&gt;
&lt;a href="https://github.com/zeromathai/zeromathai-ai" rel="noopener noreferrer"&gt;https://github.com/zeromathai/zeromathai-ai&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>probability</category>
      <category>machinelearning</category>
      <category>datascience</category>
    </item>
    <item>
      <title>How Probabilistic Graphical Models Represent Uncertainty</title>
      <dc:creator>zeromathai</dc:creator>
      <pubDate>Thu, 14 May 2026 23:20:49 +0000</pubDate>
      <link>https://dev.to/zeromathai/how-probabilistic-graphical-models-represent-uncertainty-5gcd</link>
      <guid>https://dev.to/zeromathai/how-probabilistic-graphical-models-represent-uncertainty-5gcd</guid>
      <description>&lt;p&gt;Probability can become hard to reason about when many variables interact.&lt;/p&gt;

&lt;p&gt;One variable affects another.&lt;/p&gt;

&lt;p&gt;Evidence changes belief.&lt;/p&gt;

&lt;p&gt;Dependencies start to form a network.&lt;/p&gt;

&lt;p&gt;That is where Probabilistic Graphical Models become useful.&lt;/p&gt;

&lt;h2&gt;
  
  
  Core Idea
&lt;/h2&gt;

&lt;p&gt;A Probabilistic Graphical Model represents uncertainty with a graph.&lt;/p&gt;

&lt;p&gt;The nodes are random variables.&lt;/p&gt;

&lt;p&gt;The edges represent relationships between them.&lt;/p&gt;

&lt;p&gt;Instead of treating probability as a flat list of formulas, a PGM gives it structure.&lt;/p&gt;

&lt;p&gt;That structure makes complex uncertainty easier to reason about.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Key Structure
&lt;/h2&gt;

&lt;p&gt;A simple PGM view looks like this:&lt;/p&gt;

&lt;p&gt;Random Variables → Graph Structure → Probability Values → Inference&lt;/p&gt;

&lt;p&gt;More compactly:&lt;/p&gt;

&lt;p&gt;PGM = graph + probability + inference&lt;/p&gt;

&lt;p&gt;The graph shows how variables are connected.&lt;/p&gt;

&lt;p&gt;The probability values define how likely different states are.&lt;/p&gt;

&lt;p&gt;Inference uses both to answer questions under uncertainty.&lt;/p&gt;

&lt;h2&gt;
  
  
  Implementation View
&lt;/h2&gt;

&lt;p&gt;At a high level, building a PGM looks like this:&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;define the random variables

decide which variables depend on each other

choose a graph structure

assign probability values

observe evidence

run inference

update beliefs
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;p&gt;This is why PGMs matter in AI.&lt;/p&gt;

&lt;p&gt;They do not only store probabilities.&lt;/p&gt;

&lt;p&gt;They give the system a way to reason when information is incomplete.&lt;/p&gt;

&lt;h2&gt;
  
  
  Concrete Example
&lt;/h2&gt;

&lt;p&gt;Imagine a simple diagnosis system.&lt;/p&gt;

&lt;p&gt;You may have variables like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Disease&lt;/li&gt;
&lt;li&gt;Fever&lt;/li&gt;
&lt;li&gt;Cough&lt;/li&gt;
&lt;li&gt;Test Result&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;These variables are not independent.&lt;/p&gt;

&lt;p&gt;Disease can affect Fever.&lt;/p&gt;

&lt;p&gt;Disease can affect Cough.&lt;/p&gt;

&lt;p&gt;Disease can affect Test Result.&lt;/p&gt;

&lt;p&gt;A PGM represents these relationships explicitly.&lt;/p&gt;

&lt;p&gt;Then, when new evidence appears, the model can update beliefs.&lt;/p&gt;

&lt;p&gt;For example:&lt;/p&gt;

&lt;p&gt;If Fever is observed, how does the probability of Disease change?&lt;/p&gt;

&lt;p&gt;That is probabilistic reasoning.&lt;/p&gt;

&lt;h2&gt;
  
  
  Bayesian Network vs Markov Network
&lt;/h2&gt;

&lt;p&gt;PGMs split into different model families.&lt;/p&gt;

&lt;p&gt;The most important comparison is Bayesian Networks vs Markov Networks.&lt;/p&gt;

&lt;p&gt;Bayesian Network:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;uses directed edges&lt;/li&gt;
&lt;li&gt;represents dependency direction&lt;/li&gt;
&lt;li&gt;often fits causal-style reasoning&lt;/li&gt;
&lt;li&gt;commonly uses conditional probability tables&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Markov Network:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;uses undirected edges&lt;/li&gt;
&lt;li&gt;represents mutual relationships&lt;/li&gt;
&lt;li&gt;focuses on association rather than direction&lt;/li&gt;
&lt;li&gt;is useful when relationships are symmetric&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So the model choice depends on the relationship type.&lt;/p&gt;

&lt;p&gt;If direction matters, use a Bayesian Network.&lt;/p&gt;

&lt;p&gt;If direction does not matter, a Markov Network may fit better.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Conditional Probability Matters
&lt;/h2&gt;

&lt;p&gt;Conditional probability is the foundation of many PGMs.&lt;/p&gt;

&lt;p&gt;It answers questions like:&lt;/p&gt;

&lt;p&gt;What is the probability of A given B?&lt;/p&gt;

&lt;p&gt;Written as:&lt;/p&gt;

&lt;p&gt;P(A | B)&lt;/p&gt;

&lt;p&gt;This matters because uncertainty is rarely isolated.&lt;/p&gt;

&lt;p&gt;We usually care about how one variable changes when another is known.&lt;/p&gt;

&lt;p&gt;That is exactly what PGMs organize.&lt;/p&gt;

&lt;h2&gt;
  
  
  From Graph to Computation
&lt;/h2&gt;

&lt;p&gt;A graph alone is not enough.&lt;/p&gt;

&lt;p&gt;You also need probability values.&lt;/p&gt;

&lt;p&gt;In Bayesian Networks, this often means using Conditional Probability Tables.&lt;/p&gt;

&lt;p&gt;A CPT defines how likely a variable is under different parent conditions.&lt;/p&gt;

&lt;p&gt;For example:&lt;/p&gt;

&lt;p&gt;How likely is Fever if Disease is true?&lt;/p&gt;

&lt;p&gt;How likely is Fever if Disease is false?&lt;/p&gt;

&lt;p&gt;The graph gives the dependency structure.&lt;/p&gt;

&lt;p&gt;The CPT gives the numbers.&lt;/p&gt;

&lt;p&gt;Together, they make the model computable.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Inference Is the Goal
&lt;/h2&gt;

&lt;p&gt;A PGM is not useful just because it looks structured.&lt;/p&gt;

&lt;p&gt;Its real purpose is inference.&lt;/p&gt;

&lt;p&gt;Inference means answering questions such as:&lt;/p&gt;

&lt;p&gt;What is likely true given the evidence?&lt;/p&gt;

&lt;p&gt;How should belief change after a new observation?&lt;/p&gt;

&lt;p&gt;Which hidden variable best explains what we see?&lt;/p&gt;

&lt;p&gt;This is why PGMs are important for uncertainty-aware AI.&lt;/p&gt;

&lt;p&gt;They connect structure, probability, and reasoning.&lt;/p&gt;

&lt;h2&gt;
  
  
  PGM vs Flat Probability Tables
&lt;/h2&gt;

&lt;p&gt;Without graphical structure, probability models can become huge.&lt;/p&gt;

&lt;p&gt;Every variable combination may need to be represented directly.&lt;/p&gt;

&lt;p&gt;That quickly becomes impractical.&lt;/p&gt;

&lt;p&gt;A PGM helps by using structure.&lt;/p&gt;

&lt;p&gt;Flat probability table:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;stores many combinations directly&lt;/li&gt;
&lt;li&gt;becomes large quickly&lt;/li&gt;
&lt;li&gt;is hard to interpret&lt;/li&gt;
&lt;li&gt;does not expose dependency structure clearly&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Probabilistic Graphical Model:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;separates variables and dependencies&lt;/li&gt;
&lt;li&gt;makes relationships visible&lt;/li&gt;
&lt;li&gt;can reduce unnecessary complexity&lt;/li&gt;
&lt;li&gt;supports structured inference&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is the practical reason PGMs exist.&lt;/p&gt;

&lt;p&gt;They make uncertainty manageable.&lt;/p&gt;

&lt;h2&gt;
  
  
  Recommended Learning Order
&lt;/h2&gt;

&lt;p&gt;If PGMs feel abstract, learn them in this order:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Conditional Probability&lt;/li&gt;
&lt;li&gt;Probabilistic Graphical Model&lt;/li&gt;
&lt;li&gt;Bayesian Network&lt;/li&gt;
&lt;li&gt;Markov Network&lt;/li&gt;
&lt;li&gt;Conditional Probability Table&lt;/li&gt;
&lt;li&gt;Bayes' Theorem&lt;/li&gt;
&lt;li&gt;Probabilistic Inference&lt;/li&gt;
&lt;li&gt;Probabilistic Reasoning Systems&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This order works because you first understand probability relationships.&lt;/p&gt;

&lt;p&gt;Then you understand graph structure.&lt;/p&gt;

&lt;p&gt;Then you learn how inference works on top of that structure.&lt;/p&gt;

&lt;h2&gt;
  
  
  Takeaway
&lt;/h2&gt;

&lt;p&gt;Probabilistic Graphical Models turn uncertainty into structure.&lt;/p&gt;

&lt;p&gt;The shortest version is:&lt;/p&gt;

&lt;p&gt;PGM = random variables + graph structure + probability values + inference&lt;/p&gt;

&lt;p&gt;Bayesian Networks model directed relationships.&lt;/p&gt;

&lt;p&gt;Markov Networks model undirected relationships.&lt;/p&gt;

&lt;p&gt;CPTs turn graph structure into computable probability.&lt;/p&gt;

&lt;p&gt;Inference turns the model into a reasoning system.&lt;/p&gt;

&lt;p&gt;If you remember one idea, remember this:&lt;/p&gt;

&lt;p&gt;A PGM helps AI reason under uncertainty by making relationships between variables explicit.&lt;/p&gt;

&lt;h2&gt;
  
  
  Discussion
&lt;/h2&gt;

&lt;p&gt;When modeling uncertainty, do you find directed Bayesian Networks easier to reason about, or undirected Markov Networks?&lt;/p&gt;

&lt;p&gt;Originally published at zeromathai.com.&lt;br&gt;
Original article: &lt;a href="https://zeromathai.com/en/probabilistic-graphical-model-hub-en/" rel="noopener noreferrer"&gt;https://zeromathai.com/en/probabilistic-graphical-model-hub-en/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;GitHub Resources&lt;br&gt;
AI diagrams, study notes, and visual guides:&lt;br&gt;
&lt;a href="https://github.com/zeromathai/zeromathai-ai" rel="noopener noreferrer"&gt;https://github.com/zeromathai/zeromathai-ai&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>probability</category>
      <category>machinelearning</category>
      <category>datascience</category>
    </item>
    <item>
      <title>How Optimization Search Works — From Hill Climbing to Genetic Algorithms</title>
      <dc:creator>zeromathai</dc:creator>
      <pubDate>Thu, 14 May 2026 00:01:31 +0000</pubDate>
      <link>https://dev.to/zeromathai/how-optimization-search-works-from-hill-climbing-to-genetic-algorithms-5876</link>
      <guid>https://dev.to/zeromathai/how-optimization-search-works-from-hill-climbing-to-genetic-algorithms-5876</guid>
      <description>&lt;p&gt;Optimization is not just about finding a good answer.&lt;/p&gt;

&lt;p&gt;It is about finding a better answer than the one you have now.&lt;/p&gt;

&lt;p&gt;The hard part is knowing whether your current “best” is truly best, or just locally good.&lt;/p&gt;

&lt;h2&gt;
  
  
  Core Idea
&lt;/h2&gt;

&lt;p&gt;Optimization is the process of improving a solution according to some objective.&lt;/p&gt;

&lt;p&gt;You start with a candidate solution.&lt;/p&gt;

&lt;p&gt;You measure how good it is.&lt;/p&gt;

&lt;p&gt;Then you search for a better one.&lt;/p&gt;

&lt;p&gt;That sounds simple.&lt;/p&gt;

&lt;p&gt;But the search space can be huge.&lt;/p&gt;

&lt;p&gt;And the best nearby solution may not be the best overall solution.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Key Structure
&lt;/h2&gt;

&lt;p&gt;A simple optimization loop looks like this:&lt;/p&gt;

&lt;p&gt;Candidate Solution → Evaluate Score → Explore Neighbor → Accept or Reject → Repeat&lt;/p&gt;

&lt;p&gt;More compactly:&lt;/p&gt;

&lt;p&gt;Optimization = search space + objective function + update strategy&lt;/p&gt;

&lt;p&gt;The objective function tells you what “better” means.&lt;/p&gt;

&lt;p&gt;The update strategy tells you how to move.&lt;/p&gt;

&lt;h2&gt;
  
  
  Implementation View
&lt;/h2&gt;

&lt;p&gt;At a high level, optimization search works like this:&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;start with an initial solution

evaluate its score

while stopping condition is not met:
    generate candidate solutions

    compare scores

    choose whether to move

    update the current solution

return the best solution found
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;p&gt;This structure appears in many algorithms.&lt;/p&gt;

&lt;p&gt;Hill Climbing.&lt;/p&gt;

&lt;p&gt;Simulated Annealing.&lt;/p&gt;

&lt;p&gt;Genetic Algorithms.&lt;/p&gt;

&lt;p&gt;Even mathematical optimization methods follow the same broad idea:&lt;/p&gt;

&lt;p&gt;evaluate, improve, repeat.&lt;/p&gt;

&lt;h2&gt;
  
  
  Concrete Example
&lt;/h2&gt;

&lt;p&gt;Imagine tuning a route, schedule, or model setting.&lt;/p&gt;

&lt;p&gt;You start with one possible solution.&lt;/p&gt;

&lt;p&gt;Then you slightly modify it.&lt;/p&gt;

&lt;p&gt;If the new solution improves the score, you keep it.&lt;/p&gt;

&lt;p&gt;If not, you reject it.&lt;/p&gt;

&lt;p&gt;That is the basic intuition behind Hill Climbing.&lt;/p&gt;

&lt;p&gt;But there is a problem.&lt;/p&gt;

&lt;p&gt;What if the current solution is better than all nearby options, but not the best possible solution overall?&lt;/p&gt;

&lt;p&gt;That is the local optimum problem.&lt;/p&gt;

&lt;h2&gt;
  
  
  Local Optimum vs Global Optimum
&lt;/h2&gt;

&lt;p&gt;This is the first concept to understand.&lt;/p&gt;

&lt;p&gt;A local optimum is the best solution in its nearby region.&lt;/p&gt;

&lt;p&gt;A global optimum is the best solution across the entire search space.&lt;/p&gt;

&lt;p&gt;Local optimum:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;looks best from the current neighborhood&lt;/li&gt;
&lt;li&gt;can trap simple algorithms&lt;/li&gt;
&lt;li&gt;may not be the true best answer&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Global optimum:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;is the best overall solution&lt;/li&gt;
&lt;li&gt;may be far from the current point&lt;/li&gt;
&lt;li&gt;is often harder to find&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If you do not understand this distinction, optimization algorithms feel like random strategy names.&lt;/p&gt;

&lt;h2&gt;
  
  
  Hill Climbing
&lt;/h2&gt;

&lt;p&gt;Hill Climbing is the most intuitive optimization strategy.&lt;/p&gt;

&lt;p&gt;It repeatedly moves to a better neighboring solution.&lt;/p&gt;

&lt;p&gt;The idea is simple:&lt;/p&gt;

&lt;p&gt;If the next move improves the score, take it.&lt;/p&gt;

&lt;p&gt;If not, stop or try another neighbor.&lt;/p&gt;

&lt;p&gt;Hill Climbing is easy to understand and implement.&lt;/p&gt;

&lt;p&gt;But it has a major weakness.&lt;/p&gt;

&lt;p&gt;It can get stuck at a local optimum.&lt;/p&gt;

&lt;p&gt;That is why more advanced strategies exist.&lt;/p&gt;

&lt;h2&gt;
  
  
  Hill Climbing vs Simulated Annealing
&lt;/h2&gt;

&lt;p&gt;Simulated Annealing extends Hill Climbing.&lt;/p&gt;

&lt;p&gt;Hill Climbing usually accepts only better moves.&lt;/p&gt;

&lt;p&gt;Simulated Annealing sometimes accepts worse moves.&lt;/p&gt;

&lt;p&gt;That sounds strange at first.&lt;/p&gt;

&lt;p&gt;But it helps the algorithm escape local optima.&lt;/p&gt;

&lt;p&gt;Hill Climbing:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;simple and greedy&lt;/li&gt;
&lt;li&gt;improves step by step&lt;/li&gt;
&lt;li&gt;can get stuck locally&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Simulated Annealing:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;allows occasional worse moves&lt;/li&gt;
&lt;li&gt;explores more widely&lt;/li&gt;
&lt;li&gt;gradually becomes more selective&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So the key difference is exploration.&lt;/p&gt;

&lt;p&gt;Hill Climbing exploits.&lt;/p&gt;

&lt;p&gt;Simulated Annealing explores before settling.&lt;/p&gt;

&lt;h2&gt;
  
  
  Genetic Algorithm
&lt;/h2&gt;

&lt;p&gt;Genetic Algorithms take a different approach.&lt;/p&gt;

&lt;p&gt;Instead of improving one solution, they work with a population of solutions.&lt;/p&gt;

&lt;p&gt;The algorithm repeatedly selects, mixes, and mutates candidates.&lt;/p&gt;

&lt;p&gt;A simple view:&lt;/p&gt;

&lt;p&gt;Population → Selection → Crossover → Mutation → New Population&lt;/p&gt;

&lt;p&gt;This is useful when the search space is complex.&lt;/p&gt;

&lt;p&gt;Instead of betting on one path, the algorithm explores multiple directions at once.&lt;/p&gt;

&lt;p&gt;That makes Genetic Algorithms very different from Hill Climbing.&lt;/p&gt;

&lt;p&gt;Hill Climbing improves one candidate.&lt;/p&gt;

&lt;p&gt;Genetic Algorithms evolve many candidates.&lt;/p&gt;

&lt;h2&gt;
  
  
  Newton-Raphson Method
&lt;/h2&gt;

&lt;p&gt;Newton-Raphson is more mathematical.&lt;/p&gt;

&lt;p&gt;It uses information about the function’s slope and curvature.&lt;/p&gt;

&lt;p&gt;When the function is smooth and the structure is known, this can move quickly toward a solution.&lt;/p&gt;

&lt;p&gt;But it depends more heavily on mathematical assumptions.&lt;/p&gt;

&lt;p&gt;Compared with search-based methods, Newton-Raphson is less like exploring a landscape blindly.&lt;/p&gt;

&lt;p&gt;It is more like using the shape of the landscape to jump efficiently.&lt;/p&gt;

&lt;h2&gt;
  
  
  Search-Based vs Mathematical Optimization
&lt;/h2&gt;

&lt;p&gt;This comparison helps organize the field.&lt;/p&gt;

&lt;p&gt;Search-based optimization:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;explores candidate solutions&lt;/li&gt;
&lt;li&gt;works even when the landscape is messy&lt;/li&gt;
&lt;li&gt;can be intuitive and flexible&lt;/li&gt;
&lt;li&gt;may require many evaluations&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Mathematical optimization:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;uses structure like gradients or curvature&lt;/li&gt;
&lt;li&gt;can converge quickly when assumptions hold&lt;/li&gt;
&lt;li&gt;may fail or become unstable when assumptions are weak&lt;/li&gt;
&lt;li&gt;often needs a smoother objective&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So the right method depends on the problem.&lt;/p&gt;

&lt;p&gt;If you know the structure, use it.&lt;/p&gt;

&lt;p&gt;If you do not, search.&lt;/p&gt;

&lt;h2&gt;
  
  
  Recommended Learning Order
&lt;/h2&gt;

&lt;p&gt;If optimization strategies feel scattered, learn them in this order:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Local Optimum vs Global Optimum&lt;/li&gt;
&lt;li&gt;Hill Climbing&lt;/li&gt;
&lt;li&gt;Simulated Annealing&lt;/li&gt;
&lt;li&gt;Genetic Algorithm&lt;/li&gt;
&lt;li&gt;Newton-Raphson Method&lt;/li&gt;
&lt;li&gt;Heuristic Search&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This order works because you first understand the problem.&lt;/p&gt;

&lt;p&gt;Then you learn the simplest strategy.&lt;/p&gt;

&lt;p&gt;Then you learn how algorithms escape limitations.&lt;/p&gt;

&lt;p&gt;Then you compare broader approaches.&lt;/p&gt;

&lt;h2&gt;
  
  
  Takeaway
&lt;/h2&gt;

&lt;p&gt;Optimization is search under an objective.&lt;/p&gt;

&lt;p&gt;The shortest version is:&lt;/p&gt;

&lt;p&gt;Optimization = find better solutions according to a score&lt;/p&gt;

&lt;p&gt;Hill Climbing moves toward local improvement.&lt;/p&gt;

&lt;p&gt;Simulated Annealing sometimes accepts worse moves to escape traps.&lt;/p&gt;

&lt;p&gt;Genetic Algorithms explore with many candidates.&lt;/p&gt;

&lt;p&gt;Newton-Raphson uses mathematical structure to move faster.&lt;/p&gt;

&lt;p&gt;If you remember one idea, remember this:&lt;/p&gt;

&lt;p&gt;Every optimization method is a trade-off between exploitation, exploration, and assumptions about the problem.&lt;/p&gt;

&lt;h2&gt;
  
  
  Discussion
&lt;/h2&gt;

&lt;p&gt;When solving optimization problems, do you usually start with a simple greedy method like Hill Climbing, or jump directly to broader strategies like Simulated Annealing or Genetic Algorithms?&lt;/p&gt;

&lt;p&gt;Originally published at zeromathai.com.&lt;br&gt;
Original article: &lt;a href="https://zeromathai.com/en/optimization-and-search-strategies-hub-en/" rel="noopener noreferrer"&gt;https://zeromathai.com/en/optimization-and-search-strategies-hub-en/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;GitHub Resources&lt;br&gt;
AI diagrams, study notes, and visual guides:&lt;br&gt;
&lt;a href="https://github.com/zeromathai/zeromathai-ai" rel="noopener noreferrer"&gt;https://github.com/zeromathai/zeromathai-ai&lt;/a&gt;&lt;/p&gt;

</description>
      <category>algorithms</category>
      <category>ai</category>
      <category>optimization</category>
      <category>programming</category>
    </item>
    <item>
      <title>How Neural Networks Work — From Perceptrons to Backpropagation</title>
      <dc:creator>zeromathai</dc:creator>
      <pubDate>Thu, 14 May 2026 00:00:57 +0000</pubDate>
      <link>https://dev.to/zeromathai/how-neural-networks-work-from-perceptrons-to-backpropagation-1d6d</link>
      <guid>https://dev.to/zeromathai/how-neural-networks-work-from-perceptrons-to-backpropagation-1d6d</guid>
      <description>&lt;p&gt;A neural network is not magic.&lt;/p&gt;

&lt;p&gt;It is a system that transforms input into output through layers.&lt;/p&gt;

&lt;p&gt;The real question is:&lt;/p&gt;

&lt;p&gt;How does that system learn from mistakes?&lt;/p&gt;

&lt;h2&gt;
  
  
  Core Idea
&lt;/h2&gt;

&lt;p&gt;A neural network takes data, passes it through connected layers, and produces an output.&lt;/p&gt;

&lt;p&gt;During training, it adjusts internal values so future outputs become better.&lt;/p&gt;

&lt;p&gt;Those internal values are mainly weights and biases.&lt;/p&gt;

&lt;p&gt;So the simplest view is:&lt;/p&gt;

&lt;p&gt;Input → Layers → Output → Error → Update&lt;/p&gt;

&lt;p&gt;That loop is the foundation of neural network learning.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Key Structure
&lt;/h2&gt;

&lt;p&gt;A basic neural network looks like this:&lt;/p&gt;

&lt;p&gt;Input Layer → Hidden Layers → Output Layer&lt;/p&gt;

&lt;p&gt;Each layer transforms the data.&lt;/p&gt;

&lt;p&gt;A neuron usually computes:&lt;/p&gt;

&lt;p&gt;z = w · x + b&lt;/p&gt;

&lt;p&gt;Then an activation function transforms it:&lt;/p&gt;

&lt;p&gt;a = activation(z)&lt;/p&gt;

&lt;p&gt;Where:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;x = input&lt;/li&gt;
&lt;li&gt;w = weight&lt;/li&gt;
&lt;li&gt;b = bias&lt;/li&gt;
&lt;li&gt;z = raw score&lt;/li&gt;
&lt;li&gt;a = activated output&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is the basic building block.&lt;/p&gt;

&lt;h2&gt;
  
  
  Implementation View
&lt;/h2&gt;

&lt;p&gt;At a high level, training works like this:&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;take input data

run forward propagation

compute prediction

compare prediction with target

compute loss

run backpropagation

update weights and biases

repeat
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;p&gt;This is why neural networks are trainable.&lt;/p&gt;

&lt;p&gt;They do not just compute outputs.&lt;/p&gt;

&lt;p&gt;They use errors to adjust themselves.&lt;/p&gt;

&lt;h2&gt;
  
  
  Concrete Example
&lt;/h2&gt;

&lt;p&gt;Imagine a small model that predicts whether an email is spam.&lt;/p&gt;

&lt;p&gt;The input features might include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;suspicious words&lt;/li&gt;
&lt;li&gt;sender reputation&lt;/li&gt;
&lt;li&gt;number of links&lt;/li&gt;
&lt;li&gt;message length&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The network processes those inputs.&lt;/p&gt;

&lt;p&gt;It predicts:&lt;/p&gt;

&lt;p&gt;spam probability = 0.82&lt;/p&gt;

&lt;p&gt;If the real label is spam, the prediction is good.&lt;/p&gt;

&lt;p&gt;If the real label is not spam, the network needs to adjust.&lt;/p&gt;

&lt;p&gt;Backpropagation tells the model which weights contributed to the mistake.&lt;/p&gt;

&lt;p&gt;Then training updates those weights.&lt;/p&gt;

&lt;h2&gt;
  
  
  Perceptron vs MLP
&lt;/h2&gt;

&lt;p&gt;The perceptron is the simplest starting point.&lt;/p&gt;

&lt;p&gt;It takes inputs, applies weights, adds a bias, and produces an output.&lt;/p&gt;

&lt;p&gt;But a single perceptron is limited.&lt;/p&gt;

&lt;p&gt;It can only represent simple decision boundaries.&lt;/p&gt;

&lt;p&gt;A Multi-Layer Perceptron expands this idea.&lt;/p&gt;

&lt;p&gt;It stacks multiple layers.&lt;/p&gt;

&lt;p&gt;Perceptron:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;one basic computational unit&lt;/li&gt;
&lt;li&gt;simple input-output mapping&lt;/li&gt;
&lt;li&gt;limited representational power&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;MLP:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;multiple connected layers&lt;/li&gt;
&lt;li&gt;hidden representations&lt;/li&gt;
&lt;li&gt;more expressive decision boundaries&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So the MLP is where neural networks become more useful.&lt;/p&gt;

&lt;h2&gt;
  
  
  Forward Propagation vs Backpropagation
&lt;/h2&gt;

&lt;p&gt;This comparison is the core of learning.&lt;/p&gt;

&lt;p&gt;Forward propagation computes the prediction.&lt;/p&gt;

&lt;p&gt;Backpropagation computes how to improve it.&lt;/p&gt;

&lt;p&gt;Forward propagation:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;moves from input to output&lt;/li&gt;
&lt;li&gt;calculates activations&lt;/li&gt;
&lt;li&gt;produces prediction&lt;/li&gt;
&lt;li&gt;computes loss&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Backpropagation:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;moves from loss backward&lt;/li&gt;
&lt;li&gt;computes gradients&lt;/li&gt;
&lt;li&gt;assigns error responsibility&lt;/li&gt;
&lt;li&gt;prepares parameter updates&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Forward pass answers:&lt;/p&gt;

&lt;p&gt;“What did the model predict?”&lt;/p&gt;

&lt;p&gt;Backward pass answers:&lt;/p&gt;

&lt;p&gt;“What should change?”&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Activation Functions Matter
&lt;/h2&gt;

&lt;p&gt;Stacking linear layers is not enough.&lt;/p&gt;

&lt;p&gt;Without activation functions, multiple layers can collapse into one linear transformation.&lt;/p&gt;

&lt;p&gt;That means the model cannot represent complex patterns well.&lt;/p&gt;

&lt;p&gt;Activation functions add nonlinearity.&lt;/p&gt;

&lt;p&gt;This lets neural networks learn curves, boundaries, and deeper representations.&lt;/p&gt;

&lt;p&gt;In short:&lt;/p&gt;

&lt;p&gt;Linear layers calculate.&lt;/p&gt;

&lt;p&gt;Activation functions reshape.&lt;/p&gt;

&lt;p&gt;Both are needed.&lt;/p&gt;

&lt;h2&gt;
  
  
  What Actually Changes During Learning?
&lt;/h2&gt;

&lt;p&gt;When a neural network learns, it updates parameters.&lt;/p&gt;

&lt;p&gt;The most important parameters are:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;weights&lt;/li&gt;
&lt;li&gt;biases&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Weights control how strongly inputs affect the output.&lt;/p&gt;

&lt;p&gt;Biases shift the output.&lt;/p&gt;

&lt;p&gt;A simple update idea is:&lt;/p&gt;

&lt;p&gt;new parameter = old parameter - learning rate × gradient&lt;/p&gt;

&lt;p&gt;The gradient tells the direction.&lt;/p&gt;

&lt;p&gt;The learning rate controls the step size.&lt;/p&gt;

&lt;p&gt;That is the practical meaning of learning.&lt;/p&gt;

&lt;h2&gt;
  
  
  From Neural Networks to Deep Learning
&lt;/h2&gt;

&lt;p&gt;A neural network becomes “deep” when it has many layers.&lt;/p&gt;

&lt;p&gt;But depth alone is not enough.&lt;/p&gt;

&lt;p&gt;Deep learning depends on:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;layered representations&lt;/li&gt;
&lt;li&gt;nonlinear activations&lt;/li&gt;
&lt;li&gt;backpropagation&lt;/li&gt;
&lt;li&gt;parameter updates&lt;/li&gt;
&lt;li&gt;enough data and compute&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The deeper the network, the more abstract its internal representations can become.&lt;/p&gt;

&lt;p&gt;For example:&lt;/p&gt;

&lt;p&gt;In image models:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;early layers may detect edges&lt;/li&gt;
&lt;li&gt;middle layers may detect shapes&lt;/li&gt;
&lt;li&gt;deeper layers may detect objects&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;That is why neural networks became the foundation of modern deep learning.&lt;/p&gt;

&lt;h2&gt;
  
  
  Recommended Learning Order
&lt;/h2&gt;

&lt;p&gt;If neural networks feel scattered, learn them in this order:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Perceptron&lt;/li&gt;
&lt;li&gt;Multi-Layer Perceptron&lt;/li&gt;
&lt;li&gt;Neural Network&lt;/li&gt;
&lt;li&gt;Forward Propagation&lt;/li&gt;
&lt;li&gt;Backpropagation&lt;/li&gt;
&lt;li&gt;Activation Function&lt;/li&gt;
&lt;li&gt;Weights and Biases&lt;/li&gt;
&lt;li&gt;Model Parameters&lt;/li&gt;
&lt;li&gt;Deep Learning&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This order works because you first understand the basic unit.&lt;/p&gt;

&lt;p&gt;Then you understand the network.&lt;/p&gt;

&lt;p&gt;Then you understand computation, learning, and deep extensions.&lt;/p&gt;

&lt;h2&gt;
  
  
  Takeaway
&lt;/h2&gt;

&lt;p&gt;A neural network is a trainable system of layered transformations.&lt;/p&gt;

&lt;p&gt;The shortest version is:&lt;/p&gt;

&lt;p&gt;Neural Network = layers + activations + parameters + learning&lt;/p&gt;

&lt;p&gt;Forward propagation makes predictions.&lt;/p&gt;

&lt;p&gt;Backpropagation computes corrections.&lt;/p&gt;

&lt;p&gt;Weights and biases store what the model has learned.&lt;/p&gt;

&lt;p&gt;If you remember one idea, remember this:&lt;/p&gt;

&lt;p&gt;Neural networks learn by repeatedly predicting, measuring error, and updating internal parameters.&lt;/p&gt;

&lt;h2&gt;
  
  
  Discussion
&lt;/h2&gt;

&lt;p&gt;When learning neural networks, do you find it easier to start from the perceptron, or from the full forward/backpropagation training loop?&lt;/p&gt;

&lt;p&gt;Originally published at zeromathai.com.&lt;br&gt;
Original article: &lt;a href="https://zeromathai.com/en/neural-network-hub-en/" rel="noopener noreferrer"&gt;https://zeromathai.com/en/neural-network-hub-en/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;GitHub Resources&lt;br&gt;
AI diagrams, study notes, and visual guides:&lt;br&gt;
&lt;a href="https://github.com/zeromathai/zeromathai-ai" rel="noopener noreferrer"&gt;https://github.com/zeromathai/zeromathai-ai&lt;/a&gt;&lt;/p&gt;

</description>
      <category>machinelearning</category>
      <category>ai</category>
      <category>deeplearning</category>
      <category>neuralnetworks</category>
    </item>
    <item>
      <title>How Classical Machine Learning Works — From Linear Models to Random Forests</title>
      <dc:creator>zeromathai</dc:creator>
      <pubDate>Wed, 13 May 2026 07:19:59 +0000</pubDate>
      <link>https://dev.to/zeromathai/how-classical-machine-learning-works-from-linear-models-to-random-forests-5371</link>
      <guid>https://dev.to/zeromathai/how-classical-machine-learning-works-from-linear-models-to-random-forests-5371</guid>
      <description>&lt;p&gt;Classical machine learning is not outdated.&lt;/p&gt;

&lt;p&gt;It is still one of the best ways to understand how models learn from structured data.&lt;/p&gt;

&lt;p&gt;Before deep learning learns representations automatically, classical ML asks a more explicit question:&lt;/p&gt;

&lt;p&gt;What features should we use, and what model should learn from them?&lt;/p&gt;

&lt;h2&gt;
  
  
  Core Idea
&lt;/h2&gt;

&lt;p&gt;Classical machine learning usually starts with structured data.&lt;/p&gt;

&lt;p&gt;Rows.&lt;/p&gt;

&lt;p&gt;Columns.&lt;/p&gt;

&lt;p&gt;Features.&lt;/p&gt;

&lt;p&gt;Labels.&lt;/p&gt;

&lt;p&gt;The model learns a relationship between input features and target outputs.&lt;/p&gt;

&lt;p&gt;This makes classical ML especially useful when interpretability, tabular data, and clear model behavior matter.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Key Structure
&lt;/h2&gt;

&lt;p&gt;A simple classical ML workflow looks like this:&lt;/p&gt;

&lt;p&gt;Data → Features → Model → Prediction → Evaluation → Improvement&lt;/p&gt;

&lt;p&gt;More compactly:&lt;/p&gt;

&lt;p&gt;Classical ML = feature engineering + model learning + generalization&lt;/p&gt;

&lt;p&gt;The model does not magically understand raw data.&lt;/p&gt;

&lt;p&gt;It depends heavily on the quality of the features.&lt;/p&gt;

&lt;p&gt;That is why feature design is so important.&lt;/p&gt;

&lt;h2&gt;
  
  
  Implementation View
&lt;/h2&gt;

&lt;p&gt;At a high level, a classical ML workflow looks like this:&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;collect structured data

clean and prepare features

split data into train and test sets

choose a model

train the model

evaluate generalization

tune complexity if needed
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;p&gt;This is why classical ML is still practical.&lt;/p&gt;

&lt;p&gt;You can often inspect the pipeline clearly.&lt;/p&gt;

&lt;p&gt;You can compare models quickly.&lt;/p&gt;

&lt;p&gt;You can understand why a model works or fails.&lt;/p&gt;

&lt;h2&gt;
  
  
  Concrete Example
&lt;/h2&gt;

&lt;p&gt;Imagine predicting whether a customer will churn.&lt;/p&gt;

&lt;p&gt;Your features might include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;number of logins&lt;/li&gt;
&lt;li&gt;subscription length&lt;/li&gt;
&lt;li&gt;support tickets&lt;/li&gt;
&lt;li&gt;recent activity&lt;/li&gt;
&lt;li&gt;payment history&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;A classical ML model learns how those features relate to churn.&lt;/p&gt;

&lt;p&gt;Logistic Regression may give a simple probability.&lt;/p&gt;

&lt;p&gt;SVM may focus on a cleaner boundary.&lt;/p&gt;

&lt;p&gt;Random Forest may capture more complex feature interactions.&lt;/p&gt;

&lt;p&gt;Same data.&lt;/p&gt;

&lt;p&gt;Different model assumptions.&lt;/p&gt;

&lt;h2&gt;
  
  
  Linear Models as the Starting Point
&lt;/h2&gt;

&lt;p&gt;Linear models are the foundation.&lt;/p&gt;

&lt;p&gt;They assume the prediction can be built from a weighted combination of features.&lt;/p&gt;

&lt;p&gt;Conceptually:&lt;/p&gt;

&lt;p&gt;prediction = weight1 × feature1 + weight2 × feature2 + ... + bias&lt;/p&gt;

&lt;p&gt;This structure is simple.&lt;/p&gt;

&lt;p&gt;But it teaches the most important idea:&lt;/p&gt;

&lt;p&gt;Each feature contributes to the prediction.&lt;/p&gt;

&lt;p&gt;That makes linear models easy to interpret and useful as a baseline.&lt;/p&gt;

&lt;h2&gt;
  
  
  Logistic Regression vs SVM vs Random Forest
&lt;/h2&gt;

&lt;p&gt;These three models show the classical ML landscape well.&lt;/p&gt;

&lt;p&gt;Logistic Regression:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;simple classification model&lt;/li&gt;
&lt;li&gt;outputs probability-like scores&lt;/li&gt;
&lt;li&gt;easy to interpret&lt;/li&gt;
&lt;li&gt;strong baseline for structured data&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Support Vector Machine:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;focuses on finding a good decision boundary&lt;/li&gt;
&lt;li&gt;uses margins to separate classes&lt;/li&gt;
&lt;li&gt;can work well when boundary quality matters&lt;/li&gt;
&lt;li&gt;less transparent than simple linear models&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Random Forest:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;combines many decision trees&lt;/li&gt;
&lt;li&gt;captures nonlinear feature interactions&lt;/li&gt;
&lt;li&gt;reduces overfitting compared with a single tree&lt;/li&gt;
&lt;li&gt;often strong on tabular data&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So the practical choice depends on the problem.&lt;/p&gt;

&lt;p&gt;Need interpretability?&lt;/p&gt;

&lt;p&gt;Start with Logistic Regression.&lt;/p&gt;

&lt;p&gt;Need a stronger boundary?&lt;/p&gt;

&lt;p&gt;Try SVM.&lt;/p&gt;

&lt;p&gt;Need nonlinear patterns in tabular data?&lt;/p&gt;

&lt;p&gt;Try Random Forest.&lt;/p&gt;

&lt;h2&gt;
  
  
  Model Complexity and Generalization
&lt;/h2&gt;

&lt;p&gt;A model should not only fit training data.&lt;/p&gt;

&lt;p&gt;It should work on new data.&lt;/p&gt;

&lt;p&gt;That is generalization.&lt;/p&gt;

&lt;p&gt;Too simple:&lt;/p&gt;

&lt;p&gt;The model underfits.&lt;/p&gt;

&lt;p&gt;It misses important patterns.&lt;/p&gt;

&lt;p&gt;Too complex:&lt;/p&gt;

&lt;p&gt;The model overfits.&lt;/p&gt;

&lt;p&gt;It memorizes noise.&lt;/p&gt;

&lt;p&gt;The goal is not maximum training accuracy.&lt;/p&gt;

&lt;p&gt;The goal is reliable performance on unseen data.&lt;/p&gt;

&lt;p&gt;That is why model complexity matters.&lt;/p&gt;

&lt;h2&gt;
  
  
  Where Gradient Descent Fits
&lt;/h2&gt;

&lt;p&gt;Some models learn parameters through optimization.&lt;/p&gt;

&lt;p&gt;Gradient descent is one of the most important optimization methods.&lt;/p&gt;

&lt;p&gt;It updates parameters in the direction that reduces loss.&lt;/p&gt;

&lt;p&gt;A simple view:&lt;/p&gt;

&lt;p&gt;new parameter = old parameter - learning rate × gradient&lt;/p&gt;

&lt;p&gt;The gradient tells the direction.&lt;/p&gt;

&lt;p&gt;The learning rate controls the step size.&lt;/p&gt;

&lt;p&gt;This idea connects classical ML directly to modern deep learning.&lt;/p&gt;

&lt;p&gt;The models may differ.&lt;/p&gt;

&lt;p&gt;But optimization remains central.&lt;/p&gt;

&lt;h2&gt;
  
  
  Classical ML vs Deep Learning
&lt;/h2&gt;

&lt;p&gt;This comparison makes the workflow clear.&lt;/p&gt;

&lt;p&gt;Classical Machine Learning:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;works well with structured data&lt;/li&gt;
&lt;li&gt;often depends on feature engineering&lt;/li&gt;
&lt;li&gt;usually easier to interpret&lt;/li&gt;
&lt;li&gt;can perform strongly with smaller datasets&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Deep Learning:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;learns representations automatically&lt;/li&gt;
&lt;li&gt;works well with images, text, audio, and large-scale data&lt;/li&gt;
&lt;li&gt;often needs more data and compute&lt;/li&gt;
&lt;li&gt;can be harder to interpret&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So classical ML is not simply “old ML.”&lt;/p&gt;

&lt;p&gt;It is the right tool when the data and constraints fit.&lt;/p&gt;

&lt;h2&gt;
  
  
  Recommended Learning Order
&lt;/h2&gt;

&lt;p&gt;If classical machine learning feels broad, learn it in this order:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Linear Models&lt;/li&gt;
&lt;li&gt;Logistic Regression&lt;/li&gt;
&lt;li&gt;Support Vector Machine&lt;/li&gt;
&lt;li&gt;Random Forest&lt;/li&gt;
&lt;li&gt;Model Complexity and Generalization&lt;/li&gt;
&lt;li&gt;Gradient Descent&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This order works because you first understand the model baseline.&lt;/p&gt;

&lt;p&gt;Then you compare model families.&lt;/p&gt;

&lt;p&gt;Then you connect prediction quality to generalization and optimization.&lt;/p&gt;

&lt;h2&gt;
  
  
  Takeaway
&lt;/h2&gt;

&lt;p&gt;Classical machine learning is built around structured data, features, models, and generalization.&lt;/p&gt;

&lt;p&gt;The shortest version is:&lt;/p&gt;

&lt;p&gt;Classical ML = features + interpretable models + generalization&lt;/p&gt;

&lt;p&gt;Linear models give the foundation.&lt;/p&gt;

&lt;p&gt;Logistic Regression gives a practical classification start.&lt;/p&gt;

&lt;p&gt;SVM improves the decision-boundary view.&lt;/p&gt;

&lt;p&gt;Random Forest captures stronger nonlinear patterns.&lt;/p&gt;

&lt;p&gt;If you remember one idea, remember this:&lt;/p&gt;

&lt;p&gt;Classical machine learning works best when you understand the features, the model assumptions, and how well the model generalizes.&lt;/p&gt;

&lt;h2&gt;
  
  
  Discussion
&lt;/h2&gt;

&lt;p&gt;When starting a new ML problem, do you usually begin with a simple baseline like Logistic Regression, or jump straight to stronger models like Random Forest?&lt;/p&gt;

&lt;p&gt;Originally published at zeromathai.com.&lt;br&gt;
Original article: &lt;a href="https://zeromathai.com/en/classical-machine-learning-hub-en/" rel="noopener noreferrer"&gt;https://zeromathai.com/en/classical-machine-learning-hub-en/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;GitHub Resources&lt;br&gt;
AI diagrams, study notes, and visual guides:&lt;br&gt;
&lt;a href="https://github.com/zeromathai/zeromathai-ai" rel="noopener noreferrer"&gt;https://github.com/zeromathai/zeromathai-ai&lt;/a&gt;&lt;/p&gt;

</description>
      <category>machinelearning</category>
      <category>ai</category>
      <category>datascience</category>
      <category>algorithms</category>
    </item>
    <item>
      <title>How Large Language Models Work — From Transformers to Conversational AI</title>
      <dc:creator>zeromathai</dc:creator>
      <pubDate>Tue, 12 May 2026 00:15:31 +0000</pubDate>
      <link>https://dev.to/zeromathai/how-large-language-models-work-from-transformers-to-conversational-ai-6nk</link>
      <guid>https://dev.to/zeromathai/how-large-language-models-work-from-transformers-to-conversational-ai-6nk</guid>
      <description>&lt;p&gt;LLMs can look like magic from the outside.&lt;/p&gt;

&lt;p&gt;You type a prompt.&lt;/p&gt;

&lt;p&gt;The model generates language.&lt;/p&gt;

&lt;p&gt;But underneath that behavior is a clear architecture.&lt;/p&gt;

&lt;h2&gt;
  
  
  Core Idea
&lt;/h2&gt;

&lt;p&gt;A Large Language Model is a neural network trained to understand and generate text.&lt;/p&gt;

&lt;p&gt;The key idea is not just size.&lt;/p&gt;

&lt;p&gt;It is language modeling at scale.&lt;/p&gt;

&lt;p&gt;An LLM learns patterns in text.&lt;/p&gt;

&lt;p&gt;Then it uses those patterns to predict and generate the next tokens.&lt;/p&gt;

&lt;p&gt;That simple loop becomes powerful when combined with massive data, deep architectures, and Transformer-based attention.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Key Structure
&lt;/h2&gt;

&lt;p&gt;A simplified LLM flow looks like this:&lt;/p&gt;

&lt;p&gt;Text Input → Tokenization → Transformer Layers → Next Token Prediction → Generated Text&lt;/p&gt;

&lt;p&gt;More compactly:&lt;/p&gt;

&lt;p&gt;LLM = tokens + Transformer + next-token prediction&lt;/p&gt;

&lt;p&gt;The model does not “think” in raw sentences.&lt;/p&gt;

&lt;p&gt;It processes tokens.&lt;/p&gt;

&lt;p&gt;Then it predicts what token should come next.&lt;/p&gt;

&lt;h2&gt;
  
  
  Implementation View
&lt;/h2&gt;

&lt;p&gt;At a high level, text generation works like this:&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;take the user input

split it into tokens

pass tokens through Transformer layers

compute probabilities for the next token

choose one token

append it to the sequence

repeat until stopping condition
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;p&gt;This loop is why LLMs can generate long responses.&lt;/p&gt;

&lt;p&gt;They do not write the whole answer at once.&lt;/p&gt;

&lt;p&gt;They generate one token at a time.&lt;/p&gt;

&lt;h2&gt;
  
  
  Concrete Example
&lt;/h2&gt;

&lt;p&gt;Suppose the input is:&lt;/p&gt;

&lt;p&gt;The capital of France is&lt;/p&gt;

&lt;p&gt;The model estimates likely next tokens.&lt;/p&gt;

&lt;p&gt;Maybe:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Paris&lt;/li&gt;
&lt;li&gt;Lyon&lt;/li&gt;
&lt;li&gt;France&lt;/li&gt;
&lt;li&gt;located&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;If “Paris” has the highest probability, the model may select it.&lt;/p&gt;

&lt;p&gt;Then the sequence becomes:&lt;/p&gt;

&lt;p&gt;The capital of France is Paris&lt;/p&gt;

&lt;p&gt;The model repeats the same process for the next token.&lt;/p&gt;

&lt;p&gt;That is the basic generation loop.&lt;/p&gt;

&lt;h2&gt;
  
  
  Encoder vs Decoder Models
&lt;/h2&gt;

&lt;p&gt;Transformer models are not all built the same way.&lt;/p&gt;

&lt;p&gt;The most important distinction is encoder-style vs decoder-style models.&lt;/p&gt;

&lt;p&gt;Encoder models are good at understanding input.&lt;/p&gt;

&lt;p&gt;Decoder models are good at generating output.&lt;/p&gt;

&lt;p&gt;Encoder-style models:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;read the input deeply&lt;/li&gt;
&lt;li&gt;build contextual representations&lt;/li&gt;
&lt;li&gt;work well for classification, search, and embedding tasks&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Decoder-style models:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;generate tokens step by step&lt;/li&gt;
&lt;li&gt;use previous tokens to predict the next token&lt;/li&gt;
&lt;li&gt;work well for chat, writing, coding, and text generation&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is why GPT-style systems are usually decoder-based.&lt;/p&gt;

&lt;p&gt;They are built for generation.&lt;/p&gt;

&lt;h2&gt;
  
  
  Encoder-Decoder Architecture
&lt;/h2&gt;

&lt;p&gt;Some Transformer systems use both sides.&lt;/p&gt;

&lt;p&gt;The encoder processes the input.&lt;/p&gt;

&lt;p&gt;The decoder generates the output.&lt;/p&gt;

&lt;p&gt;This structure is especially intuitive for tasks like translation.&lt;/p&gt;

&lt;p&gt;For example:&lt;/p&gt;

&lt;p&gt;English sentence → Encoder → Internal representation → Decoder → Korean sentence&lt;/p&gt;

&lt;p&gt;The encoder focuses on understanding.&lt;/p&gt;

&lt;p&gt;The decoder focuses on producing.&lt;/p&gt;

&lt;p&gt;That separation makes the architecture easy to reason about.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Attention Matters
&lt;/h2&gt;

&lt;p&gt;Attention is the key mechanism inside Transformers.&lt;/p&gt;

&lt;p&gt;It lets the model decide which tokens are relevant to each other.&lt;/p&gt;

&lt;p&gt;Instead of processing words only in order, attention compares relationships across the sequence.&lt;/p&gt;

&lt;p&gt;That matters because language depends on context.&lt;/p&gt;

&lt;p&gt;A word can change meaning depending on what came before it.&lt;/p&gt;

&lt;p&gt;Attention gives the model a way to use that context.&lt;/p&gt;

&lt;h2&gt;
  
  
  Cross-Attention
&lt;/h2&gt;

&lt;p&gt;Cross-attention connects two streams of information.&lt;/p&gt;

&lt;p&gt;For example, in an encoder-decoder model:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;the encoder represents the input&lt;/li&gt;
&lt;li&gt;the decoder generates the output&lt;/li&gt;
&lt;li&gt;cross-attention lets the decoder look at the encoder’s representation&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is useful when the output must depend closely on the input.&lt;/p&gt;

&lt;p&gt;Translation is the classic example.&lt;/p&gt;

&lt;p&gt;The decoder does not generate blindly.&lt;/p&gt;

&lt;p&gt;It attends to the encoded source sentence.&lt;/p&gt;

&lt;h2&gt;
  
  
  LLMs vs Traditional NLP Systems
&lt;/h2&gt;

&lt;p&gt;Traditional NLP systems often relied on many separate components.&lt;/p&gt;

&lt;p&gt;Token rules.&lt;/p&gt;

&lt;p&gt;Feature extraction.&lt;/p&gt;

&lt;p&gt;Syntax analysis.&lt;/p&gt;

&lt;p&gt;Task-specific classifiers.&lt;/p&gt;

&lt;p&gt;LLMs changed the workflow.&lt;/p&gt;

&lt;p&gt;Traditional NLP:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;many hand-designed stages&lt;/li&gt;
&lt;li&gt;task-specific pipelines&lt;/li&gt;
&lt;li&gt;limited flexibility&lt;/li&gt;
&lt;li&gt;harder to generalize across tasks&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;LLM-based systems:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;use one large model for many language tasks&lt;/li&gt;
&lt;li&gt;learn representations from data&lt;/li&gt;
&lt;li&gt;generate flexible outputs&lt;/li&gt;
&lt;li&gt;can power chat, summarization, coding, translation, and more&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is why LLMs became central to modern AI products.&lt;/p&gt;

&lt;p&gt;They turned language understanding and generation into a general interface.&lt;/p&gt;

&lt;h2&gt;
  
  
  From LLMs to Conversational AI
&lt;/h2&gt;

&lt;p&gt;Conversational AI is one of the most visible uses of LLMs.&lt;/p&gt;

&lt;p&gt;The model receives a user message.&lt;/p&gt;

&lt;p&gt;It interprets the context.&lt;/p&gt;

&lt;p&gt;It generates a response.&lt;/p&gt;

&lt;p&gt;But a real product usually adds more around the model:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;system instructions&lt;/li&gt;
&lt;li&gt;safety filters&lt;/li&gt;
&lt;li&gt;retrieval systems&lt;/li&gt;
&lt;li&gt;memory or session context&lt;/li&gt;
&lt;li&gt;tool use&lt;/li&gt;
&lt;li&gt;evaluation and monitoring&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So an LLM is the core engine.&lt;/p&gt;

&lt;p&gt;Conversational AI is the full system built around it.&lt;/p&gt;

&lt;h2&gt;
  
  
  Recommended Learning Order
&lt;/h2&gt;

&lt;p&gt;If LLM architecture feels too broad, learn it in this order:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Large Language Models&lt;/li&gt;
&lt;li&gt;Transformer&lt;/li&gt;
&lt;li&gt;Encoder-Decoder Architecture&lt;/li&gt;
&lt;li&gt;Encoder vs Decoder Transformers&lt;/li&gt;
&lt;li&gt;Attention Mechanism&lt;/li&gt;
&lt;li&gt;Cross-Attention&lt;/li&gt;
&lt;li&gt;Conversational AI&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This order works because you first understand what an LLM is.&lt;/p&gt;

&lt;p&gt;Then you understand the Transformer.&lt;/p&gt;

&lt;p&gt;Then you compare architecture types.&lt;/p&gt;

&lt;p&gt;Then you connect the model to real applications.&lt;/p&gt;

&lt;h2&gt;
  
  
  Takeaway
&lt;/h2&gt;

&lt;p&gt;LLMs are not magic text machines.&lt;/p&gt;

&lt;p&gt;They are Transformer-based models trained to predict and generate tokens.&lt;/p&gt;

&lt;p&gt;The shortest version is:&lt;/p&gt;

&lt;p&gt;LLM = Transformer architecture + token prediction + scale&lt;/p&gt;

&lt;p&gt;Encoder models are better for understanding.&lt;/p&gt;

&lt;p&gt;Decoder models are better for generation.&lt;/p&gt;

&lt;p&gt;Encoder-decoder models connect input understanding with output generation.&lt;/p&gt;

&lt;p&gt;If you remember one idea, remember this:&lt;/p&gt;

&lt;p&gt;An LLM generates language by repeatedly predicting the next token using context learned through Transformer attention.&lt;/p&gt;

&lt;h2&gt;
  
  
  Discussion
&lt;/h2&gt;

&lt;p&gt;When learning LLMs, do you find it easier to start from next-token prediction, Transformer architecture, or real applications like conversational AI?&lt;/p&gt;

&lt;p&gt;Originally published at zeromathai.com.&lt;br&gt;
Original article: &lt;a href="https://zeromathai.com/en/large-language-models-hub-en/" rel="noopener noreferrer"&gt;https://zeromathai.com/en/large-language-models-hub-en/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;GitHub Resources&lt;br&gt;
AI diagrams, study notes, and visual guides:&lt;br&gt;
&lt;a href="https://github.com/zeromathai/zeromathai-ai" rel="noopener noreferrer"&gt;https://github.com/zeromathai/zeromathai-ai&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>machinelearning</category>
      <category>llm</category>
      <category>deeplearning</category>
    </item>
    <item>
      <title>How Knowledge-Based AI Works — From Rules to Inference</title>
      <dc:creator>zeromathai</dc:creator>
      <pubDate>Sun, 10 May 2026 23:48:44 +0000</pubDate>
      <link>https://dev.to/zeromathai/how-knowledge-based-ai-works-from-rules-to-inference-3o1o</link>
      <guid>https://dev.to/zeromathai/how-knowledge-based-ai-works-from-rules-to-inference-3o1o</guid>
      <description>&lt;p&gt;Before AI learned from massive datasets, many systems worked with explicit knowledge.&lt;/p&gt;

&lt;p&gt;Facts.&lt;/p&gt;

&lt;p&gt;Rules.&lt;/p&gt;

&lt;p&gt;Inference.&lt;/p&gt;

&lt;p&gt;That is the core of Knowledge-Based AI.&lt;/p&gt;

&lt;h2&gt;
  
  
  Core Idea
&lt;/h2&gt;

&lt;p&gt;Knowledge-Based AI stores knowledge in a structured form.&lt;/p&gt;

&lt;p&gt;Then it uses rules to derive new conclusions.&lt;/p&gt;

&lt;p&gt;The system does not “learn” from data in the modern deep learning sense.&lt;/p&gt;

&lt;p&gt;It reasons over what it already knows.&lt;/p&gt;

&lt;p&gt;That makes the structure very different from machine learning.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Key Structure
&lt;/h2&gt;

&lt;p&gt;A simple Knowledge-Based AI system looks like this:&lt;/p&gt;

&lt;p&gt;Knowledge Base → Rules → Inference Engine → Conclusion&lt;/p&gt;

&lt;p&gt;Or more compact:&lt;/p&gt;

&lt;p&gt;Knowledge-Based AI = Facts + Rules + Inference&lt;/p&gt;

&lt;p&gt;The knowledge base stores information.&lt;/p&gt;

&lt;p&gt;The rule system defines how conclusions can be derived.&lt;/p&gt;

&lt;p&gt;The inference engine applies those rules.&lt;/p&gt;

&lt;h2&gt;
  
  
  Implementation View
&lt;/h2&gt;

&lt;p&gt;At a high level, a rule-based system works like this:&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;store known facts

store IF-THEN rules

compare facts with rule conditions

apply matching rules

generate new facts or conclusions

repeat until no useful rule applies
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;p&gt;This is why Knowledge-Based AI is easy to inspect.&lt;/p&gt;

&lt;p&gt;You can often trace exactly which rule produced which conclusion.&lt;/p&gt;

&lt;p&gt;That transparency is one of its biggest strengths.&lt;/p&gt;

&lt;h2&gt;
  
  
  Concrete Example
&lt;/h2&gt;

&lt;p&gt;Imagine a simple medical expert system.&lt;/p&gt;

&lt;p&gt;It may store facts like:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;patient has fever&lt;/li&gt;
&lt;li&gt;patient has cough&lt;/li&gt;
&lt;li&gt;patient has fatigue&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;And rules like:&lt;/p&gt;

&lt;p&gt;IF fever AND cough THEN possible infection&lt;/p&gt;

&lt;p&gt;IF possible infection AND fatigue THEN recommend further test&lt;/p&gt;

&lt;p&gt;The system does not train on millions of examples.&lt;/p&gt;

&lt;p&gt;It applies explicit rules.&lt;/p&gt;

&lt;p&gt;That makes the reasoning path easier to explain.&lt;/p&gt;

&lt;h2&gt;
  
  
  Rule-Based AI vs Machine Learning
&lt;/h2&gt;

&lt;p&gt;This comparison is important.&lt;/p&gt;

&lt;p&gt;Rule-Based AI:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;uses explicit facts and rules&lt;/li&gt;
&lt;li&gt;depends on human-designed knowledge&lt;/li&gt;
&lt;li&gt;is easier to explain&lt;/li&gt;
&lt;li&gt;struggles when rules become too many or too brittle&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Machine Learning:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;learns patterns from data&lt;/li&gt;
&lt;li&gt;improves through training&lt;/li&gt;
&lt;li&gt;handles noisy and complex data better&lt;/li&gt;
&lt;li&gt;can be harder to interpret&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So the difference is not just old AI vs modern AI.&lt;/p&gt;

&lt;p&gt;It is symbolic reasoning vs data-driven learning.&lt;/p&gt;

&lt;p&gt;Both solve problems in different ways.&lt;/p&gt;

&lt;h2&gt;
  
  
  Forward Chaining vs Backward Chaining
&lt;/h2&gt;

&lt;p&gt;Even with the same rules, inference can move in different directions.&lt;/p&gt;

&lt;p&gt;Forward chaining starts from known facts.&lt;/p&gt;

&lt;p&gt;It applies rules until it reaches conclusions.&lt;/p&gt;

&lt;p&gt;Backward chaining starts from a goal.&lt;/p&gt;

&lt;p&gt;It works backward to check whether the needed conditions are true.&lt;/p&gt;

&lt;p&gt;Forward chaining:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;data-driven&lt;/li&gt;
&lt;li&gt;useful when you want to discover what follows from known facts&lt;/li&gt;
&lt;li&gt;starts with available evidence&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Backward chaining:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;goal-driven&lt;/li&gt;
&lt;li&gt;useful when you want to prove or verify a target conclusion&lt;/li&gt;
&lt;li&gt;starts with the question&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The difference is simple:&lt;/p&gt;

&lt;p&gt;Forward chaining asks:&lt;/p&gt;

&lt;p&gt;“What can I conclude from what I know?”&lt;/p&gt;

&lt;p&gt;Backward chaining asks:&lt;/p&gt;

&lt;p&gt;“What must be true for this goal to hold?”&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Inference Engines Matter
&lt;/h2&gt;

&lt;p&gt;The inference engine is the part that makes the system active.&lt;/p&gt;

&lt;p&gt;A knowledge base alone only stores information.&lt;/p&gt;

&lt;p&gt;Rules alone only define possible logic.&lt;/p&gt;

&lt;p&gt;The inference engine applies the rules to produce conclusions.&lt;/p&gt;

&lt;p&gt;That is why it is the execution layer of Knowledge-Based AI.&lt;/p&gt;

&lt;p&gt;Without inference, the system is just a database.&lt;/p&gt;

&lt;p&gt;With inference, it becomes a reasoning system.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Expert Systems Were Important
&lt;/h2&gt;

&lt;p&gt;Expert systems are one of the clearest applications of Knowledge-Based AI.&lt;/p&gt;

&lt;p&gt;They encode domain knowledge from human experts.&lt;/p&gt;

&lt;p&gt;Then they use rules to make recommendations or decisions.&lt;/p&gt;

&lt;p&gt;Examples include:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;medical diagnosis support&lt;/li&gt;
&lt;li&gt;troubleshooting systems&lt;/li&gt;
&lt;li&gt;configuration systems&lt;/li&gt;
&lt;li&gt;rule-based decision support&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Their biggest strength is explainability.&lt;/p&gt;

&lt;p&gt;Their biggest weakness is maintenance.&lt;/p&gt;

&lt;p&gt;As the domain grows, the rule base can become difficult to manage.&lt;/p&gt;

&lt;h2&gt;
  
  
  Logical Extensions
&lt;/h2&gt;

&lt;p&gt;Knowledge-Based AI also connects to formal reasoning.&lt;/p&gt;

&lt;p&gt;Logic programming, such as PROLOG, represents knowledge as logical relations.&lt;/p&gt;

&lt;p&gt;Theorem proving uses formal logic to verify statements.&lt;/p&gt;

&lt;p&gt;Commonsense reasoning tries to represent everyday assumptions that humans usually take for granted.&lt;/p&gt;

&lt;p&gt;These extensions show the same basic idea:&lt;/p&gt;

&lt;p&gt;Represent knowledge explicitly.&lt;/p&gt;

&lt;p&gt;Then reason over it.&lt;/p&gt;

&lt;h2&gt;
  
  
  Recommended Learning Order
&lt;/h2&gt;

&lt;p&gt;If Knowledge-Based AI feels broad, learn it in this order:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Knowledge Base&lt;/li&gt;
&lt;li&gt;Rule-Based System&lt;/li&gt;
&lt;li&gt;Inference Engine&lt;/li&gt;
&lt;li&gt;Forward Chaining&lt;/li&gt;
&lt;li&gt;Backward Chaining&lt;/li&gt;
&lt;li&gt;Expert System&lt;/li&gt;
&lt;li&gt;Logic Programming&lt;/li&gt;
&lt;li&gt;Theorem Proving&lt;/li&gt;
&lt;li&gt;Commonsense Reasoning&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This order works because you first understand storage.&lt;/p&gt;

&lt;p&gt;Then rules.&lt;/p&gt;

&lt;p&gt;Then inference.&lt;/p&gt;

&lt;p&gt;Then practical and logical extensions.&lt;/p&gt;

&lt;h2&gt;
  
  
  Takeaway
&lt;/h2&gt;

&lt;p&gt;Knowledge-Based AI is built on explicit knowledge and reasoning.&lt;/p&gt;

&lt;p&gt;The shortest version is:&lt;/p&gt;

&lt;p&gt;Knowledge-Based AI = facts + rules + inference&lt;/p&gt;

&lt;p&gt;It is not mainly about learning from data.&lt;/p&gt;

&lt;p&gt;It is about using stored knowledge to reach conclusions.&lt;/p&gt;

&lt;p&gt;If you remember one idea, remember this:&lt;/p&gt;

&lt;p&gt;A knowledge-based system becomes intelligent when stored rules can generate new conclusions from known facts.&lt;/p&gt;

&lt;h2&gt;
  
  
  Discussion
&lt;/h2&gt;

&lt;p&gt;When building AI systems, do you prefer transparent rule-based reasoning, or flexible data-driven learning?&lt;/p&gt;

&lt;p&gt;Originally published at zeromathai.com.&lt;br&gt;
Original article: &lt;a href="https://zeromathai.com/en/knowledge-based-ai-hub-en/" rel="noopener noreferrer"&gt;https://zeromathai.com/en/knowledge-based-ai-hub-en/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;GitHub Resources&lt;br&gt;
AI diagrams, study notes, and visual guides:&lt;br&gt;
&lt;a href="https://github.com/zeromathai/zeromathai-ai" rel="noopener noreferrer"&gt;https://github.com/zeromathai/zeromathai-ai&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>programming</category>
      <category>logic</category>
    </item>
    <item>
      <title>How Intelligent Agents Work — From Perception to Decision and Action</title>
      <dc:creator>zeromathai</dc:creator>
      <pubDate>Sun, 10 May 2026 23:48:25 +0000</pubDate>
      <link>https://dev.to/zeromathai/how-intelligent-agents-work-from-perception-to-decision-and-action-im1</link>
      <guid>https://dev.to/zeromathai/how-intelligent-agents-work-from-perception-to-decision-and-action-im1</guid>
      <description>&lt;p&gt;AI is not just models.&lt;/p&gt;

&lt;p&gt;It is a system that perceives, decides, and acts.&lt;/p&gt;

&lt;p&gt;If you only think in terms of algorithms, you miss the bigger structure.&lt;/p&gt;

&lt;p&gt;The real question is:&lt;/p&gt;

&lt;p&gt;How does an AI system turn input into action?&lt;/p&gt;

&lt;h2&gt;
  
  
  Core Idea
&lt;/h2&gt;

&lt;p&gt;An intelligent agent is the simplest way to understand AI as a system.&lt;/p&gt;

&lt;p&gt;It takes input from the environment.&lt;/p&gt;

&lt;p&gt;Processes that information.&lt;/p&gt;

&lt;p&gt;Then selects an action.&lt;/p&gt;

&lt;p&gt;That loop defines AI behavior.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Key Structure
&lt;/h2&gt;

&lt;p&gt;The basic agent loop looks like this:&lt;/p&gt;

&lt;p&gt;Environment → Perception → State → Decision → Action → Environment&lt;/p&gt;

&lt;p&gt;Or more compact:&lt;/p&gt;

&lt;p&gt;Agent = Perception + Decision + Action&lt;/p&gt;

&lt;p&gt;This is why the agent concept matters.&lt;/p&gt;

&lt;p&gt;It connects data, reasoning, and behavior into one structure.&lt;/p&gt;

&lt;h2&gt;
  
  
  Implementation View
&lt;/h2&gt;

&lt;p&gt;At a high level, an agent behaves like this:&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;observe environment

update internal state

evaluate possible actions

choose the best action

execute action

repeat
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;p&gt;This loop appears everywhere.&lt;/p&gt;

&lt;p&gt;Game AI.&lt;/p&gt;

&lt;p&gt;Robotics.&lt;/p&gt;

&lt;p&gt;Autonomous systems.&lt;/p&gt;

&lt;p&gt;Recommendation systems.&lt;/p&gt;

&lt;p&gt;Even large language models follow a version of this pattern.&lt;/p&gt;

&lt;h2&gt;
  
  
  Concrete Example
&lt;/h2&gt;

&lt;p&gt;Imagine a simple robot.&lt;/p&gt;

&lt;p&gt;It receives sensor input.&lt;/p&gt;

&lt;p&gt;It detects obstacles.&lt;/p&gt;

&lt;p&gt;It chooses a direction.&lt;/p&gt;

&lt;p&gt;It moves.&lt;/p&gt;

&lt;p&gt;That is already an intelligent agent.&lt;/p&gt;

&lt;p&gt;Now scale that idea:&lt;/p&gt;

&lt;p&gt;A recommendation system observes user behavior.&lt;/p&gt;

&lt;p&gt;Updates internal preferences.&lt;/p&gt;

&lt;p&gt;Chooses the next item to show.&lt;/p&gt;

&lt;p&gt;That is also an agent.&lt;/p&gt;

&lt;p&gt;Different domain.&lt;/p&gt;

&lt;p&gt;Same structure.&lt;/p&gt;

&lt;h2&gt;
  
  
  Reactive vs Intelligent Agent
&lt;/h2&gt;

&lt;p&gt;Not all agents are equal.&lt;/p&gt;

&lt;p&gt;This comparison matters.&lt;/p&gt;

&lt;p&gt;Reactive agent:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;responds directly to input&lt;/li&gt;
&lt;li&gt;no memory or internal model&lt;/li&gt;
&lt;li&gt;simple and fast&lt;/li&gt;
&lt;li&gt;limited flexibility&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Intelligent agent:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;maintains internal state&lt;/li&gt;
&lt;li&gt;evaluates future outcomes&lt;/li&gt;
&lt;li&gt;can optimize decisions&lt;/li&gt;
&lt;li&gt;adapts to complex environments&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So the difference is not just complexity.&lt;/p&gt;

&lt;p&gt;It is the presence of internal reasoning.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Cognition Matters
&lt;/h2&gt;

&lt;p&gt;As problems become more complex, simple reaction is not enough.&lt;/p&gt;

&lt;p&gt;The agent needs internal representation.&lt;/p&gt;

&lt;p&gt;Memory.&lt;/p&gt;

&lt;p&gt;Inference.&lt;/p&gt;

&lt;p&gt;That is where cognition comes in.&lt;/p&gt;

&lt;p&gt;Cognitive systems treat thinking as information processing.&lt;/p&gt;

&lt;p&gt;Input is transformed into internal structure.&lt;/p&gt;

&lt;p&gt;That structure supports reasoning.&lt;/p&gt;

&lt;p&gt;So the flow becomes:&lt;/p&gt;

&lt;p&gt;Perception → Representation → Reasoning → Action&lt;/p&gt;

&lt;p&gt;Without this layer, AI is limited to simple responses.&lt;/p&gt;

&lt;p&gt;With it, AI can plan and infer.&lt;/p&gt;

&lt;h2&gt;
  
  
  Action vs Understanding
&lt;/h2&gt;

&lt;p&gt;This is where things get interesting.&lt;/p&gt;

&lt;p&gt;Does acting correctly mean understanding?&lt;/p&gt;

&lt;p&gt;A system can follow rules and produce correct outputs.&lt;/p&gt;

&lt;p&gt;But does it truly understand meaning?&lt;/p&gt;

&lt;p&gt;This question is not just philosophical.&lt;/p&gt;

&lt;p&gt;It affects how we interpret AI systems.&lt;/p&gt;

&lt;p&gt;Rule-following can look like intelligence.&lt;/p&gt;

&lt;p&gt;But it may not imply true understanding.&lt;/p&gt;

&lt;p&gt;That distinction matters when designing or evaluating AI.&lt;/p&gt;

&lt;h2&gt;
  
  
  Decision vs Free Will
&lt;/h2&gt;

&lt;p&gt;If an agent chooses actions, is that the same as free will?&lt;/p&gt;

&lt;p&gt;In humans, experiments suggest decisions may begin before conscious awareness.&lt;/p&gt;

&lt;p&gt;In AI, decisions are the result of computation.&lt;/p&gt;

&lt;p&gt;So the deeper question becomes:&lt;/p&gt;

&lt;p&gt;Is decision-making just a process?&lt;/p&gt;

&lt;p&gt;Or is there something more?&lt;/p&gt;

&lt;p&gt;Even if you do not answer it fully, this perspective helps you see AI systems differently.&lt;/p&gt;

&lt;p&gt;They are not just tools.&lt;/p&gt;

&lt;p&gt;They are structured decision systems.&lt;/p&gt;

&lt;h2&gt;
  
  
  From Agents to Modern AI Systems
&lt;/h2&gt;

&lt;p&gt;The agent view scales.&lt;/p&gt;

&lt;p&gt;Search algorithms:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;choose next state&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Knowledge-based systems:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;use rules and inference&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Neural networks:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;learn representations&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Modern AI combines these ideas.&lt;/p&gt;

&lt;p&gt;Perception.&lt;/p&gt;

&lt;p&gt;Representation.&lt;/p&gt;

&lt;p&gt;Decision.&lt;/p&gt;

&lt;p&gt;Learning.&lt;/p&gt;

&lt;p&gt;The agent is the unifying abstraction.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why This Matters
&lt;/h2&gt;

&lt;p&gt;If you only learn models, you miss system design.&lt;/p&gt;

&lt;p&gt;If you understand agents, you understand AI structure.&lt;/p&gt;

&lt;p&gt;That matters in practice.&lt;/p&gt;

&lt;p&gt;Because real systems are not just one model.&lt;/p&gt;

&lt;p&gt;They are pipelines.&lt;/p&gt;

&lt;p&gt;Loops.&lt;/p&gt;

&lt;p&gt;Decision processes.&lt;/p&gt;

&lt;p&gt;The agent view helps you design them.&lt;/p&gt;

&lt;h2&gt;
  
  
  Recommended Learning Order
&lt;/h2&gt;

&lt;p&gt;If this feels broad, follow this order:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Agent vs Intelligent Agent&lt;/li&gt;
&lt;li&gt;Intelligent Agent&lt;/li&gt;
&lt;li&gt;Cognitive Agents&lt;/li&gt;
&lt;li&gt;Cognitivism&lt;/li&gt;
&lt;li&gt;Chinese Room Argument&lt;/li&gt;
&lt;li&gt;Free Will and Decision Systems&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This order works because you first understand action.&lt;/p&gt;

&lt;p&gt;Then internal reasoning.&lt;/p&gt;

&lt;p&gt;Then the limits of understanding.&lt;/p&gt;

&lt;h2&gt;
  
  
  Takeaway
&lt;/h2&gt;

&lt;p&gt;AI is best understood as an agent.&lt;/p&gt;

&lt;p&gt;Not just a model.&lt;/p&gt;

&lt;p&gt;Not just an algorithm.&lt;/p&gt;

&lt;p&gt;A system that:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;perceives&lt;/li&gt;
&lt;li&gt;represents&lt;/li&gt;
&lt;li&gt;decides&lt;/li&gt;
&lt;li&gt;acts&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The shortest version is:&lt;/p&gt;

&lt;p&gt;Agent = perception + decision + action&lt;/p&gt;

&lt;p&gt;If you remember one idea, remember this:&lt;/p&gt;

&lt;p&gt;AI systems are decision loops, not isolated models.&lt;/p&gt;

&lt;h2&gt;
  
  
  Discussion
&lt;/h2&gt;

&lt;p&gt;When designing AI systems, do you think more in terms of models, or in terms of agents that interact with environments?&lt;/p&gt;

&lt;p&gt;Originally published at zeromathai.com.&lt;br&gt;
Original article: &lt;a href="https://zeromathai.com/en/intelligent-agent-and-cognition-hub-en/" rel="noopener noreferrer"&gt;https://zeromathai.com/en/intelligent-agent-and-cognition-hub-en/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;GitHub Resources&lt;br&gt;
AI diagrams, study notes, and visual guides:&lt;br&gt;
&lt;a href="https://github.com/zeromathai/zeromathai-ai" rel="noopener noreferrer"&gt;https://github.com/zeromathai/zeromathai-ai&lt;/a&gt;&lt;/p&gt;

</description>
      <category>ai</category>
      <category>machinelearning</category>
      <category>systems</category>
    </item>
    <item>
      <title>How Heuristics Make Search Algorithms Smarter</title>
      <dc:creator>zeromathai</dc:creator>
      <pubDate>Sun, 10 May 2026 04:06:03 +0000</pubDate>
      <link>https://dev.to/zeromathai/how-heuristics-make-search-algorithms-smarter-36oj</link>
      <guid>https://dev.to/zeromathai/how-heuristics-make-search-algorithms-smarter-36oj</guid>
      <description>&lt;p&gt;Search gets expensive when every path looks equally possible.&lt;/p&gt;

&lt;p&gt;That is the real problem.&lt;/p&gt;

&lt;p&gt;A heuristic gives the algorithm a sense of direction.&lt;/p&gt;

&lt;p&gt;It does not solve the problem by itself.&lt;/p&gt;

&lt;p&gt;But it tells the search what looks worth exploring first.&lt;/p&gt;

&lt;h2&gt;
  
  
  Core Idea
&lt;/h2&gt;

&lt;p&gt;A heuristic function estimates how close a state is to the goal.&lt;/p&gt;

&lt;p&gt;In search algorithms, that estimate becomes a decision signal.&lt;/p&gt;

&lt;p&gt;Instead of exploring blindly, the algorithm can prioritize promising states.&lt;/p&gt;

&lt;p&gt;That is why heuristics matter.&lt;/p&gt;

&lt;p&gt;They turn search from “try everything” into “try the most promising thing first.”&lt;/p&gt;

&lt;h2&gt;
  
  
  The Key Structure
&lt;/h2&gt;

&lt;p&gt;A simple search decision looks like this:&lt;/p&gt;

&lt;p&gt;Current State → Heuristic Estimate → Priority → Next State&lt;/p&gt;

&lt;p&gt;For A*, the structure is:&lt;/p&gt;

&lt;p&gt;f(n) = g(n) + h(n)&lt;/p&gt;

&lt;p&gt;Where:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;g(n) = cost from the start to the current node&lt;/li&gt;
&lt;li&gt;h(n) = estimated cost from the current node to the goal&lt;/li&gt;
&lt;li&gt;f(n) = total estimated cost&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The heuristic is h(n).&lt;/p&gt;

&lt;p&gt;It is the part that points the search toward the goal.&lt;/p&gt;

&lt;h2&gt;
  
  
  Implementation View
&lt;/h2&gt;

&lt;p&gt;At a high level, heuristic search works like this:&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;start from the initial state

while there are states to explore:
    estimate how promising each state is

    choose the state with the best score

    if it is the goal:
        return the solution

    expand the next states

return failure
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;p&gt;This is why heuristic quality matters in implementation.&lt;/p&gt;

&lt;p&gt;A weak heuristic barely improves search.&lt;/p&gt;

&lt;p&gt;A bad heuristic can guide the algorithm in the wrong direction.&lt;/p&gt;

&lt;p&gt;A good heuristic reduces wasted exploration.&lt;/p&gt;

&lt;h2&gt;
  
  
  Concrete Example
&lt;/h2&gt;

&lt;p&gt;Imagine pathfinding on a grid.&lt;/p&gt;

&lt;p&gt;You want to move from Start to Goal.&lt;/p&gt;

&lt;p&gt;If movement is only up, down, left, and right, Manhattan distance often fits well.&lt;/p&gt;

&lt;p&gt;It estimates distance like this:&lt;/p&gt;

&lt;p&gt;Manhattan distance = |x1 - x2| + |y1 - y2|&lt;/p&gt;

&lt;p&gt;If movement can happen freely in straight lines, Euclidean distance may fit better.&lt;/p&gt;

&lt;p&gt;Euclidean distance = straight-line distance&lt;/p&gt;

&lt;p&gt;The point is not that one is always better.&lt;/p&gt;

&lt;p&gt;The point is that the heuristic should match the structure of the problem.&lt;/p&gt;

&lt;h2&gt;
  
  
  Blind Search vs Heuristic Search
&lt;/h2&gt;

&lt;p&gt;Blind search has no sense of direction.&lt;/p&gt;

&lt;p&gt;It explores based only on the search rule.&lt;/p&gt;

&lt;p&gt;For example, BFS expands level by level.&lt;/p&gt;

&lt;p&gt;DFS goes deep first.&lt;/p&gt;

&lt;p&gt;Heuristic search adds an estimate.&lt;/p&gt;

&lt;p&gt;Blind search:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;explores without goal guidance&lt;/li&gt;
&lt;li&gt;can waste time on irrelevant paths&lt;/li&gt;
&lt;li&gt;works well for small or simple state spaces&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Heuristic search:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;uses a goal-directed signal&lt;/li&gt;
&lt;li&gt;prioritizes promising states&lt;/li&gt;
&lt;li&gt;becomes much more useful when the state space is large&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is why heuristics are so important in AI search.&lt;/p&gt;

&lt;p&gt;They do not just make the search faster.&lt;/p&gt;

&lt;p&gt;They change the order of exploration.&lt;/p&gt;

&lt;h2&gt;
  
  
  Greedy Search vs A*
&lt;/h2&gt;

&lt;p&gt;Greedy Search and A* both use heuristics.&lt;/p&gt;

&lt;p&gt;But they use them differently.&lt;/p&gt;

&lt;p&gt;Greedy Search uses only:&lt;/p&gt;

&lt;p&gt;h(n)&lt;/p&gt;

&lt;p&gt;A* uses:&lt;/p&gt;

&lt;p&gt;f(n) = g(n) + h(n)&lt;/p&gt;

&lt;p&gt;Greedy Search asks:&lt;/p&gt;

&lt;p&gt;“Which state looks closest to the goal?”&lt;/p&gt;

&lt;p&gt;A* asks:&lt;/p&gt;

&lt;p&gt;“Which state has the best total estimated path cost?”&lt;/p&gt;

&lt;p&gt;That difference matters.&lt;/p&gt;

&lt;p&gt;Greedy Search can be fast.&lt;/p&gt;

&lt;p&gt;But it can ignore the cost already paid.&lt;/p&gt;

&lt;p&gt;A* is more balanced because it combines actual cost with estimated future cost.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Admissibility Matters
&lt;/h2&gt;

&lt;p&gt;A heuristic is admissible if it never overestimates the true cost to the goal.&lt;/p&gt;

&lt;p&gt;In simple terms:&lt;/p&gt;

&lt;p&gt;h(n) &amp;lt;= true remaining cost&lt;/p&gt;

&lt;p&gt;This condition matters because A* depends on the heuristic.&lt;/p&gt;

&lt;p&gt;If the heuristic overestimates too much, A* may skip the optimal path.&lt;/p&gt;

&lt;p&gt;Admissibility keeps the estimate safe.&lt;/p&gt;

&lt;p&gt;It helps A* preserve the optimality guarantee.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Consistency Matters
&lt;/h2&gt;

&lt;p&gt;Consistency is also called monotonicity.&lt;/p&gt;

&lt;p&gt;It means the heuristic behaves smoothly as the search moves from one node to another.&lt;/p&gt;

&lt;p&gt;Conceptually:&lt;/p&gt;

&lt;p&gt;The estimated cost should not suddenly contradict the cost of moving between nodes.&lt;/p&gt;

&lt;p&gt;Consistency helps A* behave cleanly during expansion.&lt;/p&gt;

&lt;p&gt;In many implementations, it also avoids reopening already processed nodes.&lt;/p&gt;

&lt;p&gt;So the difference is:&lt;/p&gt;

&lt;p&gt;Admissibility protects optimality.&lt;/p&gt;

&lt;p&gt;Consistency keeps the search process stable.&lt;/p&gt;

&lt;p&gt;They are related, but not identical.&lt;/p&gt;

&lt;h2&gt;
  
  
  Recommended Learning Order
&lt;/h2&gt;

&lt;p&gt;If heuristics feel abstract, learn them in this order:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Heuristic Function&lt;/li&gt;
&lt;li&gt;Manhattan Distance vs Euclidean Distance&lt;/li&gt;
&lt;li&gt;Admissibility&lt;/li&gt;
&lt;li&gt;Consistency&lt;/li&gt;
&lt;li&gt;Greedy Search&lt;/li&gt;
&lt;li&gt;A* Algorithm&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This order works because you first understand the estimate.&lt;/p&gt;

&lt;p&gt;Then you see concrete distance examples.&lt;/p&gt;

&lt;p&gt;Then you understand the conditions that make heuristic search reliable.&lt;/p&gt;

&lt;h2&gt;
  
  
  Takeaway
&lt;/h2&gt;

&lt;p&gt;A heuristic is a search shortcut.&lt;/p&gt;

&lt;p&gt;But not a random shortcut.&lt;/p&gt;

&lt;p&gt;It is a structured estimate that tells the algorithm what looks promising.&lt;/p&gt;

&lt;p&gt;The shortest version is:&lt;/p&gt;

&lt;p&gt;Heuristic = estimated remaining cost&lt;/p&gt;

&lt;p&gt;In A*:&lt;/p&gt;

&lt;p&gt;f(n) = g(n) + h(n)&lt;/p&gt;

&lt;p&gt;The better h(n) matches the problem, the less unnecessary search you do.&lt;/p&gt;

&lt;p&gt;If you remember one idea, remember this:&lt;/p&gt;

&lt;p&gt;A heuristic makes search smarter by giving it direction before the full answer is known.&lt;/p&gt;

&lt;h2&gt;
  
  
  Discussion
&lt;/h2&gt;

&lt;p&gt;When designing a heuristic, do you prefer a simple safe estimate like Manhattan distance, or a more aggressive estimate that may guide the search faster?&lt;/p&gt;

&lt;p&gt;Originally published at zeromathai.com.&lt;br&gt;
Original article: &lt;a href="https://zeromathai.com/en/heuristic-function-ai-search-hub-en/" rel="noopener noreferrer"&gt;https://zeromathai.com/en/heuristic-function-ai-search-hub-en/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;GitHub Resources&lt;br&gt;
AI diagrams, study notes, and visual guides:&lt;br&gt;
&lt;a href="https://github.com/zeromathai/zeromathai-ai" rel="noopener noreferrer"&gt;https://github.com/zeromathai/zeromathai-ai&lt;/a&gt;&lt;/p&gt;

</description>
      <category>algorithms</category>
      <category>ai</category>
      <category>pathfinding</category>
      <category>programming</category>
    </item>
    <item>
      <title>How Graph Structure Makes AI Search Possible</title>
      <dc:creator>zeromathai</dc:creator>
      <pubDate>Sun, 10 May 2026 04:05:40 +0000</pubDate>
      <link>https://dev.to/zeromathai/how-graph-structure-makes-ai-search-possible-565m</link>
      <guid>https://dev.to/zeromathai/how-graph-structure-makes-ai-search-possible-565m</guid>
      <description>&lt;p&gt;AI search does not start with an algorithm.&lt;/p&gt;

&lt;p&gt;It starts with structure.&lt;/p&gt;

&lt;p&gt;Before BFS, DFS, A*, or heuristics can work, the problem must first become something searchable.&lt;/p&gt;

&lt;p&gt;That is where graphs come in.&lt;/p&gt;

&lt;h2&gt;
  
  
  Core Idea
&lt;/h2&gt;

&lt;p&gt;Many AI problems can be represented as graphs.&lt;/p&gt;

&lt;p&gt;A graph gives the problem a structure:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;states become nodes&lt;/li&gt;
&lt;li&gt;relationships become edges&lt;/li&gt;
&lt;li&gt;possible moves become paths&lt;/li&gt;
&lt;li&gt;goals become target nodes&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Once the problem has this form, search algorithms can operate on it.&lt;/p&gt;

&lt;p&gt;Without structure, search is just guessing.&lt;/p&gt;

&lt;p&gt;With structure, search becomes systematic.&lt;/p&gt;

&lt;h2&gt;
  
  
  The Key Structure
&lt;/h2&gt;

&lt;p&gt;The basic transformation looks like this:&lt;/p&gt;

&lt;p&gt;Problem → Graph → Search Order → Distance / Heuristic → Decision&lt;/p&gt;

&lt;p&gt;In simpler terms:&lt;/p&gt;

&lt;p&gt;Real-world problem → searchable structure → algorithmic choice&lt;/p&gt;

&lt;p&gt;That is why graph theory matters.&lt;/p&gt;

&lt;p&gt;It is the bridge between an abstract problem and a concrete search algorithm.&lt;/p&gt;

&lt;h2&gt;
  
  
  Implementation View
&lt;/h2&gt;

&lt;p&gt;At a high level, graph-based search works like this:&lt;/p&gt;

&lt;div class="highlight js-code-highlight"&gt;
&lt;pre class="highlight plaintext"&gt;&lt;code&gt;represent the problem as nodes and edges

choose a data structure for exploration

visit nodes in a controlled order

measure progress toward the goal

use heuristics if the search space is large

return a path, decision, or failure
&lt;/code&gt;&lt;/pre&gt;

&lt;/div&gt;

&lt;p&gt;This is why graph structure is not just theory.&lt;/p&gt;

&lt;p&gt;It directly affects implementation.&lt;/p&gt;

&lt;p&gt;The graph decides what can be visited.&lt;/p&gt;

&lt;p&gt;The data structure decides the order.&lt;/p&gt;

&lt;p&gt;The heuristic decides what looks promising.&lt;/p&gt;

&lt;h2&gt;
  
  
  Concrete Example
&lt;/h2&gt;

&lt;p&gt;Imagine a map navigation problem.&lt;/p&gt;

&lt;p&gt;Cities can be nodes.&lt;/p&gt;

&lt;p&gt;Roads can be edges.&lt;/p&gt;

&lt;p&gt;The destination is the goal.&lt;/p&gt;

&lt;p&gt;Now the problem becomes searchable.&lt;/p&gt;

&lt;p&gt;BFS can explore nearby cities first.&lt;/p&gt;

&lt;p&gt;DFS can follow one route deeply.&lt;/p&gt;

&lt;p&gt;A* can use distance estimates to prioritize promising routes.&lt;/p&gt;

&lt;p&gt;Same problem.&lt;/p&gt;

&lt;p&gt;Different search behavior.&lt;/p&gt;

&lt;p&gt;The difference comes from how the graph is explored.&lt;/p&gt;

&lt;h2&gt;
  
  
  Directed vs Undirected Graphs
&lt;/h2&gt;

&lt;p&gt;Not all graphs mean the same thing.&lt;/p&gt;

&lt;p&gt;Direction changes the meaning of the structure.&lt;/p&gt;

&lt;p&gt;An undirected graph says:&lt;/p&gt;

&lt;p&gt;A is connected to B.&lt;/p&gt;

&lt;p&gt;So movement or relationship can go both ways.&lt;/p&gt;

&lt;p&gt;A directed graph says:&lt;/p&gt;

&lt;p&gt;A points to B.&lt;/p&gt;

&lt;p&gt;So the relationship has direction.&lt;/p&gt;

&lt;p&gt;This matters in AI because direction often represents dependency, order, or allowed movement.&lt;/p&gt;

&lt;p&gt;For example:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;road networks may be directed if some roads are one-way&lt;/li&gt;
&lt;li&gt;dependency graphs are directed&lt;/li&gt;
&lt;li&gt;task scheduling often uses direction&lt;/li&gt;
&lt;li&gt;causal-style structures often use direction&lt;/li&gt;
&lt;/ul&gt;

&lt;h2&gt;
  
  
  DAG vs Undirected Graph
&lt;/h2&gt;

&lt;p&gt;A DAG is a Directed Acyclic Graph.&lt;/p&gt;

&lt;p&gt;That means:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;edges have direction&lt;/li&gt;
&lt;li&gt;cycles are not allowed&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This makes DAGs useful when order matters.&lt;/p&gt;

&lt;p&gt;For example:&lt;/p&gt;

&lt;p&gt;Task A must happen before Task B.&lt;/p&gt;

&lt;p&gt;An undirected graph is different.&lt;/p&gt;

&lt;p&gt;It represents mutual connection without direction.&lt;/p&gt;

&lt;p&gt;So the key comparison is:&lt;/p&gt;

&lt;p&gt;DAG = ordered dependency structure&lt;/p&gt;

&lt;p&gt;Undirected graph = mutual relationship structure&lt;/p&gt;

&lt;p&gt;If you mix these up, the meaning of the graph changes.&lt;/p&gt;

&lt;h2&gt;
  
  
  Queue vs Stack
&lt;/h2&gt;

&lt;p&gt;Once the graph exists, search needs an exploration rule.&lt;/p&gt;

&lt;p&gt;Two basic data structures explain a lot:&lt;/p&gt;

&lt;p&gt;Queue:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;first in, first out&lt;/li&gt;
&lt;li&gt;used by BFS&lt;/li&gt;
&lt;li&gt;explores level by level&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Stack:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;last in, first out&lt;/li&gt;
&lt;li&gt;used by DFS&lt;/li&gt;
&lt;li&gt;follows one path deeply&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is why BFS and DFS behave so differently.&lt;/p&gt;

&lt;p&gt;They are not mysterious.&lt;/p&gt;

&lt;p&gt;They are mostly different because they use different exploration orders.&lt;/p&gt;

&lt;h2&gt;
  
  
  BFS vs DFS
&lt;/h2&gt;

&lt;p&gt;BFS expands the nearest nodes first.&lt;/p&gt;

&lt;p&gt;DFS follows one branch as deep as possible.&lt;/p&gt;

&lt;p&gt;BFS:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;good when shortest path in an unweighted graph matters&lt;/li&gt;
&lt;li&gt;can use more memory&lt;/li&gt;
&lt;li&gt;explores broadly&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;DFS:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;good when memory is limited&lt;/li&gt;
&lt;li&gt;can go deep quickly&lt;/li&gt;
&lt;li&gt;may explore a poor branch for too long&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;So the choice is not “which is better?”&lt;/p&gt;

&lt;p&gt;The better question is:&lt;/p&gt;

&lt;p&gt;What kind of exploration does the problem need?&lt;/p&gt;

&lt;h2&gt;
  
  
  Distance Metrics and Heuristics
&lt;/h2&gt;

&lt;p&gt;Basic graph search follows structure.&lt;/p&gt;

&lt;p&gt;Heuristic search adds judgment.&lt;/p&gt;

&lt;p&gt;To make better decisions, the algorithm needs a way to estimate closeness.&lt;/p&gt;

&lt;p&gt;That is where distance metrics help.&lt;/p&gt;

&lt;p&gt;For grid-like problems:&lt;/p&gt;

&lt;p&gt;Manhattan distance works well when movement is horizontal and vertical.&lt;/p&gt;

&lt;p&gt;Euclidean distance works well when straight-line distance matters.&lt;/p&gt;

&lt;p&gt;Then a heuristic function uses that idea to estimate remaining cost.&lt;/p&gt;

&lt;p&gt;A simple pathfinding view:&lt;/p&gt;

&lt;p&gt;f(n) = g(n) + h(n)&lt;/p&gt;

&lt;p&gt;Where:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;g(n) = cost so far&lt;/li&gt;
&lt;li&gt;h(n) = estimated cost to the goal&lt;/li&gt;
&lt;li&gt;f(n) = total estimated cost&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;This is the core of A*.&lt;/p&gt;

&lt;h2&gt;
  
  
  Why Heuristic Quality Matters
&lt;/h2&gt;

&lt;p&gt;A heuristic can make search much faster.&lt;/p&gt;

&lt;p&gt;But it must be designed carefully.&lt;/p&gt;

&lt;p&gt;If the heuristic is too weak, search behaves almost like brute force.&lt;/p&gt;

&lt;p&gt;If the heuristic is too aggressive, it may guide the search incorrectly.&lt;/p&gt;

&lt;p&gt;For A*, two conditions often matter:&lt;/p&gt;

&lt;p&gt;Admissibility:&lt;/p&gt;

&lt;p&gt;The heuristic should not overestimate the true cost.&lt;/p&gt;

&lt;p&gt;Consistency:&lt;/p&gt;

&lt;p&gt;The heuristic should stay stable as the search moves between nodes.&lt;/p&gt;

&lt;p&gt;These conditions help heuristic search remain reliable.&lt;/p&gt;

&lt;h2&gt;
  
  
  Recommended Learning Order
&lt;/h2&gt;

&lt;p&gt;If graph-based search feels scattered, learn it in this order:&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Graph Theory&lt;/li&gt;
&lt;li&gt;Directed and Undirected Graphs&lt;/li&gt;
&lt;li&gt;DAG&lt;/li&gt;
&lt;li&gt;Queue&lt;/li&gt;
&lt;li&gt;Stack&lt;/li&gt;
&lt;li&gt;BFS&lt;/li&gt;
&lt;li&gt;DFS&lt;/li&gt;
&lt;li&gt;Distance Metrics&lt;/li&gt;
&lt;li&gt;Heuristic Function&lt;/li&gt;
&lt;li&gt;A* Algorithm&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;This order works because you first understand the structure.&lt;/p&gt;

&lt;p&gt;Then you understand exploration order.&lt;/p&gt;

&lt;p&gt;Then you understand heuristic guidance.&lt;/p&gt;

&lt;h2&gt;
  
  
  Takeaway
&lt;/h2&gt;

&lt;p&gt;AI search is not just about algorithms.&lt;/p&gt;

&lt;p&gt;It is about turning a problem into a structure that algorithms can operate on.&lt;/p&gt;

&lt;p&gt;The shortest version is:&lt;/p&gt;

&lt;p&gt;Graph structure + exploration order + heuristic guidance = search behavior&lt;/p&gt;

&lt;p&gt;Graphs define the possible world.&lt;/p&gt;

&lt;p&gt;Queues and stacks define how that world is explored.&lt;/p&gt;

&lt;p&gt;Distance metrics and heuristics define what looks promising.&lt;/p&gt;

&lt;p&gt;If you remember one idea, remember this:&lt;/p&gt;

&lt;p&gt;Before you choose BFS, DFS, or A*, first ask how the problem should be represented as a graph.&lt;/p&gt;

&lt;h2&gt;
  
  
  Discussion
&lt;/h2&gt;

&lt;p&gt;When solving graph search problems, do you usually start by choosing the algorithm first, or by modeling the states and edges first?&lt;/p&gt;

&lt;p&gt;Originally published at zeromathai.com.&lt;br&gt;
Original article: &lt;a href="https://zeromathai.com/en/graph-structure-search-hub-en/" rel="noopener noreferrer"&gt;https://zeromathai.com/en/graph-structure-search-hub-en/&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;GitHub Resources&lt;br&gt;
AI diagrams, study notes, and visual guides:&lt;br&gt;
&lt;a href="https://github.com/zeromathai/zeromathai-ai" rel="noopener noreferrer"&gt;https://github.com/zeromathai/zeromathai-ai&lt;/a&gt;&lt;/p&gt;

</description>
      <category>algorithms</category>
      <category>ai</category>
      <category>datastructures</category>
      <category>graphtheory</category>
    </item>
  </channel>
</rss>
