How Does AI Learn? Training Data, Patterns & Models Explained (2026) | AI Basics Day 2

#trainingdata #supervisedlearning #asics #inacking

📰 Originally published on Securityelites — AI Red Team Education — the canonical, fully-updated version of this article.

🤖 AI BASICS FOR BEGINNERS FREE

Course Hub →

Day 2 of 5 · 40% complete

Every time you click “like” on a video, watch something all the way to the end, or skip a song after 5 seconds — you’re teaching an AI. Your clicks are training data. The AI is watching what you do and learning your patterns. It gets better at predicting what you want because of you.

That’s how AI learns — from examples. And the more examples it gets, the better it gets. But here’s the really interesting part: what an AI learns completely determines what it can do — and also what it can’t do, and how it can fail.

Today I’m taking you inside the learning process. You’ll see exactly how an AI goes from knowing nothing to being scarily good at its job. And you’ll see why that same learning process can be tampered with — which is one of the sneakiest ways to break an AI.

🎯 What You’ll Learn in Day 2

✅ How AI goes from zero knowledge to scary-good
✅ What “training data” is and why it’s everything
✅ Three ways AI can learn — with real-life examples of each
✅ What “model weights” are (in plain English, no maths)
✅ How someone could secretly corrupt an AI by messing with its training

⏱ 25 min read · 3 exercises · Browser helpful for exercises

📋 Before You Start:

Completed Day 1: What Is Artificial Intelligence?
Remember: AI learns from examples and makes guesses about new things
Remember: AI matches patterns — it doesn’t truly understand anything

How Does AI Learn? — Day 2 of 5

Training Data — The Fuel That Powers AI
Three Ways AI Can Learn
What Actually Happens During Training
What Are Model Weights? (No Maths, Promise)
Three Ways Learning Can Go Wrong
The Sneaky Attack: Poisoning the Training Data
Questions and Answers

Yesterday we covered what AI is. Today we go one level deeper — into the learning process itself. This is where things get really interesting. The adversarial machine learning attacks you’ll learn about later all trace back to how training works. So does understanding the LLM hacking series. Let’s build the foundation.

Training Data — The Fuel That Powers AI

Training data is the collection of examples an AI learns from. It’s the most important ingredient. An AI is literally only as good as what you show it. Bad examples → bad AI. Sneaky examples → dangerous AI. Amazing examples → amazing AI.

Let me make this concrete. Imagine you want to teach an AI to tell the difference between dogs and cats in photos.

You need training data: thousands (or millions) of photos. Each photo needs a label: “dog” or “cat.” The AI looks at the photo and the label, over and over, millions of times. It figures out what patterns separate dogs from cats. Pointy ears vs floppy ears. Whiskers vs no whiskers. Different eye shapes. Fur patterns. Body proportions. The AI learns all of this without you ever telling it what to look for.

Training data has three things that really matter:

Lots of it. More examples = better patterns found. An AI trained on 100 photos of cats is going to make lots of mistakes. An AI trained on 100 million photos is going to be very, very good. ChatGPT was trained on more text than any human could read in thousands of lifetimes.

Good variety. If you only show the AI photos of orange cats, it’ll be confused by black cats, white cats, and kittens. The training examples need to include all the different versions of the thing you want it to recognise. This is called “diversity.” When training data isn’t diverse, the AI develops blind spots — things it fails on predictably.

Correct labels. Every example needs the right answer attached. If you accidentally label 10% of your cat photos as “dog,” the AI learns the wrong patterns from those examples. Wrong labels = AI learns wrong things = AI makes wrong predictions.

securityelites.com

TRAINING DATA QUALITY — WHAT MATTERS

VOLUME
More examples → AI sees more patterns → gets smarter

VARIETY
One-sided data → AI develops blind spots it fails on every time

LABELS
Wrong labels → AI learns the wrong lesson from those examples

⚠️ POISONED
Someone sneaks in bad examples → AI learns to behave wrong on purpose

📸 The four quality levels of training data. The first three are quality problems that happen by accident. The last one (red) is an attack — someone doing it on purpose. We’ll cover that in Section 6.

Three Ways AI Can Learn

Not all AI learns the same way. There are three main styles of learning. Each one is used for different jobs — and each one can be attacked differently. I think of them like three different ways a student could study for a test.

📖 Read the complete guide on Securityelites — AI Red Team Education

This article continues with deeper technical detail, screenshots, code samples, and an interactive lab walk-through. Read the full article on Securityelites — AI Red Team Education →

This article was originally written and published by the Securityelites — AI Red Team Education team. For more cybersecurity tutorials, ethical hacking guides, and CTF walk-throughs, visit Securityelites — AI Red Team Education.