DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science that focuses on using data and algorithms to imitate the way humans learn, gradually improving in accuracy.

Posts

Fine-Tuning AI for Free — Kaggle + QLoRA Hands-On Guide

Comments
5 min read
Understanding Attention Mechanisms – Part 2: Comparing Encoder and Decoder Outputs

16
Comments
2 min read
550 Hallucinations, Zero Discoveries: What Happens When You Force an LLM to Invent Mathematics

Comments
8 min read
How LLM Memory Actually Works in Production Systems

Comments
4 min read
Caching Strategies for LLM Systems – Part 4: Grouped-Query Attention for Scalable, Efficient Transformers

Comments
3 min read
🚀 Google Just Dropped Gemini 3.1 Pro: The 1M-Token Beast That Will Break Your PR Workflow

1
Comments
3 min read
What If the GPU Was Never Hardware? Rethinking AI Acceleration with Pure Software

1
Comments
4 min read
How to Add Persistent Memory to an LLM App (Without Fine-Tuning) — A Practical Architecture Guide

Comments
4 min read
How I trained a computer vision model on the AWS Free Tier

9
Comments
8 min read
How API Data Bloat is Ruining Your AI Agents (And How I Cut Token Usage by 98% in Python)

5
Comments 1
3 min read
When 100.00 Means Nothing: Gaming Coding Assessments

25
Comments 3
3 min read
ML in Warehouse Operations - How I Built a Production ML System to Automate Fashion Return Classification

Comments
4 min read
Building Transformer from Scratch

Comments
8 min read
Beyond Words: Building an AI Stress Detector with Wav2Vec 2.0 and PyTorch

1
Comments
4 min read
Understanding Attention Mechanisms – Part 1: Why Long Sentences Break Encoder–Decoders

16
Comments
2 min read