DEV Community

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Caching Strategies for LLM Systems (Part 2): KV Cache and the Mathematics of Fast Transformer Inference

Caching Strategies for LLM Systems (Part 2): KV Cache and the Mathematics of Fast Transformer Inference

Comments
4 min read
TensorFlow - Hyperparameter Tuning - Complete Tutorial

TensorFlow - Hyperparameter Tuning - Complete Tutorial

Comments
2 min read
My single best advice for anyone wanting to start in AI:

My single best advice for anyone wanting to start in AI:

Comments
1 min read
Our GPU Was Idle 77% of the Time. Here's How We Fixed It

Our GPU Was Idle 77% of the Time. Here's How We Fixed It

Comments
4 min read
How I Built a Full-Stack ML App (and Fixed a 3GB Docker Image) 🐳

How I Built a Full-Stack ML App (and Fixed a 3GB Docker Image) 🐳

Comments
3 min read
Deep Learning Mastery(ft PyTorch) -Pt1 ⚡

Deep Learning Mastery(ft PyTorch) -Pt1 ⚡

1
Comments
3 min read
🔥 PyTorch Tutorial 1.1: Tensor Basics - From Zero to Hero

🔥 PyTorch Tutorial 1.1: Tensor Basics - From Zero to Hero

Comments 1
3 min read
Why Edge AI Research Needs Field Validation: Lessons from Replicating MIT CSAIL

Why Edge AI Research Needs Field Validation: Lessons from Replicating MIT CSAIL

5
Comments 1
2 min read
"Interactive Tools That Actually Help You Learn Transformers and Deep Learning"

"Interactive Tools That Actually Help You Learn Transformers and Deep Learning"

Comments
3 min read
The Myth of “Just Add a GPU” in Machine Learning

The Myth of “Just Add a GPU” in Machine Learning

5
Comments
3 min read
🧠Perceptron vs XOR: Why One Math Problem Changed AI Forever

🧠Perceptron vs XOR: Why One Math Problem Changed AI Forever

Comments
6 min read
Building a Transparent Skin Health Classifier: Fine-tuned EfficientNet + Grad-CAM 🩺

Building a Transparent Skin Health Classifier: Fine-tuned EfficientNet + Grad-CAM 🩺

1
Comments
4 min read
Why "Attention" Changed Everything: A Deep Dive into the Transformer Architecture

Why "Attention" Changed Everything: A Deep Dive into the Transformer Architecture

Comments
4 min read
Conversational Development With Claude Code — Part 5: Full‑Stack Architecture Analysis

Conversational Development With Claude Code — Part 5: Full‑Stack Architecture Analysis

1
Comments
3 min read
Training classic ML Models using GPU on windows

Training classic ML Models using GPU on windows

7
Comments
4 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.