DEV Community

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Introduction to ML Compilers + Roadmap (MLIR, TVM, GPU Kernels)

Introduction to ML Compilers + Roadmap (MLIR, TVM, GPU Kernels)

Comments
1 min read
Benchmark Shadows Study: Data Alignment Limits LLM Generalization

Benchmark Shadows Study: Data Alignment Limits LLM Generalization

Comments
6 min read
AI News Update: April 10, 2026 - A Week of Breakthroughs and Concerns

AI News Update: April 10, 2026 - A Week of Breakthroughs and Concerns

Comments
5 min read
Đưa World Model Từ Bản Demo Đẹp Mắt Thành Trải Nghiệm Tương Tác Thực Sự Trên GPU Phổ Thông

Đưa World Model Từ Bản Demo Đẹp Mắt Thành Trải Nghiệm Tương Tác Thực Sự Trên GPU Phổ Thông

Comments
15 min read
Why Reasoning Models Changed Everything

Why Reasoning Models Changed Everything

Comments
8 min read
How to Train a 100B+ Parameter Model When You Can't Afford a GPU Cluster

How to Train a 100B+ Parameter Model When You Can't Afford a GPU Cluster

Comments 1
5 min read
𝗪𝗵𝗮𝘁 𝗜 𝗟𝗲𝗮𝗿𝗻𝗲𝗱 𝗳𝗿𝗼𝗺 𝗖𝗵𝗮𝗽𝘁𝗲𝗿 𝟮 𝗼𝗳 𝗔𝗜 𝗘𝗻𝗴𝗶𝗻𝗲𝗲𝗿𝗶𝗻𝗴: 𝗪𝗵𝘆 𝗦𝗮𝗺𝗽𝗹𝗶𝗻𝗴 𝗖𝗵𝗮𝗻𝗴𝗲𝘀 𝗘𝘃𝗲𝗿𝘆𝘁𝗵𝗶𝗻𝗴

𝗪𝗵𝗮𝘁 𝗜 𝗟𝗲𝗮𝗿𝗻𝗲𝗱 𝗳𝗿𝗼𝗺 𝗖𝗵𝗮𝗽𝘁𝗲𝗿 𝟮 𝗼𝗳 𝗔𝗜 𝗘𝗻𝗴𝗶𝗻𝗲𝗲𝗿𝗶𝗻𝗴: 𝗪𝗵𝘆 𝗦𝗮𝗺𝗽𝗹𝗶𝗻𝗴 𝗖𝗵𝗮𝗻𝗴𝗲𝘀 𝗘𝘃𝗲𝗿𝘆𝘁𝗵𝗶𝗻𝗴

1
Comments
4 min read
Blog 2: Momentum-Based Optimizers

Blog 2: Momentum-Based Optimizers

Comments
6 min read
Blog 1: Foundations of Gradient Descent

Blog 1: Foundations of Gradient Descent

Comments
5 min read
I Built the World's First AI Knowledge Arena — Battle Other Devs on ML & Deep Learning

I Built the World's First AI Knowledge Arena — Battle Other Devs on ML & Deep Learning

Comments
3 min read
Policy Gradients: REINFORCE from Scratch with NumPy

Policy Gradients: REINFORCE from Scratch with NumPy

Comments
16 min read
2 Lines of Code Saved 6.4x Memory on My Snake AI

2 Lines of Code Saved 6.4x Memory on My Snake AI

3
Comments
6 min read
Replace Claude Code's Context-Stuffing with git-semantic for Team-Wide Semantic Search

Replace Claude Code's Context-Stuffing with git-semantic for Team-Wide Semantic Search

Comments
3 min read
How Transformer Models Actually Work

How Transformer Models Actually Work

1
Comments
3 min read
Image Prompt Packaging Cuts Multimodal Inference Costs Up to 91%

Image Prompt Packaging Cuts Multimodal Inference Costs Up to 91%

Comments
6 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.