DEV Community

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

A11: A Structural Answer to AI Collapse

3 min read
Gemma 4 12B shows how far local multimodal AI has moved

5 min read
NVIDIA Cosmos 3: How a Two-Tower Architecture Unifies Physical AI Reasoning and Generation

5 min read
Building Medical AI for the Other 90%: A Field Report from a Solo Developer

5 min read
NVIDIA Cosmos 3: Unifying Physical AI Reasoning and Generation with Two-Tower Architecture

5 min read
The Hierarchical Reasoning Model: Can a 27M-Parameter Network Outthink Chain-of-Thought?

5 min read
Intercepting Gradients in PyTorch: Preprocess the Update Before Your Optimizer Sees It

3 min read
The Technology Behind Viral AI Image Generators

3 min read
Making RNNs Actually Work: LSTMs, Bidirectionality, and the Encoder-Decoder

15 min read
Demystifying Deep Learning Optimization: From Feature Scaling to Adam and Beyond

7 min read
Andrej Karpathy's Neural Networks: Zero to Hero — 1) Intro to Neural Networks and Backpropagation

9 min read
GPT-5.5: OpenAI Admits Decline. The AI Reality Check.

9 min read
Deep Learning for Beginners: A Complete Guide

12 min read
How Neural Networks Actually Work — A Thread for Curious Minds

2 min read
Time When More Layers Meant Worse Model ... Birth Of Residual

6 min read