DEV Community

Deep Learning

This tag is for discussing deep learning, a subfield of machine learning: sharing articles and asking questions.

Posts

Qwen Image Base Model Training vs FLUX SRPO Training 20 images comparison

2 min read
Introduction to PyTorch

7 min read
Real-Time Horn Detection and Noise Regulation System for Silence Zones

3 min read
Building Intelligent AI Agents with Modular Reinforcement Learning

13 min read
The Role of GPUs in Accelerating Deep Learning Training

5 min read
Boosting Wan2.2 I2V Inference on 8 H100s: 2.5x Faster with Sequence Parallelism & Magcache

3 min read
Why GPUs Are the Secret Weapon for Faster Deep Learning Training

6 min read
Diagnosing layer sensitivity during post training quantization

4 min read
Large Model Fine-Tuning: SFT

1 min read
Real-Time Face Recognition Attendance — QR Access & Google Sheets Integration

3 min read
I Skipped My Birthday to Give Go Its First Real ML Framework

4 min read
Zero-Degradation Training: 92% ImageNet-100 Accuracy with 61% Energy Savings

4 min read
AI News: Fri, Nov 07, 2025

6 min read
Majestic Labs vs. the Memory Wall

5 min read
Tokenization in NLP: The Foundational Step That Turns Language Into Data

3 min read
Linear Algebra for AI — Part 1

2 min read
🦄 When ML Models Go Wild: Unintentional Art Created by Neural Networks

5 min read
Transformers and Attention: How LLMs Actually Process Text

19 min read
Building a 75,000-Product Image Feature Dataset for the Amazon ML Challenge 2025

4 min read
DragonMemory: Neural Sequence Compression for Production RAG

8 min read
Nested Learning — My Reflections on a Model That Learns How to Learn

8 min read
DeepSeek-OCR: When a Picture Is Actually Worth 10x Fewer Tokens

9 min read
AI Fundamentals: Gateway to Artificial Intelligence

3 min read
Speculative Decoding: Making LLMs Faster Without Sacrificing Quality

14 min read
🧠 The Simplest Neural Network That Actually Works

1 min read