DEV Community

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Inside ChatGPT: Deconstructing "Attention Is All You Need" (Part 1)

Inside ChatGPT: Deconstructing "Attention Is All You Need" (Part 1)

5
Comments
5 min read
Decoder-Only Transformers: The Architecture Behind GPT Models

Decoder-Only Transformers: The Architecture Behind GPT Models

Comments
5 min read
Why Studying the Turing Machine Changed How I See AI And Why Every New AI Engineer Should Revisit It

Why Studying the Turing Machine Changed How I See AI And Why Every New AI Engineer Should Revisit It

2
Comments
3 min read
Getting Started with Azure: Create and Configure a Windows 10 Virtual Machine

Getting Started with Azure: Create and Configure a Windows 10 Virtual Machine

Comments
4 min read
🧠💥 “Linear Algebra Ruined My Life (and Made Me Better at AI)”

🧠💥 “Linear Algebra Ruined My Life (and Made Me Better at AI)”

5
Comments
4 min read
Stock Price Prediction

Stock Price Prediction

Comments
1 min read
Building an Enhanced PPO Trading Bot with Real-Time Data Sync and IBKR Integration

Building an Enhanced PPO Trading Bot with Real-Time Data Sync and IBKR Integration

4
Comments
7 min read
The Evolution of Sequential Learning Models: RNN LSTM Transformers

The Evolution of Sequential Learning Models: RNN LSTM Transformers

Comments
2 min read
Devtool for running and benchmarking local AI

Devtool for running and benchmarking local AI

2
Comments
3 min read
BIGRAM LANGUAGE MODELS USING A NEURAL NET

BIGRAM LANGUAGE MODELS USING A NEURAL NET

Comments
14 min read
The Math Behind Machine Learning & Deep Learning (Explained Simply)

The Math Behind Machine Learning & Deep Learning (Explained Simply)

Comments
3 min read
LLM Concepts (Explained Without Making Your Brain Hurt): What Every Developer Should Know

LLM Concepts (Explained Without Making Your Brain Hurt): What Every Developer Should Know

1
Comments
4 min read
Qwen Image Models Training - 0 to Hero Level Tutorial - LoRA & Fine Tuning - Base & Edit Model

Qwen Image Models Training - 0 to Hero Level Tutorial - LoRA & Fine Tuning - Base & Edit Model

2
Comments
7 min read
Qwen Image Base Model Training vs FLUX SRPO Training 20 images comparison

Qwen Image Base Model Training vs FLUX SRPO Training 20 images comparison

6
Comments
2 min read
Introduction to PyTorch

Introduction to PyTorch

Comments
7 min read
Real-Time Horn Detection and Noise Regulation System for Silence Zones

Real-Time Horn Detection and Noise Regulation System for Silence Zones

Comments
3 min read
Building Intelligent AI Agents with Modular Reinforcement Learning

Building Intelligent AI Agents with Modular Reinforcement Learning

Comments
13 min read
The Role of GPUs in Accelerating Deep Learning Training

The Role of GPUs in Accelerating Deep Learning Training

Comments
5 min read
Boosting Wan2.2 I2V Inference on 8 H100s — 2.5 Faster with Sequence Parallelism & Magcache

Boosting Wan2.2 I2V Inference on 8 H100s — 2.5 Faster with Sequence Parallelism & Magcache

1
Comments
3 min read
Why GPUs Are the Secret Weapon for Faster Deep Learning Training

Why GPUs Are the Secret Weapon for Faster Deep Learning Training

Comments
6 min read
Diagnosing layer sensitivity during post training quantization

Diagnosing layer sensitivity during post training quantization

6
Comments
4 min read
大模型微调:SFT

大模型微调:SFT

5
Comments
1 min read
Real-Time Face Recognition Attendance — QR Access & Google Sheets Integration

Real-Time Face Recognition Attendance — QR Access & Google Sheets Integration

Comments 1
3 min read
Zero-Degradation Training: 92% ImageNet-100 Accuracy with 61% Energy Savings

Zero-Degradation Training: 92% ImageNet-100 Accuracy with 61% Energy Savings

Comments
4 min read
AI News: Fri, Nov 07, 2025

AI News: Fri, Nov 07, 2025

3
Comments
6 min read
loading...