DEV Community

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Majestic Labs vs. the Memory Wall

Majestic Labs vs. the Memory Wall

6
Comments
5 min read
Tokenization in NLP: The Foundational Step That Turns Language Into Data

Tokenization in NLP: The Foundational Step That Turns Language Into Data

Comments
3 min read
Linear Algebra for AI — Part 1

Linear Algebra for AI — Part 1

1
Comments
2 min read
🦄 When ML Models Go Wild: Unintentional Art Created by Neural Networks

🦄 When ML Models Go Wild: Unintentional Art Created by Neural Networks

Comments 1
5 min read
Transformers and Attention: How LLMs Actually Process Text

Transformers and Attention: How LLMs Actually Process Text

5
Comments
18 min read
Building a 75,000-Product Image Feature Dataset for the Amazon ML Challenge 2025

Building a 75,000-Product Image Feature Dataset for the Amazon ML Challenge 2025

1
Comments
4 min read
DragonMemory: Neural Sequence Compression for Production RAG

DragonMemory: Neural Sequence Compression for Production RAG

3
Comments
8 min read
Nested Learning — My Reflections on a Model That Learns How to Learn

Nested Learning — My Reflections on a Model That Learns How to Learn

3
Comments
8 min read
DeepSeek-OCR: When a Picture Is Actually Worth 10 Fewer Tokens

DeepSeek-OCR: When a Picture Is Actually Worth 10 Fewer Tokens

7
Comments
9 min read
AI Fundamentals: Puerta a la Inteligencia Artificial

AI Fundamentals: Puerta a la Inteligencia Artificial

5
Comments
3 min read
Speculative Decoding: Making LLMs Faster Without Sacrificing Quality

Speculative Decoding: Making LLMs Faster Without Sacrificing Quality

1
Comments
14 min read
🧠 The Simplest Neural Network That Actually Works

🧠 The Simplest Neural Network That Actually Works

Comments
1 min read
Setting Up NVIDIA Parakeet TDT 0.6B v3 for Speech Recognition on AWS EC2 Ubuntu

Setting Up NVIDIA Parakeet TDT 0.6B v3 for Speech Recognition on AWS EC2 Ubuntu

1
Comments
8 min read
DeepSeek-OCR: Contexts Optical Compression

DeepSeek-OCR: Contexts Optical Compression

Comments
1 min read
Seeing Shapes: Unveiling Neural Network Vision with Fourier Geometry by Arvind Sundararajan

Seeing Shapes: Unveiling Neural Network Vision with Fourier Geometry by Arvind Sundararajan

Comments
2 min read
ConsistEdit: Highly Consistent and Precise Training-free Visual Editing

ConsistEdit: Highly Consistent and Precise Training-free Visual Editing

Comments
1 min read
Understanding Deep Learning: The Basics of Neural Networks

Understanding Deep Learning: The Basics of Neural Networks

Comments
4 min read
Building a Food Classifier: What I Learned from Overfitting

Building a Food Classifier: What I Learned from Overfitting

2
Comments
14 min read
RAG Evaluation Best Practices for Reliable Retrieval Systems

RAG Evaluation Best Practices for Reliable Retrieval Systems

2
Comments
3 min read
LLMs Speaking in Tongues: Unlocking Direct Semantic Exchange

LLMs Speaking in Tongues: Unlocking Direct Semantic Exchange

Comments
2 min read
Case Study on use of AI in Tesla Autopilot

Case Study on use of AI in Tesla Autopilot

Comments
1 min read
The Phoenix of Neural Networks: Training Sparse Networks from Scratch

The Phoenix of Neural Networks: Training Sparse Networks from Scratch

3
Comments
12 min read
Labellerr SDK & YOLOv8: Cars and Number Plate Detection Practical, Step-by-Step

Labellerr SDK & YOLOv8: Cars and Number Plate Detection Practical, Step-by-Step

Comments 1
17 min read
AI Art Turbocharged: Differentiable Diffusion for Hyper-Realistic Results

AI Art Turbocharged: Differentiable Diffusion for Hyper-Realistic Results

Comments
2 min read
Vision Transformer (ViT) from Scratch in PyTorch

Vision Transformer (ViT) from Scratch in PyTorch

1
Comments
2 min read
loading...