DEV Community

Deep Learning

This tag is for discussions, articles, and questions primarily about deep learning, a subfield of machine learning.

Posts

Understanding Decoder-Only Transformers Part 1: Masked Self-Attention

15
Comments
1 min read
When Can You Actually Trust a Machine Learning Model?

Comments
2 min read
It's Not That Models Got Stronger, It's That Memory Got Cheaper: The Technical Truth Behind TurboQuant, Wall Street Panic, and an Academic Storm

Comments
3 min read
Devtrails Hackathon (phase-1)

Comments
1 min read
ReCUBE Benchmark Reveals GPT-5 Scores Only 37.6% on Repository-Level Code Generation

Comments
6 min read
The Frozen Context Pattern: Adding State to Deep Equilibrium Models

1
Comments
5 min read
80% of LLM 'Thinking' Is a Lie — What CoT Faithfulness Research Actually Shows

Comments
7 min read
Building a Trading Bot That Could Turn $10K into $102K: xLSTM (DL) + PPO (RL)

5
Comments
5 min read
What Cursor's 8GB Storage Bloat Teaches Us About Claude Code's Clean Architecture

Comments 1
3 min read
Deep Learning for Image Classification: Ensemble CNN Architectures with Test Time Augmentation

Comments
5 min read
Atari Deep Q-Network Project

Comments
1 min read
Understanding AI Image-to-Video: How It Actually Works

Comments 1
2 min read
Google Cloud TPU Architecture Versions Explained: From v1 to the Eighth Generation

5
Comments
10 min read
Advances in Artificial Intelligence Architectures

1
Comments 1
2 min read