DEV Community

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily about deep learning, a subfield of machine learning.

Posts

Revisiting the Causal Mechanisms Behind Policy Gradients

5 min read
Mamba-3 and AttnRes: AI Architecture Research Is Finally Building for Inference, Not Just Training

7 min read
Why I built a Rust deep learning framework (and what I got wrong twice first)

5 min read
The Mathematics That Make 1.58-bit Weights Work: How BitNet b1.58 Survives Its Own Quantization

7 min read
I Tried to Run VGG19 on a CPU… It Failed. So I Fixed It."

3 min read
Implementing ✨ Bayesian Belief Tracking in LLM Agents 🤖

4 min read
I built a real-time Drivable Area Segmentation model for Indian roads (Here is how it runs at 55 FPS)

2 min read
My Study Notes on Convolutional Neural Networks (CNN)

3 min read
RTX 4090 vs RTX 3090 for AI/ML: Is the Upgrade Worth It?

3 min read
arXiv Survey Maps KV Cache Optimization Landscape: 5 Strategies for Million-Token LLM Inference

6 min read
AI News This Week: Breaking Down the Latest Developments in Multimodal Large Language Models

5 min read
Recurrent Neural Networks: Giving Networks Memory

5 min read
Defending Vibe Coding: Why Syntax Might Not Be the Bottleneck Anymore

2 min read
Beyond ReconVLA: Annotation-Free Visual Grounding via Language-Attention Masked Reconstruction

9 min read
Navigating the Search Space: A Guide from BFS to Deep Learning

3 min read