DEV Community

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

arXiv Survey Maps KV Cache Optimization Landscape: 5 Strategies for Million-Token LLM Inference

1
Comments
6 min read
Beyond ReconVLA: Annotation-Free Visual Grounding via Language-Attention Masked Reconstruction

1
Comments 2
9 min read
Building an LLM From Scratch for Indic Languages: What No One Tells You About the Hard Parts

1
Comments
19 min read
Navigating the Search Space: A Guide from BFS to Deep Learning

1
Comments 2
3 min read
I implemented Deep Q-Networks in pure PowerShell 5.1 — and connected them to real Windows enterprise data

3
Comments
1 min read
Bridging the Silence: Building a Real-Time Sign Language Translator (Part 1)

Comments
3 min read
Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation

1
Comments
1 min read
From Perceptrons to Predicting the Next Word

1
Comments
9 min read
15 Architecture Experiments: Training a GPT-2 Style Model on Vast.ai for $10

1
Comments
4 min read
Architecting Guardian-AI: Multi-Layered Content Integrity Filters for Autonomous Publishing

Comments
7 min read
Bounding Box Augmentation for Object Detection with Albumentations

5
Comments
8 min read
Attention Is All You Need — Explained Like You’re Building It From Scratch

1
Comments
2 min read
Predicting Traffic in the City of Buffalo Using a Neural Network

1
Comments
1 min read
Standard Transformer Attention vs. Attention-Residuals: A Practical Comparison

Comments
5 min read
It's Not Smarter Models — It's Cheaper Memory: TurboQuant's Real Impact, Wall Street Panic & Academic Storm

1
Comments
7 min read