DEV Community

Deep Learning

This tag is for discussion, articles, and questions primarily about deep learning, a subfield of machine learning.

Posts

Karpathy Autoresearch: 700 Experiments Rewire AI Research

6 min read
Neural Network Training - Simply Explained with a Mental Model

2 min read
Beyond the API Call: Engineering EloDtx, the Deep Learning Core of Baeyond

1 min read
AIGQ: Taobao's End-to-End Generative Architecture for E-commerce Query Recommendation

4 min read
Invited talk: Adversarial Attacks and Defenses in Deep Learning Systems: Threats, Mechanisms, and Countermeasures

1 min read
The Challenge of Unverifiable AI Rewards

7 min read
Transformers Are Not Dead — But Hybrids Are the Future. Here's Why.

13 min read
Beyond the Snore: Real-time Sleep Apnea Screening with OpenAI Whisper and PyTorch

4 min read
Adversarial Attacks and Defenses in Deep Learning Systems: Threats, Mechanisms, and Countermeasures

6 min read
Genesis: Teaching AI to Learn Like a Child (Patent Pending)

7 min read
Invited talk about: Adversarial Attacks and Defenses in Deep Learning Systems: Threats, Mechanisms, and Countermeasures

1 min read
Revisiting the Causal Mechanisms Behind Policy Gradients

5 min read
The Pervasive Role and Hidden Limitations of Softmax

6 min read
Mamba-3 and AttnRes: AI Architecture Research Is Finally Building for Inference, Not Just Training

7 min read
The Mathematics That Make 1.58-bit Weights Work: How BitNet b1.58 Survives Its Own Quantization

7 min read