DEV Community

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Bayesian Neural Networks Under Covariate Shift: When Theory Fails Practice

Bayesian Neural Networks Under Covariate Shift: When Theory Fails Practice

Comments
6 min read
Cross-Modal Embeddings: Bridging AI Modalities

Cross-Modal Embeddings: Bridging AI Modalities

Comments
11 min read
Observations from Finetuning Gemma Model on Strix Halo (Fedora 43)

Observations from Finetuning Gemma Model on Strix Halo (Fedora 43)

Comments
3 min read
Stock Price Prediction by ML Models

Stock Price Prediction by ML Models

Comments
1 min read
AI vs ML vs DL vs GenAI: Demystifying the Buzzwords

AI vs ML vs DL vs GenAI: Demystifying the Buzzwords

1
Comments 2
3 min read
Fixing Identity Drift in AI Image Generation with a Deterministic Constraint Layer (Minimal PoC Inside)

Fixing Identity Drift in AI Image Generation with a Deterministic Constraint Layer (Minimal PoC Inside)

Comments
2 min read
The Engineering History of AI: Why Your LLM Hallucinations Are as Old as the 13th Century

The Engineering History of AI: Why Your LLM Hallucinations Are as Old as the 13th Century

1
Comments
5 min read
Transformers Explained

Transformers Explained

6
Comments
1 min read
Attention Mechanism in Transformers: The Core Idea Behind Modern AI

Attention Mechanism in Transformers: The Core Idea Behind Modern AI

5
Comments
2 min read
Star Multi-Class Classification Neural Network With Pytorch

Star Multi-Class Classification Neural Network With Pytorch

Comments
12 min read
A cleaner, safer, plug-and-play NanoGPT

A cleaner, safer, plug-and-play NanoGPT

Comments
1 min read
LANGUAGE MODELS USING MLP (Part 1)

LANGUAGE MODELS USING MLP (Part 1)

Comments
15 min read
Inside ChatGPT: Deconstructing "Attention Is All You Need" (Part 1)

Inside ChatGPT: Deconstructing "Attention Is All You Need" (Part 1)

5
Comments
5 min read
Risk Assessment in Fake-News Detection Using Advanced NLP and Deep Learning

Risk Assessment in Fake-News Detection Using Advanced NLP and Deep Learning

Comments
32 min read
Decoder-Only Transformers: The Architecture Behind GPT Models

Decoder-Only Transformers: The Architecture Behind GPT Models

Comments
5 min read
Why Studying the Turing Machine Changed How I See AI And Why Every New AI Engineer Should Revisit It

Why Studying the Turing Machine Changed How I See AI And Why Every New AI Engineer Should Revisit It

2
Comments
3 min read
🧠💥 “Linear Algebra Ruined My Life (and Made Me Better at AI)”

🧠💥 “Linear Algebra Ruined My Life (and Made Me Better at AI)”

5
Comments
4 min read
Stock Price Prediction

Stock Price Prediction

Comments
1 min read
Building an Enhanced PPO Trading Bot with Real-Time Data Sync and IBKR Integration

Building an Enhanced PPO Trading Bot with Real-Time Data Sync and IBKR Integration

4
Comments
7 min read
The Evolution of Sequential Learning Models: RNN LSTM Transformers

The Evolution of Sequential Learning Models: RNN LSTM Transformers

Comments
2 min read
Devtool for running and benchmarking local AI

Devtool for running and benchmarking local AI

2
Comments
3 min read
Meet X-AnyLabeling: The Python-native, AI-powered Annotation Tool for Modern CV 🚀

Meet X-AnyLabeling: The Python-native, AI-powered Annotation Tool for Modern CV 🚀

Comments 2
3 min read
BIGRAM LANGUAGE MODELS USING A NEURAL NET

BIGRAM LANGUAGE MODELS USING A NEURAL NET

Comments
14 min read
The Math Behind Machine Learning & Deep Learning (Explained Simply)

The Math Behind Machine Learning & Deep Learning (Explained Simply)

Comments
3 min read
LLM Concepts (Explained Without Making Your Brain Hurt): What Every Developer Should Know

LLM Concepts (Explained Without Making Your Brain Hurt): What Every Developer Should Know

1
Comments
4 min read
loading...