DEV Community

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Cross-Modal Embeddings: Bridging AI Modalities

Cross-Modal Embeddings: Bridging AI Modalities

Comments
11 min read
Observations from Finetuning Gemma Model on Strix Halo (Fedora 43)

Observations from Finetuning Gemma Model on Strix Halo (Fedora 43)

Comments
3 min read
Stock Price Prediction by ML Models

Stock Price Prediction by ML Models

Comments
1 min read
Fixing Identity Drift in AI Image Generation with a Deterministic Constraint Layer (Minimal PoC Inside)

Fixing Identity Drift in AI Image Generation with a Deterministic Constraint Layer (Minimal PoC Inside)

Comments
2 min read
How I Reached 84.35% on CIFAR-100 Using ResNet-50 (PyTorch Guide)

How I Reached 84.35% on CIFAR-100 Using ResNet-50 (PyTorch Guide)

Comments
2 min read
Inside ChatGPT: Deconstructing "Attention Is All You Need" (Part 1)

Inside ChatGPT: Deconstructing "Attention Is All You Need" (Part 1)

5
Comments
5 min read
Attention Mechanism in Transformers: The Core Idea Behind Modern AI

Attention Mechanism in Transformers: The Core Idea Behind Modern AI

5
Comments
2 min read
Transformers and Attention: How LLMs Actually Process Text

Transformers and Attention: How LLMs Actually Process Text

4
Comments
18 min read
Star Multi-Class Classification Neural Network With Pytorch

Star Multi-Class Classification Neural Network With Pytorch

Comments
12 min read
A cleaner, safer, plug-and-play NanoGPT

A cleaner, safer, plug-and-play NanoGPT

Comments
1 min read
LANGUAGE MODELS USING MLP (Part 1)

LANGUAGE MODELS USING MLP (Part 1)

Comments
15 min read
Decoder-Only Transformers: The Architecture Behind GPT Models

Decoder-Only Transformers: The Architecture Behind GPT Models

Comments
5 min read
Getting Started with Azure: Create and Configure a Windows 10 Virtual Machine

Getting Started with Azure: Create and Configure a Windows 10 Virtual Machine

Comments
4 min read
🧠💥 “Linear Algebra Ruined My Life (and Made Me Better at AI)”

🧠💥 “Linear Algebra Ruined My Life (and Made Me Better at AI)”

5
Comments
4 min read
Setting Up NVIDIA Parakeet TDT 0.6B v3 for Speech Recognition on AWS EC2 Ubuntu

Setting Up NVIDIA Parakeet TDT 0.6B v3 for Speech Recognition on AWS EC2 Ubuntu

Comments
8 min read
Stock Price Prediction

Stock Price Prediction

Comments
1 min read
Building an Enhanced PPO Trading Bot with Real-Time Data Sync and IBKR Integration

Building an Enhanced PPO Trading Bot with Real-Time Data Sync and IBKR Integration

4
Comments
7 min read
The Evolution of Sequential Learning Models: RNN LSTM Transformers

The Evolution of Sequential Learning Models: RNN LSTM Transformers

Comments
2 min read
Devtool for running and benchmarking local AI

Devtool for running and benchmarking local AI

2
Comments
3 min read
Speculative Decoding: Making LLMs Faster Without Sacrificing Quality

Speculative Decoding: Making LLMs Faster Without Sacrificing Quality

1
Comments
14 min read
BIGRAM LANGUAGE MODELS USING A NEURAL NET

BIGRAM LANGUAGE MODELS USING A NEURAL NET

Comments
14 min read
DragonMemory: Neural Sequence Compression for Production RAG

DragonMemory: Neural Sequence Compression for Production RAG

2
Comments
8 min read
LLM Concepts (Explained Without Making Your Brain Hurt): What Every Developer Should Know

LLM Concepts (Explained Without Making Your Brain Hurt): What Every Developer Should Know

1
Comments
4 min read
Qwen Image Models Training - 0 to Hero Level Tutorial - LoRA & Fine Tuning - Base & Edit Model

Qwen Image Models Training - 0 to Hero Level Tutorial - LoRA & Fine Tuning - Base & Edit Model

2
Comments
7 min read
Qwen Image Base Model Training vs FLUX SRPO Training 20 images comparison

Qwen Image Base Model Training vs FLUX SRPO Training 20 images comparison

6
Comments
2 min read
loading...