DEV Community

Deep Learning

This tag is for discussing, sharing articles, and asking questions primarily on deep learning - a subfield of machine learning.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
The Hidden Geometry of AI: A Scale-Free Secret to Smarter Networks

The Hidden Geometry of AI: A Scale-Free Secret to Smarter Networks

Comments
2 min read
My Model Cheated: How Grad-CAM Exposed a 95% Accuracy Lie

My Model Cheated: How Grad-CAM Exposed a 95% Accuracy Lie

Comments
3 min read
I trained a Robot Arm: What I failed to learn.

I trained a Robot Arm: What I failed to learn.

4
Comments 4
3 min read
Open-Weight AI for High-Quality Image Generation & Editing

Open-Weight AI for High-Quality Image Generation & Editing

Comments
4 min read
Tame Your LLMs: A New Optimizer for Robust Deep Learning

Tame Your LLMs: A New Optimizer for Robust Deep Learning

Comments
2 min read
🚀 How I Cut Deep Learning Training Time by 45% — Without Upgrading Hardware

🚀 How I Cut Deep Learning Training Time by 45% — Without Upgrading Hardware

1
Comments
3 min read
Surgical Precision with AI: A New Era in Lung Cancer Staging

Surgical Precision with AI: A New Era in Lung Cancer Staging

Comments
2 min read
Anon: The Adaptive Optimizer Bridging SGD and Adam for Peak AI Performance

Anon: The Adaptive Optimizer Bridging SGD and Adam for Peak AI Performance

Comments
2 min read
Turbocharge Your LLMs: A Breakthrough in Neural Network Optimization

Turbocharge Your LLMs: A Breakthrough in Neural Network Optimization

Comments
2 min read
Introducing PQNT — A New Power-Law Quantization Method

Introducing PQNT — A New Power-Law Quantization Method

Comments
1 min read
Unveiling the Hidden Geometry That Supercharges Neural Nets

Unveiling the Hidden Geometry That Supercharges Neural Nets

Comments
2 min read
How Search Engines Actually Answer Your Questions

How Search Engines Actually Answer Your Questions

Comments
11 min read
Giving AI Eyes: Multi-Modal LLMs

Giving AI Eyes: Multi-Modal LLMs

Comments
9 min read
BATCHNORM IN LANGUAGE MODELS

BATCHNORM IN LANGUAGE MODELS

Comments
16 min read
Unlocking Data's Hidden Geometry: A New Era for Neural Networks by Arvind Sundararajan

Unlocking Data's Hidden Geometry: A New Era for Neural Networks by Arvind Sundararajan

Comments
2 min read
Cross-Modal Embeddings: Bridging AI Modalities

Cross-Modal Embeddings: Bridging AI Modalities

Comments
11 min read
Observations from Finetuning Gemma Model on Strix Halo (Fedora 43)

Observations from Finetuning Gemma Model on Strix Halo (Fedora 43)

Comments
3 min read
Stock Price Prediction by ML Models

Stock Price Prediction by ML Models

Comments
1 min read
AI vs ML vs DL vs GenAI: Demystifying the Buzzwords

AI vs ML vs DL vs GenAI: Demystifying the Buzzwords

1
Comments 2
3 min read
Fixing Identity Drift in AI Image Generation with a Deterministic Constraint Layer (Minimal PoC Inside)

Fixing Identity Drift in AI Image Generation with a Deterministic Constraint Layer (Minimal PoC Inside)

Comments
2 min read
How I Reached 84.35% on CIFAR-100 Using ResNet-50 (PyTorch Guide)

How I Reached 84.35% on CIFAR-100 Using ResNet-50 (PyTorch Guide)

Comments
2 min read
Attention Mechanism in Transformers: The Core Idea Behind Modern AI

Attention Mechanism in Transformers: The Core Idea Behind Modern AI

5
Comments
2 min read
Star Multi-Class Classification Neural Network With Pytorch

Star Multi-Class Classification Neural Network With Pytorch

Comments
12 min read
A cleaner, safer, plug-and-play NanoGPT

A cleaner, safer, plug-and-play NanoGPT

Comments
1 min read
LANGUAGE MODELS USING MLP (Part 1)

LANGUAGE MODELS USING MLP (Part 1)

Comments
15 min read
loading...