DEV Community

Machine Learning

A branch of artificial intelligence (AI) and computer science which focuses on the use of data and algorithms to imitate the way that humans learn, gradually improving its accuracy.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
OpenAI Killed Sora in Six Months. It Burned $15 Million a Day and Made Almost Nothing.

OpenAI Killed Sora in Six Months. It Burned $15 Million a Day and Made Almost Nothing.

Comments
4 min read
Como comprimir o KV cache do seu LLM em 33x sem treino

Como comprimir o KV cache do seu LLM em 33x sem treino

Comments
3 min read
KV cache memory calculator: how much does your LLM actually use?

KV cache memory calculator: how much does your LLM actually use?

Comments
3 min read
How to benchmark NexusQuant on your own model

How to benchmark NexusQuant on your own model

Comments
3 min read
How Much GPU Memory Does NexusQuant Actually Save?

How Much GPU Memory Does NexusQuant Actually Save?

Comments
4 min read
How to Spot a "Lemon": The Intuitive Logic Behind Decision Trees

How to Spot a "Lemon": The Intuitive Logic Behind Decision Trees

Comments
2 min read
Image Prompt Packaging Cuts Multimodal Inference Costs Up to 91%

Image Prompt Packaging Cuts Multimodal Inference Costs Up to 91%

Comments
6 min read
Why E8 lattice quantization beats scalar quantization for KV caches

Why E8 lattice quantization beats scalar quantization for KV caches

Comments
2 min read
The Circuit That Knows Itself

The Circuit That Knows Itself

Comments
6 min read
VRAM Is the New RAM — A Practical Guide to Running Large Language Models on Consumer GPUs

VRAM Is the New RAM — A Practical Guide to Running Large Language Models on Consumer GPUs

Comments
5 min read
Balancing Theory and Practice: Addressing the Shift in Machine Learning Research Focus

Balancing Theory and Practice: Addressing the Shift in Machine Learning Research Focus

Comments
18 min read
ML-based LLM request classifier for cost-optimized routing (~2ms inference)

ML-based LLM request classifier for cost-optimized routing (~2ms inference)

Comments
1 min read
Weaviate — Deep Dive

Weaviate — Deep Dive

Comments
14 min read
"How We Run AI Inference on $0/month (And Still Ship Fast)"

"How We Run AI Inference on $0/month (And Still Ship Fast)"

Comments
2 min read
I Built Free GenAI & ML Notes for Beginners (Hinglish +English+ Practical)

I Built Free GenAI & ML Notes for Beginners (Hinglish +English+ Practical)

Comments
1 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.