DEV Community

# llm

Posts

Vision Models for OCR: When They Beat Tesseract and When They Don't

7 min read
How Should We Evaluate AI Coding Tools in Real Engineering Environments

4 min read
The LLM-shaped hole in your XGBoost pipeline

1 min read
How I cut my multi-turn LLM API costs by 90% (O(N²) → O(N))

2 min read
Six Principles in Practice: How an Agentic E2E Found 11 Production Bugs in 8 Runs

13 min read
Chunking in RAG: why your splitter matters more than your embedding model

5 min read
What MCP Really Is — A Demo You Can Run on Your Laptop in 5 Minutes

10 min read
Why your quantized LLM loses its MTP heads and how to keep them

5 min read
About Sharing Local Inference: A Marketplace for Renting Idle GPUs with an OpenAI-Compatible Backend

8 min read
Cut Your AI Agent Token Costs by 75% With One Skill Plugin

2 min read
Why most LLM API usage is quietly inefficient

4 min read
Qwen sky proof: compressed memory made a tiny model behave better — with the receipts

1 min read
The 8B Model That Punches at 32B Weight

2 min read
Hermes Agent CLI cheat sheet — commands, flags, and slash shortcuts

8 min read
The "Chat" API is a Token Tax: Why we must return to Stateless Completions

2 min read