DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
An LLM benchmark is only useful for as long as it's hard

An LLM benchmark is only useful for as long as it's hard

2
Comments
10 min read
Claude Fable 5: Caching, Tokenizer & Cost vs Opus 4.6

Claude Fable 5: Caching, Tokenizer & Cost vs Opus 4.6

Comments
7 min read
How to Build an AI Coding Stack Without Going Broke in 2026

How to Build an AI Coding Stack Without Going Broke in 2026

Comments
5 min read
RAG Rerank: the Highest-Leverage Upgrade to Your Retrieval Pipeline

RAG Rerank: the Highest-Leverage Upgrade to Your Retrieval Pipeline

2
Comments
3 min read
Repair Agents, Memory OS, Interview Copilot, Alignment Insights, Multimodal Flow, and CVS AI Academy

Repair Agents, Memory OS, Interview Copilot, Alignment Insights, Multimodal Flow, and CVS AI Academy

Comments
2 min read
Echo: results so far

Echo: results so far

2
Comments
6 min read
The "open-source NotebookLM" lie — and the one repo that actually earns the label

The "open-source NotebookLM" lie — and the one repo that actually earns the label

Comments
4 min read
Stop Syncing Elasticsearch: Native Hybrid Search with Spring AI and Pgvector sparsevec

Stop Syncing Elasticsearch: Native Hybrid Search with Spring AI and Pgvector sparsevec

Comments
2 min read
We Gave AI the Keys. Nobody Asked If It Knows How to Drive.

We Gave AI the Keys. Nobody Asked If It Knows How to Drive.

Comments
4 min read
Gemma 4 QAT on a 1080 Ti: What 'Quantization-Aware' Actually Buys — and Fitting the 12B on 8 GB at 16k

Gemma 4 QAT on a 1080 Ti: What 'Quantization-Aware' Actually Buys — and Fitting the 12B on 8 GB at 16k

Comments
5 min read
RAG vs Fine-Tuning: Which Approach Should You Choose?

RAG vs Fine-Tuning: Which Approach Should You Choose?

Comments
3 min read
Quantization formats compared: GGUF vs GPTQ vs AWQ vs NF4

Quantization formats compared: GGUF vs GPTQ vs AWQ vs NF4

Comments
7 min read
The 'Own Hardware for AI' Myth

The 'Own Hardware for AI' Myth

Comments
11 min read
What Are Tokens and Why Do They Matter in LLMs?

What Are Tokens and Why Do They Matter in LLMs?

2
Comments
3 min read
エージェントAPI代、月数万円になってない?マルチモデルルーティングでコストを10分の1にする実践ガイド

エージェントAPI代、月数万円になってない?マルチモデルルーティングでコストを10分の1にする実践ガイド

Comments
2 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.