DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
✨📊 🧠 The Ultimate Visual Guide to Large Language Models (LLMs)

✨📊 🧠 The Ultimate Visual Guide to Large Language Models (LLMs)

5
Comments
4 min read
I Added a Live Dashboard to My LLM Proxy. Zero Instrumentation. Just a URL Change.

I Added a Live Dashboard to My LLM Proxy. Zero Instrumentation. Just a URL Change.

Comments
3 min read
Your AI Has Two Brains: Fast Pattern Mode and the A11 Deep Reasoning Engine

Your AI Has Two Brains: Fast Pattern Mode and the A11 Deep Reasoning Engine

1
Comments
3 min read
Benchmarking the Claude Agent SDK on a local LLM: Haiku and Sonnet tier performance

Benchmarking the Claude Agent SDK on a local LLM: Haiku and Sonnet tier performance

Comments
6 min read
We Measured LLM Prompt Caching in Production — Same Prompt, 0% to 91% Hit Rates

We Measured LLM Prompt Caching in Production — Same Prompt, 0% to 91% Hit Rates

Comments
5 min read
Why DDR5 Bandwidth Kills Dual-LLM Inference on APUs (Benchmarks Inside)

Why DDR5 Bandwidth Kills Dual-LLM Inference on APUs (Benchmarks Inside)

Comments
7 min read
The AI Labs Found Product-Market Fit in April

The AI Labs Found Product-Market Fit in April

Comments
3 min read
The Reason Your AI Chatbot Feels Fast Has Nothing to Do With a Better Model

The Reason Your AI Chatbot Feels Fast Has Nothing to Do With a Better Model

Comments
6 min read
Demystifying the AI Wave: A Backend Engineer's Guide to LLMs, RAG, and Agents

Demystifying the AI Wave: A Backend Engineer's Guide to LLMs, RAG, and Agents

Comments
9 min read
Claude Opus 4.8: What Developers Need to Know About Anthropic's New Flagship

Claude Opus 4.8: What Developers Need to Know About Anthropic's New Flagship

1
Comments 1
3 min read
/align v0.8 — personal evals for Claude Code, maintained by an LLM agent

/align v0.8 — personal evals for Claude Code, maintained by an LLM agent

Comments 1
4 min read
Nobody on the internet knows if you are a human

Nobody on the internet knows if you are a human

Comments 1
4 min read
OpenAI Responses API vs Custom RAG: Cost, Latency and Control in 2026

OpenAI Responses API vs Custom RAG: Cost, Latency and Control in 2026

Comments
4 min read
How to Stop Your AI Agent Before It Does Something You Can't Undo

How to Stop Your AI Agent Before It Does Something You Can't Undo

Comments
4 min read
Harness Engineering for AI Agents

Harness Engineering for AI Agents

3
Comments 1
15 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.