DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
I tested cheap vs expensive LLMs across 3 real agent tasks. The cheap model won every time.

I tested cheap vs expensive LLMs across 3 real agent tasks. The cheap model won every time.

Comments
4 min read
Measuring AI Gateway Failover: 30 Days of Production Data

Measuring AI Gateway Failover: 30 Days of Production Data

Comments
3 min read
Agent Series (5): Intent Recognition and Routing — Making Agents Actually Understand Users

Agent Series (5): Intent Recognition and Routing — Making Agents Actually Understand Users

1
Comments
11 min read
Routing diffusion inference traffic across three providers

Routing diffusion inference traffic across three providers

Comments
4 min read
ToolRouter: Switch AI Coding Tools Freely Without Losing Context

ToolRouter: Switch AI Coding Tools Freely Without Losing Context

2
Comments
6 min read
Beyond the Stateless Prompt: Building an Auditable Product Intelligence Pipeline with Cascadeflow and Hindsight

Beyond the Stateless Prompt: Building an Auditable Product Intelligence Pipeline with Cascadeflow and Hindsight

Comments
5 min read
Putting an LLM Gateway in Front of Our Build Agents

Putting an LLM Gateway in Front of Our Build Agents

Comments
4 min read
You Probably Don't Need 8-Bit Quantization

You Probably Don't Need 8-Bit Quantization

Comments
2 min read
The README Was a Protocol. The Entrypoint Was Still Optional.

The README Was a Protocol. The Entrypoint Was Still Optional.

Comments
8 min read
How to Build a Local LLM Agent to Automate Work List Generation from Monthly Reports (With Jira Integration)

How to Build a Local LLM Agent to Automate Work List Generation from Monthly Reports (With Jira Integration)

1
Comments
8 min read
End-to-End Observability for vLLM and TGI: from DCGM to Tokens

End-to-End Observability for vLLM and TGI: from DCGM to Tokens

Comments
13 min read
AI 週報 — 2026-05-15 to 2026-05-22 | 當 IPO 傳聞撞上 27 萬人部署規模

AI 週報 — 2026-05-15 to 2026-05-22 | 當 IPO 傳聞撞上 27 萬人部署規模

Comments
3 min read
Your No-Code AI Agent Has a Memory Problem

Your No-Code AI Agent Has a Memory Problem

1
Comments
2 min read
RAG Series (24): Code RAG — Teaching AI to Understand Your Codebase

RAG Series (24): Code RAG — Teaching AI to Understand Your Codebase

Comments
7 min read
We Connected an LLM to a 12-Year-Old Codebase. Here's What Broke.

We Connected an LLM to a 12-Year-Old Codebase. Here's What Broke.

Comments
5 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.