DEV Community

# llm

Posts

Per-customer cost attribution without a proxy

Comments
3 min read
Anthropic Just Did Something Unprecedented: They Kept a Model Because It Was Too Good at Hacking

Comments
3 min read
Agentic code review in production: orchestration, evaluation, and the cost of being wrong

6
Comments 5
4 min read
The Chinese Open-Source Model That Draws Pelicans Better Than GPT-4o

Comments
2 min read
GLM-5.1: The 754B Open Model That Writes Animated SVG

Comments
1 min read
LLM-as-Judge: using Claude to review a Gemini agent

Comments
7 min read
Six Principles for Agent Systems That Don't Hallucinate

2
Comments 6
11 min read
TurboQuant: How a Simple Spin Saves Gigabytes of GPU Memory

Comments
6 min read
99.8% of LLM Inference Power Isn't Spent on Computation

Comments
7 min read
Stop Paying Frontier Prices for Tasks a Local Model Handles Fine

Comments
3 min read
I shipped 14 MCP servers this week. Gemma 4 changes which ones matter.

Gemma 4 Challenge: Write about Gemma 4 Submission

3
Comments 2
6 min read
Claude Found Eleven Medical Errors in One Family's Records

Comments
9 min read
When Your AI Wiki Outgrows the Context Window — A Practical Guide to RAG

Comments
6 min read
Building a Voice-Controlled Local AI Agent on a 4GB GPU

Comments
3 min read
Authenticated, Authorized, and Still Unsafe: The Missing Layer in Agent Security

Comments
5 min read