DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Compass v1.1.0 · we shipped a memory plugin that catches its own consumption drift

Compass v1.1.0 · we shipped a memory plugin that catches its own consumption drift

Comments
5 min read
How to Add Caching to Any AutoGen Workflow in 2 Lines

How to Add Caching to Any AutoGen Workflow in 2 Lines

Comments
2 min read
What Building a Multi-Model AI Gateway Taught Me About Reliability

What Building a Multi-Model AI Gateway Taught Me About Reliability

Comments 1
8 min read
HIPAA Compliance, AI Abuse, and Vaccine Advances

HIPAA Compliance, AI Abuse, and Vaccine Advances

Comments
2 min read
LLM Smells: The Tells in AI Writing, and the Costlier Ones in AI Code

LLM Smells: The Tells in AI Writing, and the Costlier Ones in AI Code

Comments
5 min read
Taxonomy Surgery, Cosine = 1.0000, and Making Routing Disappear into Infrastructure

Taxonomy Surgery, Cosine = 1.0000, and Making Routing Disappear into Infrastructure

3
Comments
5 min read
Cut 70%+ LLM API Expense with Qwen-Turbo & DeepSeek: Real Pricing & Optimization Case

Cut 70%+ LLM API Expense with Qwen-Turbo & DeepSeek: Real Pricing & Optimization Case

Comments
2 min read
I Fuzzed 12 LLMs With 19 Payloads — Here What Broke

I Fuzzed 12 LLMs With 19 Payloads — Here What Broke

Comments
2 min read
Why Coding Stays in Human-AI Collaboration: A Paradox in Stanford's 51 Deployments

Why Coding Stays in Human-AI Collaboration: A Paradox in Stanford's 51 Deployments

2
Comments 1
14 min read
Stop hand-coding the Japanese Rokuyo calendar: LLM-generated lunar logic silently breaks

Stop hand-coding the Japanese Rokuyo calendar: LLM-generated lunar logic silently breaks

Comments
6 min read
SOC-in-a-Box: One LLM, Eight Hats, A Production-Bar AI SOC on a Single GPU

SOC-in-a-Box: One LLM, Eight Hats, A Production-Bar AI SOC on a Single GPU

Comments
11 min read
Why Self-Hosted Claude Code Was 15x Slower Than It Should Be

Why Self-Hosted Claude Code Was 15x Slower Than It Should Be

Comments
10 min read
Teaching a Reranker the Language of Security Tickets (+41% MRR@10)

Teaching a Reranker the Language of Security Tickets (+41% MRR@10)

Comments
9 min read
Three Chat Template Patterns That Silently Kill Your Prompt Cache

Three Chat Template Patterns That Silently Kill Your Prompt Cache

Comments
7 min read
AI vs Human: An Honest Scorecard

AI vs Human: An Honest Scorecard

6
Comments
4 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.