DEV Community

# llm

Posts

China AI Roundup: May 2026 – DeepSeek Cuts Prices, Qwen 3.7 Launches, Embodied AI Explodes

Comments
5 min read
I gave up on making my AI builder write good media queries

Comments
5 min read
Fine-Tuning Qwen2.5-0.5B to Write SRE Post-Mortem Summaries

Comments
4 min read
Qualix: semantic coverage gates for AI-generated code

Comments
3 min read
How to Integrate AI and LLMs into Production Web Apps (Lessons from the Field)

Comments
5 min read
Your AI Has Two Brains: Fast Pattern Mode and the A11 Deep Reasoning Engine

Comments
3 min read
llms.txt and GEO in 2026: How to Get Your Site Cited by AI Search

Comments 1
9 min read
We Measured LLM Prompt Caching in Production — Same Prompt, 0% to 91% Hit Rates

Comments
5 min read
Claude Opus 4.8: What Developers Need to Know About Anthropic's New Flagship

Comments 1
3 min read
/align v0.8 — personal evals for Claude Code, maintained by an LLM agent

Comments 1
4 min read
OWASP LLM Top 10 in Production: How I Audited My TypeScript Agent Pipeline Against All 10 Risks — and What I Found

Comments
9 min read
Nobody on the internet knows if you are a human

Comments 1
4 min read
OpenAI Responses API vs Custom RAG: Cost, Latency and Control in 2026

Comments
4 min read
Tracking token usage across OpenAI, Anthropic, and Gemini: every streaming gotcha I hit

Comments 1
5 min read
Escalate the Model, Not the Conversation

Comments
2 min read