DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Stop Letting AI Agents Break Your Database: Transactional Multi-Agent Workflows with Temporal and Spring AI

Stop Letting AI Agents Break Your Database: Transactional Multi-Agent Workflows with Temporal and Spring AI

Comments
2 min read
Why I Use TRAE: Free LLMs, Stability, and 1M Token Context

Why I Use TRAE: Free LLMs, Stability, and 1M Token Context

Comments
3 min read
Cola DLM — Text Generation That Plans Before It Writes

Cola DLM — Text Generation That Plans Before It Writes

Comments
4 min read
Your "Claude Opus" API Might Not Be Claude Opus

Your "Claude Opus" API Might Not Be Claude Opus

Comments
4 min read
LLM output validation: 5 patterns that actually work in production

LLM output validation: 5 patterns that actually work in production

Comments
6 min read
Fine-tuning vs RAG: a decision framework with examples

Fine-tuning vs RAG: a decision framework with examples

Comments
6 min read
How to Stop Your LLM Agent From Looping Itself Into Oblivion

How to Stop Your LLM Agent From Looping Itself Into Oblivion

Comments
5 min read
Prefix caching in vLLM under multi-tenant agent traffic

Prefix caching in vLLM under multi-tenant agent traffic

Comments 1
4 min read
I used LLMs to rewrite meta descriptions for 1,600 articles — honest results

I used LLMs to rewrite meta descriptions for 1,600 articles — honest results

Comments
5 min read
Using Qwen 3.6 Plus: Great but a Bit Expensive

Using Qwen 3.6 Plus: Great but a Bit Expensive

Comments
2 min read
Our AI Inference Bill Dropped 65% After We Stopped Treating Every Query the Same

Our AI Inference Bill Dropped 65% After We Stopped Treating Every Query the Same

Comments
5 min read
What "Subquadratic Attention" Actually Means

What "Subquadratic Attention" Actually Means

Comments
4 min read
Qwen 3.6 & llama.cpp Push Local Inference Limits on Consumer GPUs

Qwen 3.6 & llama.cpp Push Local Inference Limits on Consumer GPUs

Comments
3 min read
AI Weekly — 2026-05-15 to 2026-05-22 | The Agentic Inflection Is Real, But the Enterprise Gap Is Wider Than Ever

AI Weekly — 2026-05-15 to 2026-05-22 | The Agentic Inflection Is Real, But the Enterprise Gap Is Wider Than Ever

Comments
4 min read
I tested cheap vs expensive LLMs across 3 real agent tasks. The cheap model won every time.

I tested cheap vs expensive LLMs across 3 real agent tasks. The cheap model won every time.

Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.