DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
DiffusionGemma: How Google's New Open LLM Hits 1,000 Tokens/sec and Changes Inference Economics

DiffusionGemma: How Google's New Open LLM Hits 1,000 Tokens/sec and Changes Inference Economics

5
Comments
4 min read
Email Triage Taxonomies for LLM Classification

Email Triage Taxonomies for LLM Classification

2
Comments
5 min read
I stopped trusting “same answers, fewer tokens” after watching an agent lose 1 field name and burn 3 hours

I stopped trusting “same answers, fewer tokens” after watching an agent lose 1 field name and burn 3 hours

Comments
7 min read
How to Build Production-Ready Generative AI Development Services for Enterprise Applications

How to Build Production-Ready Generative AI Development Services for Enterprise Applications

Comments
4 min read
A Frontier Model Goes Dark: AI Week of June 16, 2026

A Frontier Model Goes Dark: AI Week of June 16, 2026

Comments
23 min read
LLM cost reduction techniques ranked by ROI: the 5 that matter, the 9 that don't (much)

LLM cost reduction techniques ranked by ROI: the 5 that matter, the 9 that don't (much)

Comments
12 min read
Stop explaining yourself to Claude

Stop explaining yourself to Claude

Comments
3 min read
Multi-Turn Email Conversations for LLM Agents

Multi-Turn Email Conversations for LLM Agents

1
Comments
4 min read
The best bug reports were written by the suspect

The best bug reports were written by the suspect

1
Comments 1
1 min read
Shipping an LLM-powered "lie detector" in a Flutter dating app

Shipping an LLM-powered "lie detector" in a Flutter dating app

Comments
3 min read
The HTTP Code Your AI Agent Doesn't Handle Yet: 402

The HTTP Code Your AI Agent Doesn't Handle Yet: 402

2
Comments 1
12 min read
From Chatbot to Mailbox: Persistent Agent Memory in Threads

From Chatbot to Mailbox: Persistent Agent Memory in Threads

1
Comments
5 min read
What is a Mobile AI Agent? The Architecture, Limits, and Hardware Problem (2026)

What is a Mobile AI Agent? The Architecture, Limits, and Hardware Problem (2026)

Comments
4 min read
Sampling strategies compared: temperature, top-p, top-k, min-p, and what actually works in production

Sampling strategies compared: temperature, top-p, top-k, min-p, and what actually works in production

Comments
9 min read
Why your AI Agent needs a sandbox, not a blank check 🛡️

Why your AI Agent needs a sandbox, not a blank check 🛡️

Comments
1 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.