DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Context Engineering Is the Skill That Actually Ships Reliable AI Agents

Context Engineering Is the Skill That Actually Ships Reliable AI Agents

Comments
6 min read
How I Cut Agent Token Usage by 89% Without Touching the Agent

How I Cut Agent Token Usage by 89% Without Touching the Agent

Comments
4 min read
Gemma 4 makes on-device multimodal AI good enough to ship

Gemma 4 makes on-device multimodal AI good enough to ship

Comments
4 min read
How I added real-time Slack alerts to an open-source LLM gateway in one day

How I added real-time Slack alerts to an open-source LLM gateway in one day

Comments
1 min read
Long-Term Memory for LLM Agents That Works

Long-Term Memory for LLM Agents That Works

1
Comments
6 min read
Claude Fable 5 Is Here. Here's What Actually Matters for Developers 👨🏾‍💻

Claude Fable 5 Is Here. Here's What Actually Matters for Developers 👨🏾‍💻

1
Comments
2 min read
AI code you can't reuse isn't worth the token bill or the risk

AI code you can't reuse isn't worth the token bill or the risk

Comments
5 min read
Voice Sessions Now Know Who They Are. Here's What Changed.

Voice Sessions Now Know Who They Are. Here's What Changed.

1
Comments
2 min read
Why Integrating Multiple LLM Providers Gets Messy Fast

Why Integrating Multiple LLM Providers Gets Messy Fast

Comments
4 min read
Token Budgets Paper: Affine-Typed Budget Ownership

Token Budgets Paper: Affine-Typed Budget Ownership

Comments
6 min read
Subjectivation: A protocol to give LLMs a functional, responsible self

Subjectivation: A protocol to give LLMs a functional, responsible self

Comments
3 min read
A voice agent is not a chatbot with a phone number

A voice agent is not a chatbot with a phone number

2
Comments 1
9 min read
The MCP SDK's EventStore Lives in Memory. Here's What Happens When Your Server Restarts.

The MCP SDK's EventStore Lives in Memory. Here's What Happens When Your Server Restarts.

1
Comments
4 min read
GPT-3.5-Turbo drops from 90% accuracy to 50% when the answer sits in the middle of a 20k-token prompt instead of the sta

GPT-3.5-Turbo drops from 90% accuracy to 50% when the answer sits in the middle of a 20k-token prompt instead of the sta

Comments
1 min read
Sample Your LLM 5 Times and Take a Majority Vote — Accuracy Jumps 35 Points

Sample Your LLM 5 Times and Take a Majority Vote — Accuracy Jumps 35 Points

1
Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.