DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
How to Handle LLM API Errors & Rate Limits in Node.js

How to Handle LLM API Errors & Rate Limits in Node.js

Comments
4 min read
I measured the token cost of 13 real AI agents (GitHub's MCP server alone is 3,546 tokens/turn)

I measured the token cost of 13 real AI agents (GitHub's MCP server alone is 3,546 tokens/turn)

Comments 1
2 min read
AIchain Reasoning: One Parameter for Every Provider

AIchain Reasoning: One Parameter for Every Provider

Comments
5 min read
MarginGate: Margin-Gated Verification for Batch-Invariant Decoding

MarginGate: Margin-Gated Verification for Batch-Invariant Decoding

Comments
5 min read
Prompt Engineering Is Systems Design, Not a User Skill

Prompt Engineering Is Systems Design, Not a User Skill

1
Comments 1
5 min read
redb.Route.Llm 3.1.1 — per-message audit fields for LLM compliance / replay

redb.Route.Llm 3.1.1 — per-message audit fields for LLM compliance / replay

Comments
1 min read
Tenure — Building an AI Code Reviewer That Earns Trust Over Time

Tenure — Building an AI Code Reviewer That Earns Trust Over Time

Comments
4 min read
Your Agent's Memory Looks Like It Works. Here Is a One-Minute Test That Tells You If It Actually Does.

Your Agent's Memory Looks Like It Works. Here Is a One-Minute Test That Tells You If It Actually Does.

Comments 6
2 min read
Vector Databases: The Unsung Hero of Large Language Models and Generative AI

Vector Databases: The Unsung Hero of Large Language Models and Generative AI

Comments
7 min read
I Broke a Chatbot With a Prompt Change. Then I Built the Tool That Would've Caught It.

I Broke a Chatbot With a Prompt Change. Then I Built the Tool That Would've Caught It.

Comments
3 min read
Why 95 Reviews Beats 20 Reviews — Even When Both Score 95%

Why 95 Reviews Beats 20 Reviews — Even When Both Score 95%

Comments
6 min read
The hardest part of my AI dating app wasn't the AI — it was making it not sound like AI

The hardest part of my AI dating app wasn't the AI — it was making it not sound like AI

2
Comments
3 min read
Prefix caching at scale: when it saves you 80% of prefill cost, and the eviction policies that quietly turn it into 5%

Prefix caching at scale: when it saves you 80% of prefill cost, and the eviction policies that quietly turn it into 5%

Comments
9 min read
I Reduced My System Prompt Tokens by 70% Using a Custom Prompt DSL

I Reduced My System Prompt Tokens by 70% Using a Custom Prompt DSL

3
Comments
6 min read
MCTS-Reasoning: Tree Search for LLM Reasoning

MCTS-Reasoning: Tree Search for LLM Reasoning

Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.