DEV Community

# llm

Posts

Xiaomi's MiMo Code gets better as tasks get harder. Here's how.

Comments
3 min read
Your AI Agent Will Double-Charge on a Lost Response

Comments
14 min read
What Happens When Your AI Agent Lies (And How to Stop It)

Comments
4 min read
The 5.5% Tax of OpenRouter — and Why I Built an Alternative

Comments
7 min read
NVIDIA Blackwell Leads AgentPerf, the First Agentic-AI Infra Benchmark: Trajectory-Replay Benchmarking

Comments
6 min read
A 9-point eval gain vanished when we deduped train against test

Comments
4 min read
I built an AI tool that saves 80% LLM costs & catches hallucinations (Open Source)

Comments
2 min read
Emergence AI's Crazy Experiment: Emergent World

Comments
2 min read
AI Agents Explained: the Thought-Action-Observation Loop

Comments 1
1 min read
How We Built an AI CRM That Actually Remembers Customers Using Hindsight

Comments
3 min read
Email Tools for Claude: Tool Use With an Agent Mailbox

Comments 1
5 min read
Three AI providers went down on the same day. Here's the architecture that didn't care.

Comments
5 min read
How to Fine-Tune LLMs on Your Own Data: Open-Source Models, RL Environments, and Evals

Comments
7 min read
I tracked every GitHub traffic spike for my open source LLM proxy for 7 weeks. Then I did the exact same thing again, and it worked again.

Comments
5 min read
Can caveman Really Cut My Token Bill?

Comments
2 min read