DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
The Hidden Tax: Why OpenAI Charges Up to 60% More for Spanish Prompts (and How to Fix It)

The Hidden Tax: Why OpenAI Charges Up to 60% More for Spanish Prompts (and How to Fix It)

Comments
7 min read
Claude system prompt diff: lo que cambió entre Opus 4.6 y 4.7 (y yo lo estaba viendo sin saberlo)

Claude system prompt diff: lo que cambió entre Opus 4.6 y 4.7 (y yo lo estaba viendo sin saberlo)

Comments
8 min read
How to Govern Claude Code Usage Across Engineering Teams

How to Govern Claude Code Usage Across Engineering Teams

5
Comments
5 min read
Claude Opus 4.7 Hit 87.6% on SWE-bench. The Story Is What It Didn't Charge You.

Claude Opus 4.7 Hit 87.6% on SWE-bench. The Story Is What It Didn't Charge You.

Comments
7 min read
An unexplainable thing I saw: the agent didn't just comply with rules — it endorsed them

An unexplainable thing I saw: the agent didn't just comply with rules — it endorsed them

Comments
26 min read
agent-consistency – a Python consistency layer for multi-agent workflows

agent-consistency – a Python consistency layer for multi-agent workflows

Comments
1 min read
Why RAG Breaks in Real-World Systems (and How I’m Trying to Fix It)

Why RAG Breaks in Real-World Systems (and How I’m Trying to Fix It)

Comments
2 min read
Custom Silicon, Agentic Search, and Smarter Fine-Tuning

Custom Silicon, Agentic Search, and Smarter Fine-Tuning

Comments
2 min read
The open-weight licence trap: Apache 2.0 vs. the community-licence model

The open-weight licence trap: Apache 2.0 vs. the community-licence model

Comments
5 min read
Stop Benchmarking Embedding Models. 90% of Your Search Quality Lives Upstream.

Stop Benchmarking Embedding Models. 90% of Your Search Quality Lives Upstream.

Comments
4 min read
Your AI Agent Just Spent $3,000. Nobody Told It to Stop.

Your AI Agent Just Spent $3,000. Nobody Told It to Stop.

Comments
6 min read
When "Slow Thinking" Is Just "Slow Talking"

When "Slow Thinking" Is Just "Slow Talking"

Comments
3 min read
WeightRoom — an LLM resource calculator

WeightRoom — an LLM resource calculator

1
Comments
3 min read
🛠️ Harness Engineering — Quick Actionable Guide 🤖

🛠️ Harness Engineering — Quick Actionable Guide 🤖

4
Comments
14 min read
From Generic Evals to Specific Monitors: The Annotation Queue Bridge

From Generic Evals to Specific Monitors: The Annotation Queue Bridge

Comments
4 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.