DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Prompts Aren't Enough: Enforcing Hard Constraints on LLM Output

Prompts Aren't Enough: Enforcing Hard Constraints on LLM Output

1
Comments 1
4 min read
I put 6 LLM guardrail tools inline and measured what they cost me. Here is the latency-vs-recall tradeoff.

I put 6 LLM guardrail tools inline and measured what they cost me. Here is the latency-vs-recall tradeoff.

1
Comments
3 min read
LLM Prompt Injection & Guardrail Security

LLM Prompt Injection & Guardrail Security

Comments 1
5 min read
What is your cutoff for killing a bad Codex run?

What is your cutoff for killing a bad Codex run?

Comments
1 min read
Tokens, Context, and Why Small AI Tasks Aren't Cheap

Tokens, Context, and Why Small AI Tasks Aren't Cheap

Comments 1
4 min read
Nvidia H100 and GPU Pricing 2026: Buy, Rent, and Cloud Costs Explained

Nvidia H100 and GPU Pricing 2026: Buy, Rent, and Cloud Costs Explained

1
Comments
5 min read
We stopped writing eval cases by hand. Now every prod incident becomes one.

We stopped writing eval cases by hand. Now every prod incident becomes one.

Comments
2 min read
Clioloop — The Open-Source AI Agent with Agentic Fusion

Clioloop — The Open-Source AI Agent with Agentic Fusion

2
Comments
2 min read
PII Redaction Built Entirely in the Browser

PII Redaction Built Entirely in the Browser

3
Comments 4
2 min read
AI Evals, Part 5: From a Number to a Gate Evals in CI and Production

AI Evals, Part 5: From a Number to a Gate Evals in CI and Production

Comments
4 min read
AI Evals, Part 4: LLM-as-Judge, Done Right

AI Evals, Part 4: LLM-as-Judge, Done Right

Comments
5 min read
Keeping a client's VLM inference inside the EU with a self-hosted-first gateway

Keeping a client's VLM inference inside the EU with a self-hosted-first gateway

Comments
4 min read
LLM Self-Preference Bias: How Anonymized Peer Review Fixes It

LLM Self-Preference Bias: How Anonymized Peer Review Fixes It

Comments
7 min read
Why Coding Agents Need Two Halves of Infrastructure: Control Plane + Fast Data Plane

Why Coding Agents Need Two Halves of Infrastructure: Control Plane + Fast Data Plane

Comments
7 min read
The latency tax of an LLM gateway: I measured Bifrost's overhead

The latency tax of an LLM gateway: I measured Bifrost's overhead

Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.