DEV Community

# llm

Posts

Real LLM Drift Detection Results: Exact Outputs, Real Scores, No Fabrication

Comments
3 min read
I ran 5 social engineering attacks on AI. The failure modes are human.

1
Comments
2 min read
Self-Hosted Email Threat Detection: Real-Time Monitoring, Multi-Stage Enrichment, and LLM Verdicts with Legal Compliance

1
Comments
15 min read
Anthropic Silently Dropped Prompt Cache TTL from 1 Hour to 5 Minutes

Comments
3 min read
Addressing LLM Benchmarking Obsolescence: Strategies for Timely and Relevant Model Evaluation

1
Comments
13 min read
The "Always" Trap: Why Your AI Ignores Nuance (And How to Fix It)

1
Comments
7 min read
AI Personalization Blog Post

Comments
3 min read
I Built an LLM Gateway That Learns Which Model to Use — Here's How the Routing Works

1
Comments
5 min read
GPT-5.4 Is Here, Cursor Just Got Agentic, and Open-Source LLMs Are Winning — Here's What's Happening in AI Right Now

Comments
3 min read
I shipped an LLM feature, got 11 users, then the model silently changed on me. Here's what I built to stop it happening again.

Comments
3 min read
Disassembling AI Agents - Part 2: Claude Code

Comments
15 min read
🏗️ 📐 Harness Engineering: The Emerging Discipline of Making AI Agents Reliable 🤖

10
Comments 2
20 min read
LLM vs RAG

1
Comments 1
1 min read
Is GitHub Copilot open source or proprietary?

1
Comments
7 min read
Gemini 1.5 Pro Also Drifts: Known Regression Patterns and How to Monitor Them

1
Comments
3 min read