DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Speculative decoding shifted our output distribution and evals missed it

Speculative decoding shifted our output distribution and evals missed it

1
Comments 1
4 min read
How to Extract Business Rules from Legacy COBOL Code

How to Extract Business Rules from Legacy COBOL Code

Comments
8 min read
How I Reduced Prompt Injection Attacks by 86% With My Own Framework (And What Went Wrong the First Time)

How I Reduced Prompt Injection Attacks by 86% With My Own Framework (And What Went Wrong the First Time)

Comments
5 min read
Compass v1.1.0 · we shipped a memory plugin that catches its own consumption drift

Compass v1.1.0 · we shipped a memory plugin that catches its own consumption drift

1
Comments
5 min read
RAG SOTA: I Tested 7 Pipelines and Built SEQUOIA (Open Source)

RAG SOTA: I Tested 7 Pipelines and Built SEQUOIA (Open Source)

Comments 2
2 min read
Is AI Getting Quietly Dumber? A 24/7 Benchmark That Catches LLM Degradation

Is AI Getting Quietly Dumber? A 24/7 Benchmark That Catches LLM Degradation

Comments 2
6 min read
Your LLM bill is not your capacity plan. Here's the math that pages you at 2am.

Your LLM bill is not your capacity plan. Here's the math that pages you at 2am.

Comments
4 min read
Beyond Pay-Per-Token: How Enterprises Barter Architecture for AI Access

Beyond Pay-Per-Token: How Enterprises Barter Architecture for AI Access

Comments
3 min read
Push vs Pull Memory: A Better Way to Think About AI Agent Memory

Push vs Pull Memory: A Better Way to Think About AI Agent Memory

Comments
5 min read
Why RAG Fails in Enterprise R&D (And What Actually Works)

Why RAG Fails in Enterprise R&D (And What Actually Works)

Comments 1
5 min read
LLM Structured Output Validation in Python That Holds Up

LLM Structured Output Validation in Python That Holds Up

Comments
14 min read
Agents need a black box recorder, not more memory

Agents need a black box recorder, not more memory

Comments
3 min read
AI Reliability: What It Is, Why It Matters, and How to Fix It

AI Reliability: What It Is, Why It Matters, and How to Fix It

Comments
9 min read
What is Agent Memory and why does it matter?

What is Agent Memory and why does it matter?

Comments
7 min read
LLaMA.cpp Gets Qwen MTP Boost, Ring-2.6-1T for Ollama, AMD GPU Fixes

LLaMA.cpp Gets Qwen MTP Boost, Ring-2.6-1T for Ollama, AMD GPU Fixes

Comments
3 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.