DEV Community

# llm

Posts

Why LLM Agents Fail: Four Mechanisms of Cognitive Decay and the Reasoning Harness Layer

Comments
13 min read
Brazilian Lawyers Fined R$84,000 for Prompt Injection in Court — Here's What Caught Them (and What Didn't)

1
Comments
5 min read
Stop Hand-Rolling AI Glue: Claude 4 MCP + Spring AI is the New Enterprise Standard

Comments
2 min read
Qwen3.6-27B vLLM 0.19 Benchmarks, GLM 5.1 Local Performance, & Multimodal WaTale

Comments
4 min read
Your AI Agent Is Reading Poisoned Web Pages (And You Don't Know It)

1
Comments
4 min read
Why Your AI Agent Loses the Plot: Reasoning Decay and Attention Loss in Long-Running Tasks

Comments 1
10 min read
Prompting Is Not Magic. It Is Control.

Anti-prompts that make model failure visible

21
Comments 37
8 min read
Model Output Is Not Authority: Action Assurance for AI Agents

1
Comments
7 min read
An AI Agent Wiped a Production Database in 9 Seconds. What Engineers Must Design Before Shipping.

1
Comments 2
5 min read
Moving Beyond My Go-To LLM: Is Anyone Using Anthropic's Mythos? 🤖

5
Comments 1
1 min read
Why MTP doesn't speed up your llama.cpp inference (and how to actually fix it)

Comments
5 min read
How LLMs Actually Work (And What That Means for Your Architecture Decisions)

2
Comments
6 min read
Fixing the Missing think Tag Glitch When Running DeepSeek V3.2 GGUF on CPU

Comments
1 min read
I built Geometric Breathing & Meditation app on VCA

Comments
4 min read
DeepSeek V4's Real Innovation Isn't Scale—It's Memory Architecture

Comments
3 min read