DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Beginner's Guide to Essential Terms in Artificial Intelligence

Beginner's Guide to Essential Terms in Artificial Intelligence

Comments
10 min read
I built a Rust LLM inference engine with custom WGSL GPU kernels, here's what I learned!

I built a Rust LLM inference engine with custom WGSL GPU kernels, here's what I learned!

Comments 1
1 min read
Claude Code is not a recursive agent. I read the source and checked.

Turn budgets based on iterative loop passes

Claude Code is not a recursive agent. I read the source and checked.

6
Comments 7
7 min read
Three LLM Observability Audits in Five Days: Each Fix Exposed the Next Bug

Three LLM Observability Audits in Five Days: Each Fix Exposed the Next Bug

Comments
6 min read
I tried to make an AI agent answer more. It answered less.

I tried to make an AI agent answer more. It answered less.

1
Comments 1
5 min read
Who pays for the tokens? Designing an AI plugin that doesn't break your users' wallets

Who pays for the tokens? Designing an AI plugin that doesn't break your users' wallets

3
Comments 2
7 min read
Claude Fable 5 can run for days. When does a solo dev actually want that?

Claude Fable 5 can run for days. When does a solo dev actually want that?

2
Comments
5 min read
Why DeepSeek V3.2 Tool Calls Can Drift from Ordered System Instructions

Why DeepSeek V3.2 Tool Calls Can Drift from Ordered System Instructions

Comments
4 min read
When You Swap Your AI Agent's Brain — Everything Breaks

When You Swap Your AI Agent's Brain — Everything Breaks

Comments 1
6 min read
GraphRAG Explained: How Knowledge Graphs Are Transforming Modern RAG Systems

GraphRAG Explained: How Knowledge Graphs Are Transforming Modern RAG Systems

Comments
3 min read
Building a Skills Updater Pipeline for AI Platforms

Building a Skills Updater Pipeline for AI Platforms

Comments
2 min read
The 10 Best AI Memory Layers for Agents in 2026

The 10 Best AI Memory Layers for Agents in 2026

1
Comments
7 min read
How fast is LlamaStash? Overhead, throughput, and a fair comparison with Ollama and LM Studio

TTFT and RAG efficiency insights

How fast is LlamaStash? Overhead, throughput, and a fair comparison with Ollama and LM Studio

12
Comments 9
24 min read
Coverage decay: when style prompts forget themselves

Coverage decay: when style prompts forget themselves

Comments 1
15 min read
DeepSeek-V3: The 671B MoE Model You Can Run Locally in 2026

DeepSeek-V3: The 671B MoE Model You Can Run Locally in 2026

Comments
9 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.