DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Run your first AI agent in Java — for free, with Mistral

Run your first AI agent in Java — for free, with Mistral

2
Comments
2 min read
Capping VLM spend per CV researcher: hierarchical budgets in practice

Capping VLM spend per CV researcher: hierarchical budgets in practice

1
Comments 2
4 min read
Token Saving, and Caveman

Token Saving, and Caveman

Comments
7 min read
Stop Shipping AI Slop: Build an Anti-Slop Harness Around Your LLM

Stop Shipping AI Slop: Build an Anti-Slop Harness Around Your LLM

Comments
6 min read
Agent as a Tool Call: Claude Code's Fork-Exec Pattern

Agent as a Tool Call: Claude Code's Fork-Exec Pattern

2
Comments 1
2 min read
Cache-Aware Spawning: What Changed in llm-cli-gateway, a Week On

Cache-Aware Spawning: What Changed in llm-cli-gateway, a Week On

Comments
12 min read
What is RAG? A Beginner's Guide to Retrieval-Augmented Generation (For Engineers Who Actually Build It)

What is RAG? A Beginner's Guide to Retrieval-Augmented Generation (For Engineers Who Actually Build It)

Comments
5 min read
How to stop your RAG assistant from hallucinating (a practical guide)

How to stop your RAG assistant from hallucinating (a practical guide)

Comments 2
3 min read
Transformer as an Incomplete Cognitive Architecture: What It Captures Well and What It Misses (A11 Perspective)

Transformer as an Incomplete Cognitive Architecture: What It Captures Well and What It Misses (A11 Perspective)

Comments
4 min read
ai, deepseek, machinelearning

ai, deepseek, machinelearning

1
Comments 2
5 min read
Stop Pasting Your Code Into ChatGPT For Debugging—Run LLMs Locally Instead

Stop Pasting Your Code Into ChatGPT For Debugging—Run LLMs Locally Instead

1
Comments
4 min read
I Added a 4th Agent That Audits My Other Agents. It Caught My Strategist Procrastinating for 3 Weeks.

I Added a 4th Agent That Audits My Other Agents. It Caught My Strategist Procrastinating for 3 Weeks.

Comments
9 min read
A six-concern production harness for Nemotron agents on Crusoe Managed Inference

A six-concern production harness for Nemotron agents on Crusoe Managed Inference

Comments
4 min read
My LLM provider went down for 11 minutes. My code spent 4 of them in connect timeouts.

My LLM provider went down for 11 minutes. My code spent 4 of them in connect timeouts.

Comments
4 min read
agenttap: see exactly what your LLM SDK sent to the wire, with API keys scrubbed

agenttap: see exactly what your LLM SDK sent to the wire, with API keys scrubbed

Comments
4 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.