DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Your AGENTS.md is valid. Your agent still breaks the rules.

Your AGENTS.md is valid. Your agent still breaks the rules.

Comments 3
6 min read
Agent as a Tool Call: Claude Code's Fork-Exec Pattern

Agent as a Tool Call: Claude Code's Fork-Exec Pattern

2
Comments 1
2 min read
Cache-Aware Spawning: What Changed in llm-cli-gateway, a Week On

Cache-Aware Spawning: What Changed in llm-cli-gateway, a Week On

Comments
12 min read
What is RAG? A Beginner's Guide to Retrieval-Augmented Generation (For Engineers Who Actually Build It)

What is RAG? A Beginner's Guide to Retrieval-Augmented Generation (For Engineers Who Actually Build It)

Comments
5 min read
I tested 5 LLMs for prompt-injection leaks. Same code, 0% to 90%.

I tested 5 LLMs for prompt-injection leaks. Same code, 0% to 90%.

Comments
3 min read
How to stop your RAG assistant from hallucinating (a practical guide)

How to stop your RAG assistant from hallucinating (a practical guide)

Comments 2
3 min read
Transformer as an Incomplete Cognitive Architecture: What It Captures Well and What It Misses (A11 Perspective)

Transformer as an Incomplete Cognitive Architecture: What It Captures Well and What It Misses (A11 Perspective)

Comments
4 min read
ai, deepseek, machinelearning

ai, deepseek, machinelearning

1
Comments 2
5 min read
We Obsessed Over Gateway Latency for a Month. Then We Looked at the Actual Numbers.

We Obsessed Over Gateway Latency for a Month. Then We Looked at the Actual Numbers.

2
Comments 1
4 min read
Stop Pasting Your Code Into ChatGPT For Debugging—Run LLMs Locally Instead

Stop Pasting Your Code Into ChatGPT For Debugging—Run LLMs Locally Instead

1
Comments
4 min read
I Added a 4th Agent That Audits My Other Agents. It Caught My Strategist Procrastinating for 3 Weeks.

I Added a 4th Agent That Audits My Other Agents. It Caught My Strategist Procrastinating for 3 Weeks.

Comments
9 min read
llmfleet: pool many agents' turns into one Batch API call and save 50 percent

llmfleet: pool many agents' turns into one Batch API call and save 50 percent

Comments
4 min read
My LLM provider went down for 11 minutes. My code spent 4 of them in connect timeouts.

My LLM provider went down for 11 minutes. My code spent 4 of them in connect timeouts.

Comments
4 min read
A six-concern production harness for Nemotron agents on Crusoe Managed Inference

A six-concern production harness for Nemotron agents on Crusoe Managed Inference

Comments
4 min read
I needed a stable cache key for LLM requests. The hard part was the input list order.

I needed a stable cache key for LLM requests. The hard part was the input list order.

Comments
4 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.