DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Claude Status: Why Your Claude API Keeps Returning 529 `overloaded_error` — A Production Debugging Playbook

Claude Status: Why Your Claude API Keeps Returning 529 `overloaded_error` — A Production Debugging Playbook

2
Comments
4 min read
Modernizing Legacy Code with Konveyor AI: From EJB to Kubernetes

Modernizing Legacy Code with Konveyor AI: From EJB to Kubernetes

Comments
3 min read
Agent-first CLIs are about reducing turns, not JSON

Agent-first CLIs are about reducing turns, not JSON

Comments
4 min read
Should You Be Using RAG in 2026?

Should You Be Using RAG in 2026?

3
Comments
12 min read
How Much VRAM Do You Actually Need to Run LLMs Locally?

How Much VRAM Do You Actually Need to Run LLMs Locally?

Comments
3 min read
Running Gemma 4 Locally on an iPhone 13 Pro with Swift

Running Gemma 4 Locally on an iPhone 13 Pro with Swift

1
Comments 1
2 min read
CodeSpeak: When English Isn't Precise Enough for AI

CodeSpeak: When English Isn't Precise Enough for AI

1
Comments
5 min read
I Tested 50 AI App Prompts for Injection Attacks. 90% Scored CRITICAL.

I Tested 50 AI App Prompts for Injection Attacks. 90% Scored CRITICAL.

2
Comments
6 min read
2026年版!AIエージェント開発に必須のオープンソースGitHubリポジトリ10選

2026年版!AIエージェント開発に必須のオープンソースGitHubリポジトリ10選

4
Comments
2 min read
🎶 AllegroAgent : A Lightweight Python Framework for Building Stateful AI Agents 🚀

🎶 AllegroAgent : A Lightweight Python Framework for Building Stateful AI Agents 🚀

6
Comments
4 min read
I asked my AI agent to audit himself. He scored 62/100.

I asked my AI agent to audit himself. He scored 62/100.

1
Comments 1
4 min read
10 Best vLLM Alternatives for LLM Inference in Production (2026)

10 Best vLLM Alternatives for LLM Inference in Production (2026)

1
Comments
22 min read
Stop Blaming Your LLM: Fix RAG Retrieval Quality With Better Chunking in .NET

Stop Blaming Your LLM: Fix RAG Retrieval Quality With Better Chunking in .NET

1
Comments
7 min read
How I Think About Reliability in LLM Applications

How I Think About Reliability in LLM Applications

3
Comments 1
6 min read
Title: Why we built a P2P inference network instead of another AI API wrapper

Title: Why we built a P2P inference network instead of another AI API wrapper

Comments
2 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.