DEV Community

# llm

Posts

GPT-5.1 Was Retired on March 11 — Here's What Broke in Your LLM App

1
Comments
4 min read
How LLMs Are Transforming Code Review in 2026

Comments
2 min read
Agent-first CLIs are about reducing turns, not JSON

Comments
4 min read
How Much VRAM Do You Actually Need to Run LLMs Locally?

Comments
3 min read
CodeSpeak: When English Isn't Precise Enough for AI

1
Comments
5 min read
I Tested 50 AI App Prompts for Injection Attacks. 90% Scored CRITICAL.

2
Comments
6 min read
2026 Edition! 10 Essential Open-Source GitHub Repositories for AI Agent Development

4
Comments
2 min read
I asked my AI agent to audit himself. He scored 62/100.

1
Comments 1
4 min read
Concurrent LLM Serving: Benchmarking vLLM vs SGLang vs Ollama

2
Comments
2 min read
10 Best vLLM Alternatives for LLM Inference in Production (2026)

1
Comments
22 min read
Stop Blaming Your LLM: Fix RAG Retrieval Quality With Better Chunking in .NET

1
Comments
7 min read
How I Think About Reliability in LLM Applications

3
Comments 1
6 min read
Why we built a P2P inference network instead of another AI API wrapper

Comments
2 min read
Why Is NullClaw So Small? A Deep Dive into the 678KB AI Coder

5
Comments
4 min read
When the AI's memory explodes: context overflow and compaction failures in production

Comments
3 min read