DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
NVIDIA Blackwell Sweeps MLPerf Training 6.0: Strong Scaling

NVIDIA Blackwell Sweeps MLPerf Training 6.0: Strong Scaling

Comments
6 min read
🚀 I Ran Claude Code on Every New Claude Model. Here's What Actually Ships.

🚀 I Ran Claude Code on Every New Claude Model. Here's What Actually Ships.

Comments
14 min read
AIchain Agent: Plan, Act, Reflect

AIchain Agent: Plan, Act, Reflect

Comments
5 min read
The AI Cost Paradox: 280x Cheaper, Bills Still Rising

The AI Cost Paradox: 280x Cheaper, Bills Still Rising

Comments
8 min read
Building a Memory System for My AI Code Generator

Building a Memory System for My AI Code Generator

Comments
2 min read
60–95% fewer tokens in your agent loops, same answers. Meet Headroom.

60–95% fewer tokens in your agent loops, same answers. Meet Headroom.

Comments
2 min read
20 Claude agents for M&A diligence, built on one rule: cite the source or cut the claim

20 Claude agents for M&A diligence, built on one rule: cite the source or cut the claim

Comments
6 min read
The hardest LLM bugs are contract failures, not hallucinations

The hardest LLM bugs are contract failures, not hallucinations

Comments
2 min read
I Spent $8,857 Using Claude Code to Build 6 Projects. Here's What I Learned.

I Spent $8,857 Using Claude Code to Build 6 Projects. Here's What I Learned.

2
Comments 2
10 min read
Load late, load little: just-in-time context for conversation history

Load late, load little: just-in-time context for conversation history

Comments
10 min read
Temperature and Sampling: the LLM Creativity Dial

Temperature and Sampling: the LLM Creativity Dial

Comments
1 min read
Self-RAG: Let the Model Decide When to Retrieve, Then Grade Itself

Self-RAG: Let the Model Decide When to Retrieve, Then Grade Itself

Comments
1 min read
KV cache and PagedAttention: what they do and why they matter

KV cache and PagedAttention: what they do and why they matter

1
Comments
8 min read
CortexOps vs Langfuse: Open Source AI Observability Compared

CortexOps vs Langfuse: Open Source AI Observability Compared

Comments
3 min read
Understanding Retrieval-Augmented Generation (RAG): The AI Architecture That Makes LLMs Smarter

Understanding Retrieval-Augmented Generation (RAG): The AI Architecture That Makes LLMs Smarter

Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.