DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
I built a local-first AI memory layer for LLMs in Rust (no cloud, no API keys)

I built a local-first AI memory layer for LLMs in Rust (no cloud, no API keys)

Comments
1 min read
How FinOps Teams Trace Per-Request AI Costs Through Multi-Tenant Gateways

How FinOps Teams Trace Per-Request AI Costs Through Multi-Tenant Gateways

Comments
2 min read
The ChatGPT Invisibility Bug: Why High-Quality Content Fails to Index in LLM Search

The ChatGPT Invisibility Bug: Why High-Quality Content Fails to Index in LLM Search

1
Comments
5 min read
Operator: cuando responder no basta

Operator: cuando responder no basta

Comments
7 min read
From Prompt to Production: Practical Lessons from Generative AI in .NET

From Prompt to Production: Practical Lessons from Generative AI in .NET

Comments
3 min read
Four ways production agents silently fail — and the physical patterns that prevent them (AOS v0.2)

Four ways production agents silently fail — and the physical patterns that prevent them (AOS v0.2)

Comments
5 min read
The hidden cost of streaming LLMs: caches you can't use, bills you don't expect, and complexity you don't need

The hidden cost of streaming LLMs: caches you can't use, bills you don't expect, and complexity you don't need

Comments 1
11 min read
Six failures, a 32-minute TPU lie, and the moment a language model ignored my prompt on purpose

Six failures, a 32-minute TPU lie, and the moment a language model ignored my prompt on purpose

Comments
5 min read
Unlocking the Power of RAG Systems with LangChain and Vector Databases

Unlocking the Power of RAG Systems with LangChain and Vector Databases

Comments
3 min read
AirLLM Shrinks 70B LLMs to 4GB VRAM; DPO & Supermemory Boost Open Models

AirLLM Shrinks 70B LLMs to 4GB VRAM; DPO & Supermemory Boost Open Models

Comments
3 min read
Switching our LLM-as-judge from 5-class to binary in CI: the patterns we kept

Switching our LLM-as-judge from 5-class to binary in CI: the patterns we kept

Comments
3 min read
From Chatbots to Personal AI Agents: The Infrastructure Developers Actually Need

From Chatbots to Personal AI Agents: The Infrastructure Developers Actually Need

1
Comments
18 min read
GLM 5.2: Zhipu's Open-Weight Frontier Model With 1M Context

GLM 5.2: Zhipu's Open-Weight Frontier Model With 1M Context

Comments
5 min read
RAG pilots fail when the sources are not ready

RAG pilots fail when the sources are not ready

Comments
2 min read
The most expensive bug in an AI agent is the one it's confident about

The most expensive bug in an AI agent is the one it's confident about

Comments
3 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.