DEV Community

# llm

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
When Generated Tests Pass but Don't Protect — a small failure that became a production bug

When Generated Tests Pass but Don't Protect — a small failure that became a production bug

Comments
3 min read
How 2025 took AI from party tricks to production tools

How 2025 took AI from party tricks to production tools

Comments
6 min read
Chunking, Batching & Indexing: The Hidden Costs of RAG Systems

Chunking, Batching & Indexing: The Hidden Costs of RAG Systems

Comments
2 min read
LLM Parameter fine tuning with Spring AI

LLM Parameter fine tuning with Spring AI

Comments
3 min read
I Tested GLM-4.7 for Two Weeks—Here's What Actually Matters

I Tested GLM-4.7 for Two Weeks—Here's What Actually Matters

Comments
2 min read
Day 0 (Planning): Building my personal AI assistant that runs locally.

Day 0 (Planning): Building my personal AI assistant that runs locally.

5
Comments
4 min read
Part 3: Why Transformers Still Forget

Part 3: Why Transformers Still Forget

Comments
5 min read
Coding Agents as a First-Class Consideration in Project Structures

Coding Agents as a First-Class Consideration in Project Structures

1
Comments
6 min read
Mosaic: Sharding Attention Across GPUs When Your Sequence Doesn't Fit

Mosaic: Sharding Attention Across GPUs When Your Sequence Doesn't Fit

Comments
5 min read
The Brain of the Future Agent: Why VL-JEPA Matters for Real-World AI

The Brain of the Future Agent: Why VL-JEPA Matters for Real-World AI

1
Comments 1
5 min read
Turning React/TypeScript Codebases Into Deterministic AI Context - End to End

Turning React/TypeScript Codebases Into Deterministic AI Context - End to End

Comments
2 min read
Arcana: an agentic AI system for reasoning about MongoDB architectures

Arcana: an agentic AI system for reasoning about MongoDB architectures

Comments
1 min read
So I've been losing my mind over document extraction in insurance for the past few years

So I've been losing my mind over document extraction in insurance for the past few years

Comments 1
3 min read
Intelligent API Key Management and Load Balancing: A Complete Guide to Building Resilient AI Applications using Bifrost

Intelligent API Key Management and Load Balancing: A Complete Guide to Building Resilient AI Applications using Bifrost

Comments
22 min read
Bringing RLM to TypeScript: Building rllm

Bringing RLM to TypeScript: Building rllm

Comments
2 min read
Semantic Cache: Como Otimizar Aplicações RAG com Cache Semântico

Semantic Cache: Como Otimizar Aplicações RAG com Cache Semântico

1
Comments
5 min read
Why Production AI Applications Need an LLM Gateway: From Prototype to Reliable Scale

Why Production AI Applications Need an LLM Gateway: From Prototype to Reliable Scale

Comments
17 min read
Working with LLMs: A 50/50 Effort.

Working with LLMs: A 50/50 Effort.

5
Comments
11 min read
When code-gen suggests deprecated Pandas APIs: a case study in subtle breakage

When code-gen suggests deprecated Pandas APIs: a case study in subtle breakage

Comments
3 min read
Agents should be able to time travel

Agents should be able to time travel

Comments
3 min read
Beyond the Chatbot: Architecture for Production-Grade Agents (Context as a Service)

Beyond the Chatbot: Architecture for Production-Grade Agents (Context as a Service)

Comments
3 min read
RAG Isn’t a Modeling Problem. It’s a Data Engineering Problem.

RAG Isn’t a Modeling Problem. It’s a Data Engineering Problem.

1
Comments
6 min read
Cómo Reducir Costes LLM un 80% Sin Sacrificar Calidad en Producción

Cómo Reducir Costes LLM un 80% Sin Sacrificar Calidad en Producción

Comments
4 min read
Prompts are logic, not strings: Why I contributed to Convo-Lang

Prompts are logic, not strings: Why I contributed to Convo-Lang

Comments
3 min read
Why LLMs Break in Production (and Why It’s Not a Model Problem)

Why LLMs Break in Production (and Why It’s Not a Model Problem)

Comments
3 min read
loading...