DEV Community

# llm

Posts

Part 3: Why Transformers Still Forget

Comments
5 min read
RAG for LLMs: How Retrieval-Augmented Generation Stops Hallucinations

Comments
6 min read
Mosaic: Sharding Attention Across GPUs When Your Sequence Doesn't Fit

Comments
5 min read
Augustus: Open Source LLM Prompt Injection Scanner

Comments
5 min read
Why “Plans” Should Be First-Class Artifacts in AI-Assisted Development

Comments
5 min read
So I've been losing my mind over document extraction in insurance for the past few years

Comments 1
3 min read
How Transformers Work Inside an LLM (Step by Step)

Comments
3 min read
Inside the Amazon Nova Forge

Comments
4 min read
Intelligent API Key Management and Load Balancing: A Complete Guide to Building Resilient AI Applications using Bifrost

Comments
22 min read
Stop Paying Twice for AI — Turn Your CLI Agents Into Rubber Ducks

Comments
6 min read
AI Code Reliability — Taming the Stochastic Parrot in Deterministic Systems

Comments
3 min read
Bringing RLM to TypeScript: Building rllm

Comments
2 min read
Semantic Cache: Como Otimizar Aplicações RAG com Cache Semântico

Comments
5 min read
Beyond the Restart — The Era of Agentic Self-Healing Microservices

Comments
3 min read
Why Production AI Applications Need an LLM Gateway: From Prototype to Reliable Scale

Comments
17 min read