DEV Community

# llm

Posts

The Amnesia Epidemic: Why the Next Era of Enterprise AI Requires "Hindsight"
4 min read
Reducing LLM Costs Is Easy — Until Production Starts
4 min read
Stop Overpaying for LLM APIs: A Practical Cost Optimization Guide 💰
8 min read
Building a Voice-Controlled Local AI Agent Using Whisper and Ollama
3 min read
Building a Voice-Controlled Local AI Agent with Whisper, LLaMA 3 and Streamlit
3 min read
Graphify + code-review-graph: Build a Self-Updating Knowledge Graph for Claude Code and other AI Coding Agent
53 min read
Running AI Fully Offline on Mobile with Gemma 4 (Android + iOS)
3 min read
Building a Biomedical GraphRAG Inference System: Comparing LLM-Only, Basic RAG, and GraphRAG Pipelines
3 min read
We open-sourced our AI attack detection engine — 97 MITRE ATLAS rules in a Rust crate
3 min read
(The Voice) Multilingual Layer
4 min read
Test Your LLM Outputs in pytest (15ms, No API Key)
4 min read
Llama4 108B Local Inference, MiniMax M2.7 GGUF Alert, & Ollama Security Scanner
3 min read
To Teach AI to Remember, You First Have to Teach It to Forget
2 min read
Why your AI response restarts on page refresh (and what it takes to prevent it)
3 min read
Persistent Identity Agents: Why Memory Isn’t Enough
2 min read