DEV Community

# llm

Posts

Built a Fully Offline AI Assistant on My Mac (Using Local LLMs)

Comments
2 min read
Why routing LLM calls is harder than it looks (lessons from building ai-gateway)

1
Comments
2 min read
Llama.cpp's New MTP on MacOS

1
Comments
4 min read
Building a Voice-Controlled Local AI Agent: Architecture, Models, and Hard-Won Lessons

Comments
5 min read
NEUROLEARN

Comments
3 min read
How to Build a Multi-Provider LLM Router in 50 Lines of Code 🛤️

Comments 1
9 min read
The Amnesia Epidemic: Why the Next Era of Enterprise AI Requires "Hindsight"

Comments
4 min read
Building a Voice-Controlled Local AI Agent Using Whisper and Ollama

Comments
3 min read
Reducing LLM Costs Is Easy — Until Production Starts

2
Comments
4 min read
Graphify + code-review-graph: Build a Self-Updating Knowledge Graph for Claude Code and other AI Coding Agent

10
Comments
53 min read
Stop Overpaying for LLM APIs: A Practical Cost Optimization Guide 💰

Comments 3
8 min read
Building a Voice-Controlled Local AI Agent with Whisper, LLaMA 3 and Streamlit

Comments
3 min read
Running AI Fully Offline on Mobile with Gemma 4 (Android + iOS)

Comments
3 min read
Building a Biomedical GraphRAG Inference System: Comparing LLM-Only, Basic RAG, and GraphRAG Pipelines

1
Comments
3 min read
We open-sourced our AI attack detection engine — 97 MITRE ATLAS rules in a Rust crate

Comments
3 min read