DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Run Hermes Agent on Any Model — Free, Local, and Cost-Routed

Run Hermes Agent on Any Model — Free, Local, and Cost-Routed

Comments
6 min read
How to Scale AI Development Beyond Prototype Speed

How to Scale AI Development Beyond Prototype Speed

1
Comments
10 min read
Gemini 3.5 Flash Developer Guide

Gemini 3.5 Flash Developer Guide

56
Comments 5
8 min read
How to Run a 35B Parameter Model on Your Laptop Without Melting It

How to Run a 35B Parameter Model on Your Laptop Without Melting It

Comments
5 min read
RAG: How AI Models Use Your Data Without Forgetting

RAG: How AI Models Use Your Data Without Forgetting

4
Comments 2
14 min read
We built traceAI, an open-source tool for tracing LLM calls in production

We built traceAI, an open-source tool for tracing LLM calls in production

Comments
1 min read
Claude's default teaching shape has no return: the 5-node loop that fixes it

Claude's default teaching shape has no return: the 5-node loop that fixes it

1
Comments
6 min read
Rethinking Open Source Contribution in the Age of AI Agents, featuring vLLM Core Maintainer Roger Wang at MLSys'26

Rethinking Open Source Contribution in the Age of AI Agents, featuring vLLM Core Maintainer Roger Wang at MLSys'26

8
Comments 6
3 min read
I built a reasoning harness for LLM agents. Here's what an agent receives when it calls it.

I built a reasoning harness for LLM agents. Here's what an agent receives when it calls it.

1
Comments 2
4 min read
How to choose the right AIOps platform

How to choose the right AIOps platform

Comments
4 min read
I Built a Private AI Assistant That Queries My Git History and Project Management Data — Using Only Local LLMs

I Built a Private AI Assistant That Queries My Git History and Project Management Data — Using Only Local LLMs

Comments 1
5 min read
Qwen3.6 GGUF Benchmarks, Ternary Bonsai 1.58-bit Models, & Ollama Code Explainer Tool

Qwen3.6 GGUF Benchmarks, Ternary Bonsai 1.58-bit Models, & Ollama Code Explainer Tool

Comments
3 min read
How to Run LLMs Locally When Cloud AI Gets Too Invasive

How to Run LLMs Locally When Cloud AI Gets Too Invasive

Comments
5 min read
Most document AI questions aren't retrieval problems

Most document AI questions aren't retrieval problems

4
Comments
4 min read
How I got 80% code retrieval accuracy without vectors, embeddings, or any ML

How I got 80% code retrieval accuracy without vectors, embeddings, or any ML

Comments
2 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.