DEV Community

# llm

Posts

How to Run a 35B Parameter Model on Your Laptop Without Melting It

Comments
5 min read
We built traceAI, an open-source tool for tracing LLM calls in production

Comments
1 min read
Claude's default teaching shape has no return: the 5-node loop that fixes it

1
Comments
6 min read
How to choose the right AIOps platform

Comments
4 min read
Qwen3.6 GGUF Benchmarks, Ternary Bonsai 1.58-bit Models, & Ollama Code Explainer Tool

Comments
3 min read
How to Run LLMs Locally When Cloud AI Gets Too Invasive

Comments
5 min read
Most document AI questions aren't retrieval problems

4
Comments
4 min read
How I got 80% code retrieval accuracy without vectors, embeddings, or any ML

Comments
2 min read
Agentic AI's Infrastructure Boom Meets Its Reliability Problem

Comments
3 min read
Stop Paying for the Same Answer Twice: A Deep Dive into llm-cache

3
Comments
9 min read
Why I built ragwise: pip-installable RAG with hybrid search, streaming, and agent tools by default

Comments
4 min read
Running LLM Classification After the Response: Next.js after() + OpenRouter at $0.0002 per Call

5
Comments
8 min read
Gemma-4-31B on v6e-4 TPU Benchmarks

7
Comments
2 min read
All Data and AI Weekly #238-20April2026

5
Comments
11 min read
Opus 4.7 First Look: I Tested the Day-Old Model Against 3 Other Claudes on 10 Real Tasks

Comments 1
5 min read