DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
This is why grep is failure when it comes to quality and token saving!

This is why grep is failure when it comes to quality and token saving!

2
Comments
1 min read
Evaluating LLMs for Under a Dollar

Evaluating LLMs for Under a Dollar

2
Comments
4 min read
I fine-tuned an LLM for a client, then told them not to use it

I fine-tuned an LLM for a client, then told them not to use it

1
Comments
5 min read
LangChain fundamentals part 2: structured outputs and tool calling

LangChain fundamentals part 2: structured outputs and tool calling

Comments
2 min read
Agent Series (12): Agent Evaluation Framework — How Do You Know If Your Agent Is Actually Good?

Agent Series (12): Agent Evaluation Framework — How Do You Know If Your Agent Is Actually Good?

Comments
6 min read
KVQuant: Run 70B LLMs on 8GB RAM with KV Cache Quantization

KVQuant: Run 70B LLMs on 8GB RAM with KV Cache Quantization

Comments
1 min read
Building a Persistent Knowledge Base RAG System with FastAPI, llama.cpp, Chroma, and Open WebUI

Building a Persistent Knowledge Base RAG System with FastAPI, llama.cpp, Chroma, and Open WebUI

Comments
7 min read
The database is where AI agents in production get weird

The database is where AI agents in production get weird

Comments
2 min read
On Character Formation and Identity

On Character Formation and Identity

Comments
5 min read
99% of Requests Failed and My Dashboard Showed Green

99% of Requests Failed and My Dashboard Showed Green

1
Comments 1
7 min read
Qwen 3.5 SAEs & 3.6 Q6_K Multimodal, DeepSeek's Visual Primitives Framework

Qwen 3.5 SAEs & 3.6 Q6_K Multimodal, DeepSeek's Visual Primitives Framework

Comments
3 min read
Your AI Conversations Are Not Yours. Yet…

Your AI Conversations Are Not Yours. Yet…

1
Comments
8 min read
My AI Read a JSON File from Disk 900 Times in a Loop (And Why No Linter Can Save You)

My AI Read a JSON File from Disk 900 Times in a Loop (And Why No Linter Can Save You)

Comments
6 min read
Why 99% of What You Send to Claude Is Already Cached

Why 99% of What You Send to Claude Is Already Cached

Comments
7 min read
Madness Driven Design: Don Quixote, Sancho Panza, and Your AI Copilot

Madness Driven Design: Don Quixote, Sancho Panza, and Your AI Copilot

Comments
8 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.