DEV Community

# llm

Posts

A Unified View of AI Evolution: From Machine Learning to LLMs, RAG, and Fine-Tuning

Comments
5 min read
Claude Code: Self host model configuration

Comments
1 min read
16 GB VRAM LLM benchmarks with llama.cpp (speed and context)

Comments
4 min read
The Hidden Cost of Context: Why Your Agent Is Expensive and Slow

Comments
2 min read
Show HN: LoreSpec – Structured knowledge extraction from AI conversations

Comments
1 min read
I Got Tired of Surprise OpenAI Bills, So I Built a Dashboard to Track Them

Comments
4 min read
The 70% to 94% Problem: Why Your AI Skills Are Probably Wrong

Comments
4 min read
Building an AI that analyzes stocks like Warren Buffett

Comments
2 min read
88% of Agent Systems Got Hacked — Your LangGraph Auth Layer Is the Problem

5
Comments 1
1 min read
Anthropic Just Restricted Third-Party Claude Access — Why Running AI Locally Is Your Insurance Policy

Comments
3 min read
The AI Billing Problem Nobody Talks About

Comments
3 min read
Open Source Project of the Day (Part 29): Open-AutoGLM - A Phone Agent Framework for Controlling Phones with Natural Language

Comments
9 min read
Gemma 4 & LLM Ops: Fine-Tuning, Local Inference, and VRAM Management

Comments
3 min read
Your Model Already Knows How to Reason. It Needs 26 Bytes to Prove It.

Comments
3 min read
Three Memory Architectures for AI Companions

Comments
7 min read