DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
The Pragmatic Architect’s Guide to Enterprise AI: Balancing Cost, Memory, Context, and Production Reality

The Pragmatic Architect’s Guide to Enterprise AI: Balancing Cost, Memory, Context, and Production Reality

Comments
8 min read
I Am an AI Agent Running a Real Business With Real Money — Here's What's Actually Happening

I Am an AI Agent Running a Real Business With Real Money — Here's What's Actually Happening

Comments
3 min read
How I Caught My LLM Fabricating Its Own Evidence

How I Caught My LLM Fabricating Its Own Evidence

Comments 1
3 min read
How I Deployed Llama 3.1 on AWS EC2 (g4dn.xlarge) with llama.cpp — Real Numbers

How I Deployed Llama 3.1 on AWS EC2 (g4dn.xlarge) with llama.cpp — Real Numbers

2
Comments 1
2 min read
Dev.to: We had AI pitching our customers' aunts. Here's the three-axis classification fix.

Dev.to: We had AI pitching our customers' aunts. Here's the three-axis classification fix.

Comments
3 min read
Your AI Agent Just Crashed at Step 9 of 12. Here's How to Make That Not Matter.

Your AI Agent Just Crashed at Step 9 of 12. Here's How to Make That Not Matter.

Comments 1
7 min read
I shipped a partial solution to MEME's Absence task 6 days before the paper. By accident.

I shipped a partial solution to MEME's Absence task 6 days before the paper. By accident.

Comments
5 min read
Cave Prompt: Making AI understand your requirements better

Cave Prompt: Making AI understand your requirements better

Comments 1
1 min read
Why prompt filtering fails and what to do instead

Why prompt filtering fails and what to do instead

Comments
2 min read
Compass v1.1.0 · we shipped a memory plugin that catches its own consumption drift

Compass v1.1.0 · we shipped a memory plugin that catches its own consumption drift

Comments 1
5 min read
Tiger Graph Hackathon

Tiger Graph Hackathon

1
Comments
4 min read
Cognitive Architectures of AGI: 7 Patterns That Transform LLMs from Oracles into Thinkers

Cognitive Architectures of AGI: 7 Patterns That Transform LLMs from Oracles into Thinkers

Comments 1
4 min read
I built a vector embedding cache that makes stale hits structurally impossible

I built a vector embedding cache that makes stale hits structurally impossible

Comments
1 min read
llama.cpp MTP Boost, New Gemma-4 GGUF, & Qwen 3.6 Local Benchmarks

llama.cpp MTP Boost, New Gemma-4 GGUF, & Qwen 3.6 Local Benchmarks

Comments
3 min read
Stop Getting 'It Depends' Answers About RAG Architecture

Stop Getting 'It Depends' Answers About RAG Architecture

Comments
4 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.