DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
Beyond the "Brute Force Beauty": A Modular, Brain-Inspired LLM Architecture (Thoughts on grand models: Part 2)

Beyond the "Brute Force Beauty": A Modular, Brain-Inspired LLM Architecture (Thoughts on grand models: Part 2)

Comments
4 min read
Your LLM-as-judge eval set is too small. Here is the math

Your LLM-as-judge eval set is too small. Here is the math

1
Comments 1
4 min read
Prompt Injection Explained for Security Professionals

Prompt Injection Explained for Security Professionals

Comments
4 min read
Building a Production-Grade AI Fact-Checker: Patterns, Pipelines, and the Question Every AI Engineer Must Answer

Building a Production-Grade AI Fact-Checker: Patterns, Pipelines, and the Question Every AI Engineer Must Answer

Comments
9 min read
How Superpowers Forces Skill Execution

How Superpowers Forces Skill Execution

Comments 2
7 min read
Voice agent latency is a lie. The number you care about is barge-in interrupt rate.

Voice agent latency is a lie. The number you care about is barge-in interrupt rate.

1
Comments 2
4 min read
Beyond Prompting: Building a 4-Stage LLM Compiler with Surgical Self-Repair

Beyond Prompting: Building a 4-Stage LLM Compiler with Surgical Self-Repair

Comments 1
3 min read
I built an AI debugging assistant with Llama 3.3 — here's what actually worked

I built an AI debugging assistant with Llama 3.3 — here's what actually worked

2
Comments
4 min read
Serving a Fleet of SLMs on One RTX 5080: Multi-Model on a Single Consumer GPU

Serving a Fleet of SLMs on One RTX 5080: Multi-Model on a Single Consumer GPU

Comments 1
4 min read
Token-level eval harness for tool-calling agents: what we wired up

Token-level eval harness for tool-calling agents: what we wired up

Comments 1
4 min read
LLM Cost Optimization for Agent Workflows: A Practical Guide

LLM Cost Optimization for Agent Workflows: A Practical Guide

Comments 1
13 min read
I built a free LLM pricing tool that updates itself daily. here's how

I built a free LLM pricing tool that updates itself daily. here's how

3
Comments
3 min read
Evaluating Open-Weight LLMs for Phishing Simulation and Red Teaming

Evaluating Open-Weight LLMs for Phishing Simulation and Red Teaming

Comments
3 min read
Per-customer budget caps on our caption pipeline: 3 weeks with virtual keys

Per-customer budget caps on our caption pipeline: 3 weeks with virtual keys

Comments 1
4 min read
I'm writing this down before I lose the thread

I'm writing this down before I lose the thread

Comments
7 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.