DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
📌 Most models use Grouped Query Attention. That doesn’t mean yours should.📌

📌 Most models use Grouped Query Attention. That doesn’t mean yours should.📌

1
Comments
1 min read
Comparative Cost & ROI: Chatbots vs LLM Integrations vs Autonomous Agents

Comparative Cost & ROI: Chatbots vs LLM Integrations vs Autonomous Agents

Comments
5 min read
Mooncake Memory Deep Dive: KVCache, Token Cost, DRAM Usage, and Saturation Analysis

Mooncake Memory Deep Dive: KVCache, Token Cost, DRAM Usage, and Saturation Analysis

Comments
5 min read
How I Built a Multilingual Food Database Using a Local LLM on an AMD GPU

How I Built a Multilingual Food Database Using a Local LLM on an AMD GPU

Comments 1
2 min read
I Didn’t Build a Chatbot — I Built an AI That Runs the System

I Didn’t Build a Chatbot — I Built an AI That Runs the System

Comments
2 min read
A Deep Dive into Deep Agent Architecture for AI Coding Assistants

A Deep Dive into Deep Agent Architecture for AI Coding Assistants

2
Comments
16 min read
Divide and Conquer: Mitigating LLM Context Saturation in Compliance Workflows

Divide and Conquer: Mitigating LLM Context Saturation in Compliance Workflows

3
Comments
6 min read
Local Lock Down Lobe Chat Setup

Local Lock Down Lobe Chat Setup

Comments
4 min read
Agentic AI — From Workflows to Goal-Driven Systems

Agentic AI — From Workflows to Goal-Driven Systems

3
Comments 1
3 min read
Decoupling the AI Stack: How to Architect a Production-Grade Local LLM System

Decoupling the AI Stack: How to Architect a Production-Grade Local LLM System

Comments 2
4 min read
Introducing `everyrow.io/dedupe`: An LLM-based approach to semantic deduplication

Introducing `everyrow.io/dedupe`: An LLM-based approach to semantic deduplication

2
Comments
6 min read
OWL-Aware Chunking Strategies: A Comprehensive Performance Analysis

OWL-Aware Chunking Strategies: A Comprehensive Performance Analysis

Comments
12 min read
Beyond Sharper Images: How LLM-Guided Super-Resolution Transforms Geo-Spatial Analysis

Beyond Sharper Images: How LLM-Guided Super-Resolution Transforms Geo-Spatial Analysis

Comments
7 min read
Tool Calling in LLMs: How Models Talk to the Real World

Tool Calling in LLMs: How Models Talk to the Real World

1
Comments 1
5 min read
Teaching LLMs to Stop Wasting Tokens

Teaching LLMs to Stop Wasting Tokens

5
Comments 2
2 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.