DEV Community

# llm

Posts

🧩 Runtime Snapshots #5 — The Real Thing: How We Actually Use It

1
Comments
2 min read
From Prompts to Agents: My Learning Journey in the 5-Day AI Agents Intensive

Comments
1 min read
An Intro to Large Language Models and the Transformer Architecture: Talking to a calculator

Comments
4 min read
How Deep Agents Actually Work: A Browsr Architecture Walkthrough

1
Comments
4 min read
Large File MCP: Handle Massive Files in Claude with Intelligent Chunking

Comments
5 min read
Nov14, 2025 | The Tongyi Weekly: Your weekly dose of cutting-edge AI from Tongyi Lab

Comments
5 min read
Docify: Building a Production RAG System for Knowledge Management

Comments
4 min read
The Security Logic Behind LLM Jailbreaking

1
Comments
6 min read
The Architecture of Agent Memory: How LangGraph Really Works

2
Comments
11 min read
LangGraph: Orchestrating Complex LLM Workflows with State Machines

Comments
4 min read
Understanding AI: From LLMs to MCP

Comments
8 min read
From SEO Playbooks to GEO Architectures

3
Comments
13 min read
The Best LLM and AI Orchestration Toolkits for Your Stack

Comments
4 min read
LiteLLM Broke at 300 RPS in Production. Here's How We Fixed It

5
Comments
4 min read
Adding NVIDIA GPU Support to Docker Model Runner

Comments
6 min read
Fine-Tuning LLMs: LoRA, Quantization, and Distillation Simplified

1
Comments
5 min read
Creating Custom Evaluators to Measure Model Quality

Comments
9 min read
Running Out of Data: How Synthetic Data is Saving the Future of AI

10
Comments
4 min read
KV Caching in LLMs: How It Speeds Up Text Generation

10
Comments
2 min read
⚛ MCP Explained: A Simple Guide 📜 to AI 🤖 Agents

1
Comments 2
3 min read
Notes on the margins

Comments
3 min read
5 Must-Read Books for Backend Engineers in 2026

16
Comments
4 min read
How Bifrost Integrates With Your Existing LLM Stack (No Refactoring Required)

5
Comments
4 min read
Preventing AI Project Failures Through Effective Prompt Engineering

1
Comments
5 min read
Building Your First Agentic AI: Complete Guide to MCP + Ollama Tool Calling

2
Comments 3
14 min read