DEV Community

# llm

Posts

đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.
The Prompting Trick That Fixed My AI Image Generation

The Prompting Trick That Fixed My AI Image Generation

11
Comments
7 min read
Reranking and Two-Stage Retrieval: Precision When It Matters Most

Reranking and Two-Stage Retrieval: Precision When It Matters Most

Comments
2 min read
Deploying NVIDIA Dynamo & LMCache for LLMs: Installation, Containers, and Integration

Deploying NVIDIA Dynamo & LMCache for LLMs: Installation, Containers, and Integration

4
Comments 2
2 min read
Bifrost: The Fastest Open Source LLM Gateway

Bifrost: The Fastest Open Source LLM Gateway

2
Comments
4 min read
🏠 Self-Hosted AI Code Generation: The Complete Guide to Building Your Private AI Coding Assistant

🏠 Self-Hosted AI Code Generation: The Complete Guide to Building Your Private AI Coding Assistant

14
Comments 1
6 min read
The Poetic Hack: Exploiting LLMs with Verse by Arvind Sundararajan

The Poetic Hack: Exploiting LLMs with Verse by Arvind Sundararajan

Comments
2 min read
TOON: Token-Oriented Object Notation – A Complete Guide for LLM Data Efficiency

TOON: Token-Oriented Object Notation – A Complete Guide for LLM Data Efficiency

Comments 1
3 min read
Finally Got My Dify Agent Working in Discord, Telegram and Slack

Finally Got My Dify Agent Working in Discord, Telegram and Slack

4
Comments
3 min read
From 16-bit to 4-bit: The Architecture for Scalable Personalized LLM Deployment

From 16-bit to 4-bit: The Architecture for Scalable Personalized LLM Deployment

5
Comments
6 min read
Dense vs Sparse Retrieval: Mastering FAISS, BM25, and Hybrid Search

Dense vs Sparse Retrieval: Mastering FAISS, BM25, and Hybrid Search

1
Comments
15 min read
Prompt‑Powered User Personas: From Messy Logs to Living Profiles

Prompt‑Powered User Personas: From Messy Logs to Living Profiles

Comments 1
14 min read
Why your AI assistant lies to you (and how to fix it)

Why your AI assistant lies to you (and how to fix it)

Comments
4 min read
Prompt Length vs. Context Window: The Real Limits Behind LLM Performance

Prompt Length vs. Context Window: The Real Limits Behind LLM Performance

1
Comments 1
4 min read
Context-Optimized APIs: Designing MCP Servers for LLMs

Context-Optimized APIs: Designing MCP Servers for LLMs

1
Comments
5 min read
LangGraph Streaming 101: 5 Modes to Build Responsive AI Applications

LangGraph Streaming 101: 5 Modes to Build Responsive AI Applications

1
Comments 4
6 min read
đź‘‹ Sign in for the ability to sort posts by relevant, latest, or top.