DEV Community
#llm
RAG in the Wild: What I Learned After Two Weeks of Chunking Experiments
Moon Robert
Mar 8
#rag #vectordatabases #llm #embeddings
2 comments
7 min read

How to Reduce OpenAI Bill Without Hurting Quality: A Practical Audit Framework
Daniel R. Foster for OptyxStack
Mar 8
#ai #openai #llm #softwareengineering
6 reactions
3 comments
6 min read

Monitor LLM Inference in Production (2026): Prometheus & Grafana for vLLM, TGI, llama.cpp
Rost
Mar 4
#monitoring #hosting #selfhosting #llm
1 reaction
9 min read

I Read a Paper That Genuinely Made Me Stop and Think — AI is Now Jailbreaking Other AI
Aaryan Shukla
Mar 4
#discuss #ai #llm #machinelearning
3 min read

Your AI Gateway Just Became an Attack Vector: Anatomy of the LiteLLM Supply Chain Compromise
Deepti Shukla
Mar 27
#llm #opensource #python #security
1 reaction
1 comment
7 min read

Why Your RAG System Returns Garbage (And How to Actually Fix It)
Alan West
Mar 27
#rag #llm #python #ai
5 min read

Six Characters Fixed My AI's Personality: A Fine-Tuning Story
Meridian_AI
Mar 17
#ai #machinelearning #llm #engineering
4 min read

Why Your AI Agents Are Burning Cash and How to Fix It
Alan West
Mar 27
#ai #llm #agents #python
5 min read

Llama vs Mistral vs Phi: Complete Open-Source LLM Comparison for Enterprise (2026)
Jaipal Singh
Mar 4
#ai #llm #opensource #enterprise
16 min read

Introducing llm-lean-log: Token-Efficient Chat Logging for AI Agents
Luu Vinh Loc
Mar 4
#ai #llm #logger
4 min read

LLMs - How Did They Get So Good?
Joshua Ballanco
Mar 17
#ai #programming #machinelearning #llm
4 reactions
2 comments
13 min read

Revolutionary LLM‑Generated Helm Charts: Build, Test, Deploy in Minutes
myroslav mokhammad abdeljawwad
Mar 4
#helm #kubernetes #llm #automation
1 reaction
5 min read

Fine-tuning vs RAG: Cuándo Usar Cada Enfoque para LLMs en Producción
Moon Robert
Mar 4
#llm #finetuning #rag #inteligenciaartificial
8 min read

vLLM vs SGLang vs LMDeploy: Fastest LLM Inference Engine in 2026?
Jaipal Singh
Mar 5
#ai #llm #cloud #devops
1 reaction
9 min read
