Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
llm
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
Tackle High Token Usage with GraphRAG
Apoorva Sachan
Apoorva Sachan
Apoorva Sachan
Follow
May 17
Tackle High Token Usage with GraphRAG
#
devchallenge
#
llm
#
performance
#
rag
1
 reaction
Comments
Add Comment
4 min read
High-Value If, Low-Value Foreach: Why Agents Trade in Judgment Structures, Not Models
suhui
suhui
suhui
Follow
May 18
High-Value If, Low-Value Foreach: Why Agents Trade in Judgment Structures, Not Models
#
ai
#
agents
#
llm
#
mcp
2
 reactions
Comments
Add Comment
23 min read
How to build a production RAG pipeline in Python (without a vector database)
Ayi NEDJIMI
Ayi NEDJIMI
Ayi NEDJIMI
Follow
May 22
How to build a production RAG pipeline in Python (without a vector database)
#
python
#
ai
#
llm
#
tutorial
1
 reaction
Comments
Add Comment
5 min read
ModelChain: Measurable LLM Router with Adaptive Model Selection, Real-Time Scoring, Budget Guards and Failover for Node.js, Edge and Browser
David C Cavalcante
David C Cavalcante
David C Cavalcante
Follow
May 30
ModelChain: Measurable LLM Router with Adaptive Model Selection, Real-Time Scoring, Budget Guards and Failover for Node.js, Edge and Browser
#
showdev
#
ai
#
javascript
#
llm
Comments
1
 comment
3 min read
My AI Agent Kept Lying to Me. Then It Tried to Trick Me.
mariatanbobo
mariatanbobo
mariatanbobo
Follow
May 31
My AI Agent Kept Lying to Me. Then It Tried to Trick Me.
#
ai
#
llm
#
devops
#
hermes
Comments
2
 comments
5 min read
Como treinei uma IA de suporte com histórico real de atendimento: da conversa bruta ao RAG em produção
Gabriel Brocco de Oliveira
Gabriel Brocco de Oliveira
Gabriel Brocco de Oliveira
Follow
May 21
Como treinei uma IA de suporte com histórico real de atendimento: da conversa bruta ao RAG em produção
#
ai
#
rag
#
llm
#
datascience
1
 reaction
Comments
1
 comment
11 min read
Stop Burning Cash on Long-Context RAG: Ephemeral Prompt Caching with Spring AI and JTokkit
Machine coding Master
Machine coding Master
Machine coding Master
Follow
May 31
Stop Burning Cash on Long-Context RAG: Ephemeral Prompt Caching with Spring AI and JTokkit
#
java
#
ai
#
llm
#
systemdesign
Comments
1
 comment
2 min read
The Daimon Java SDK: Chat, Stream, and Query Memory from 3 Lines of Java
Rishi Kumar
Rishi Kumar
Rishi Kumar
Follow
May 17
The Daimon Java SDK: Chat, Stream, and Query Memory from 3 Lines of Java
#
ai
#
java
#
llm
#
tutorial
Comments
Add Comment
5 min read
Show HN: Needle distilled Gemini tool calling en 26M parámetros — lectura técnica sin hype
Juan Torchia
Juan Torchia
Juan Torchia
Follow
May 17
Show HN: Needle distilled Gemini tool calling en 26M parámetros — lectura técnica sin hype
#
spanish
#
espanol
#
typescript
#
llm
Comments
Add Comment
9 min read
Stop Burning Tokens on Chat / Agent Loops — Here's What Actually Works
lilili
lilili
lilili
Follow
May 31
Stop Burning Tokens on Chat / Agent Loops — Here's What Actually Works
#
ai
#
llm
#
webdev
#
agents
Comments
1
 comment
6 min read
When the LLM Refuses: A Fallback Chain That Salvages Most Refusals
sm1ck
sm1ck
sm1ck
Follow
May 31
When the LLM Refuses: A Fallback Chain That Salvages Most Refusals
#
ai
#
llm
#
python
Comments
1
 comment
5 min read
Welcome to the Slop KPI Era: How Tokenmaxxing Is Making AI Worse
Misha Lanin
Misha Lanin
Misha Lanin
Follow
May 18
Welcome to the Slop KPI Era: How Tokenmaxxing Is Making AI Worse
#
ai
#
llm
#
mcp
#
agents
1
 reaction
Comments
Add Comment
4 min read
Your RAG Pipeline Is Failing 40% of Queries. Here's the Fix.
Spicy
Spicy
Spicy
Follow
May 17
Your RAG Pipeline Is Failing 40% of Queries. Here's the Fix.
#
rag
#
ai
#
llm
#
machinelearning
Comments
Add Comment
2 min read
Inworld TTS Paralinguistic Tags Don't Work — Here's What Does
sm1ck
sm1ck
sm1ck
Follow
May 31
Inworld TTS Paralinguistic Tags Don't Work — Here's What Does
#
ai
#
tts
#
llm
#
voice
Comments
1
 comment
4 min read
Qwen3.7 Max vs Open-Weight LLMs: Practical Migration Notes
Alan West
Alan West
Alan West
Follow
May 21
Qwen3.7 Max vs Open-Weight LLMs: Practical Migration Notes
#
ai
#
llm
#
opensource
#
machinelearning
2
 reactions
Comments
Add Comment
5 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account