Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
llm
Follow
Hide
Posts
Left menu
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
The hidden cost of streaming LLMs: caches you can't use, bills you don't expect, and complexity you don't need
Ravi Patel
Ravi Patel
Ravi Patel
Follow
Jun 8
The hidden cost of streaming LLMs: caches you can't use, bills you don't expect, and complexity you don't need
#
llm
#
streaming
#
costoptimization
#
ux
Comments
1
 comment
11 min read
Unlocking the Power of RAG Systems with LangChain and Vector Databases
Naveen Malothu
Naveen Malothu
Naveen Malothu
Follow
Jun 4
Unlocking the Power of RAG Systems with LangChain and Vector Databases
#
ai
#
llm
#
python
Comments
Add Comment
3 min read
Switching our LLM-as-judge from 5-class to binary in CI: the patterns we kept
Ethan Walker
Ethan Walker
Ethan Walker
Follow
Jun 3
Switching our LLM-as-judge from 5-class to binary in CI: the patterns we kept
#
ai
#
llm
#
ci
#
machinelearning
Comments
Add Comment
3 min read
AirLLM Shrinks 70B LLMs to 4GB VRAM; DPO & Supermemory Boost Open Models
soy
soy
soy
Follow
Jun 3
AirLLM Shrinks 70B LLMs to 4GB VRAM; DPO & Supermemory Boost Open Models
#
ai
#
llm
#
selfhosted
Comments
Add Comment
3 min read
From Chatbots to Personal AI Agents: The Infrastructure Developers Actually Need
Mundo Ghose
Mundo Ghose
Mundo Ghose
Follow
Jun 8
From Chatbots to Personal AI Agents: The Infrastructure Developers Actually Need
#
llm
#
ai
#
agentskills
1
 reaction
Comments
Add Comment
18 min read
GLM 5.2: Zhipu's Open-Weight Frontier Model With 1M Context
Md Jamilur Rahman
Md Jamilur Rahman
Md Jamilur Rahman
Follow
Jun 17
GLM 5.2: Zhipu's Open-Weight Frontier Model With 1M Context
#
news
#
ai
#
llm
#
opensource
Comments
Add Comment
5 min read
RAG pilots fail when the sources are not ready
Mindtrovert Labs
Mindtrovert Labs
Mindtrovert Labs
Follow
Jun 4
RAG pilots fail when the sources are not ready
#
ai
#
llm
#
rag
#
productivity
Comments
Add Comment
2 min read
The most expensive bug in an AI agent is the one it's confident about
Andrii Krugliak
Andrii Krugliak
Andrii Krugliak
Follow
Jun 3
The most expensive bug in an AI agent is the one it's confident about
#
discuss
#
agents
#
ai
#
llm
Comments
Add Comment
3 min read
AWS Optimizes Starts, Adaptive Worms Rise, and LLM Memory Gets Local
Anikalp Jaiswal
Anikalp Jaiswal
Anikalp Jaiswal
Follow
Jun 4
AWS Optimizes Starts, Adaptive Worms Rise, and LLM Memory Gets Local
#
ai
#
technology
#
llm
#
programming
Comments
Add Comment
2 min read
# Enterprise RAG’s Biggest Risk: Answers That Look Correct but Aren’t
Anthony Jiang
Anthony Jiang
Anthony Jiang
Follow
Jun 3
# Enterprise RAG’s Biggest Risk: Answers That Look Correct but Aren’t
#
rag
#
ai
#
devops
#
llm
Comments
Add Comment
7 min read
I built a circuit breaker for LLM agents after seeing someone lose $200 overnight
BOSS_METALLIQUE
BOSS_METALLIQUE
BOSS_METALLIQUE
Follow
Jun 3
I built a circuit breaker for LLM agents after seeing someone lose $200 overnight
#
ai
#
python
#
llm
#
opensource
1
 reaction
Comments
Add Comment
6 min read
From Commerce to E-Commerce to MCP-Commerce: The Third Wave
Ben Habif Rudnik 🇨🇱🇮🇱🇺🇦
Ben Habif Rudnik 🇨🇱🇮🇱🇺🇦
Ben Habif Rudnik 🇨🇱🇮🇱🇺🇦
Follow
Jun 4
From Commerce to E-Commerce to MCP-Commerce: The Third Wave
#
agents
#
ai
#
llm
#
mcp
Comments
Add Comment
3 min read
Stop Spending $500/Month on API Calls: Build Your Own LLM Pipeline
Learn AI Resource
Learn AI Resource
Learn AI Resource
Follow
Jun 3
Stop Spending $500/Month on API Calls: Build Your Own LLM Pipeline
#
ai
#
llm
#
productivity
#
devops
Comments
Add Comment
3 min read
Over-editing is a token tax: GPT-5.4 ships 6.5x more diff per fix than Claude Opus 4.6, and your bill notices
John Medina
John Medina
John Medina
Follow
Jun 3
Over-editing is a token tax: GPT-5.4 ships 6.5x more diff per fix than Claude Opus 4.6, and your bill notices
#
llm
#
opensource
#
ai
#
costtracking
Comments
Add Comment
2 min read
Rodei IA de 35B na minha GPU velha e me surpreendi!
Marcelo Cabral Ghilardi
Marcelo Cabral Ghilardi
Marcelo Cabral Ghilardi
Follow
Jun 3
Rodei IA de 35B na minha GPU velha e me surpreendi!
#
ai
#
gpu
#
llm
#
quantizacao
Comments
Add Comment
4 min read
👋
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account