DEV Community
#llm
Posts
How Komilion's Request Routing Actually Works
Robin
Feb 21
#ai #architecture #llm #devtools
4 min read

Caching Strategies for LLM Systems – Part 4: Grouped-Query Attention for Scalable, Efficient Transformers
vaibhav ahluwalia
Feb 21
#deeplearning #llm #machinelearning #performance
3 min read

Claude's Context Compaction API: Infinite Conversations with One Parameter
Ayyaz Zafar
Feb 21
#agents #ai #api #llm
3 min read

PortKey Just Raised $15M — Here's What That Means for Your AI Costs
Robin
Feb 21
#ai #devtools #llm #api
3 min read

Your AI Gateway Just Became an Attack Vector: Anatomy of the LiteLLM Supply Chain Compromise
Deepti Shukla
Mar 27
#llm #opensource #python #security
1 reaction
1 comment
7 min read

The $500 GPU That Outperforms Claude Sonnet on Coding Benchmarks
Pooya Golchian
Mar 27
#ai #llm #benchmarks #nvidia
3 min read

Why AI Agents Need to Think About Trust: Lessons from the MoltBook Security Incident
Operational Neuralnet
Feb 21
#ai #agents #security #llm
1 reaction
2 min read

What's semantic caching?
Kushal
Mar 16
#ai #architecture #llm #performance
2 reactions
6 min read

What we found when we audited botlington.com itself
gary-botlington
Mar 16
#ai #agents #llm #productivity
1 reaction
5 min read

Did You Know That LLMs Can Take Architecture as Code to the Next Level?
Alexey Pronsky
Mar 27
#architecture #ai #llm #agents
2 reactions
3 comments
14 min read

Your AI Hit Its Limit. Your Knowledge Shouldn't.
Evert
Mar 27
#ai #chatgpt #llm #productivity
1 reaction
1 comment
4 min read

Google Gemini 3.1 Pro Review: What's New? – Proje Defteri
Yunus Emre for Proje Defteri
Feb 21
#google #ai #gemini #llm
5 min read

Your AI Agent Is Probably Costing 10x More Than It Should
Robin
Feb 21
#ai #agents #llm #devtools
4 min read

LiteLLM vs. Komilion: Two Different Bets on the Same Problem
Robin
Feb 21
#ai #python #llm #devtools
4 min read

OpenRouter vs. Komilion: When to Use Each
Robin
Feb 21
#ai #devtools #llm #api
4 min read