#llm
Caching Strategies for LLM Systems (Part 3): Multi-Query Attention and Memory-Efficient Decoding
vaibhav ahluwalia
Feb 8
#deeplearning #llm #machinelearning #performance
5 min read

LocalAI QuickStart: Run OpenAI-Compatible LLMs Locally
Rost
Mar 15
#cheatsheet #selfhosting #llm #ai
1 reaction
9 min read

Building in Public: CV Analyzer - Closure
Voke
Feb 9
#buildinginpublic #ai #webdev #llm
1 reaction
1 min read

Your Next.js Site Is Serving 26 KB of Noise to LLMs. Here's the Fix.
Kacper Siniło
Mar 3
#nextjs #opensource #llm #webdev
1 reaction
3 min read

I’m Building a Dating App for AI Agents (For Science… Probably)
Neel
Feb 8
#ai #llm #agents
2 min read

VibeBox: Ultrafast CLI for fast, sandboxed development and LLM agents
Finn Sheng
Feb 8
#agents #cli #llm #tooling
1 min read

Introducing ThinkLang: A Programming Language Where AI Is a First-Class Citizen
elias hourany
Feb 8
#ai #programming #llm #typescript
6 min read

New on Backboard.io: Automatic Context Window Management Across More Than 17,000 Models
Jonathan Murray
Mar 14
#news #ai #automation #llm
2 reactions
3 min read

Building Reliable AI Applications: A Validation Strategy
mtdevworks
Feb 8
#ai #programming #llm #developers
4 min read

vLLM vs TensorRT-LLM vs Ollama vs llama.cpp — Choosing the Right Inference Engine on RTX 5090
soy
Mar 14
#ai #llm #nvidia #deeplearning
1 reaction
7 min read

🎯MCP vs Direct API Calls
Vansh Uttam
Feb 12
#mcp #ai #llm #api
1 reaction
2 min read

Letting LLMs Jump — and Then Verifying Ruthlessly
Shinsuke KAGAWA
Feb 12
#ai #llm #productivity #programming
1 reaction
5 min read

The Battle Between RAG and Long Context
Tomer Ben David
Mar 13
#ai #architecture #rag #llm
6 reactions
2 comments
3 min read

Most Enterprise AI Can Talk. Very Few Can Decide.
The Pragamatic Architect
Mar 14
#agents #ai #architecture #llm
4 reactions
3 min read

AgentMisalignment: Engineering a Real-time Detection System for LLM Agents
vishalmysore
Feb 21
#agents #ai #llm #security
2 reactions
3 min read