Skip to content
Navigation menu
Search
Powered by Algolia
Search
Log in
Create account
DEV Community
Close
#
llm
Follow
Hide
Posts
Left menu
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
Right menu
When to Move Beyond LiteLLM (And When Not To)
Sahajmeet Kaur
Sahajmeet Kaur
Sahajmeet Kaur
Follow
Jun 13
When to Move Beyond LiteLLM (And When Not To)
#
ai
#
mcp
#
llm
#
claude
1
 reaction
Comments
Add Comment
6 min read
I Built a Python Agent That Uses a Vector DB as Memory, Not Retrieval
Greg Mate
Greg Mate
Greg Mate
Follow
Jun 11
I Built a Python Agent That Uses a Vector DB as Memory, Not Retrieval
#
ai
#
python
#
vectordatabase
#
llm
11
 reactions
Comments
8
 comments
6 min read
Claude Code Source Analysis Series, Chapter 5: Tools Overview
LienJack
LienJack
LienJack
Follow
May 10
Claude Code Source Analysis Series, Chapter 5: Tools Overview
#
agents
#
architecture
#
claude
#
llm
1
 reaction
Comments
Add Comment
12 min read
Tool-Response Engineering: The Frontier Beyond Prompt Engineering
Simon Reiff
Simon Reiff
Simon Reiff
Follow
for
HIC AI, Inc.
May 12
Tool-Response Engineering: The Frontier Beyond Prompt Engineering
#
agents
#
ai
#
llm
#
tooling
Comments
Add Comment
17 min read
Most RAG failures don’t crash. They silently return bad answers. I built a repair layer for that.
BN
BN
BN
Follow
May 10
Most RAG failures don’t crash. They silently return bad answers. I built a repair layer for that.
#
rag
#
llm
#
ai
#
deterministic
Comments
Add Comment
1 min read
On-device LLM on iPhone: which runtime is fastest? MLX vs llama.cpp vs LiteRT-LM vs CoreML
Daisuke Majima
Daisuke Majima
Daisuke Majima
Follow
Jun 2
On-device LLM on iPhone: which runtime is fastest? MLX vs llama.cpp vs LiteRT-LM vs CoreML
#
ios
#
machinelearning
#
llm
#
swift
1
 reaction
Comments
1
 comment
4 min read
Agents assemble. One agent is a hire. Many agents are a workforce.
The Pragamatic Architect
The Pragamatic Architect
The Pragamatic Architect
Follow
May 9
Agents assemble. One agent is a hire. Many agents are a workforce.
#
agents
#
ai
#
architecture
#
llm
Comments
Add Comment
5 min read
Gemma 4: Frontier AI in Your Hands”
Sam Keb
Sam Keb
Sam Keb
Follow
May 11
Gemma 4: Frontier AI in Your Hands”
#
ai
#
devchallenge
#
google
#
llm
1
 reaction
Comments
Add Comment
2 min read
BeeLlama.cpp enhances llama.cpp, Qwen 35B hits 128K context, iOS local LLMs with Ollama
soy
soy
soy
Follow
May 9
BeeLlama.cpp enhances llama.cpp, Qwen 35B hits 128K context, iOS local LLMs with Ollama
#
ai
#
llm
#
selfhosted
Comments
Add Comment
3 min read
Deterministic reliability stack for LLM pipelines
BN
BN
BN
Follow
May 9
Deterministic reliability stack for LLM pipelines
#
ai
#
llm
#
mlops
#
rag
Comments
Add Comment
1 min read
LLM Token Counting and Cost Optimization: A Practical Guide
Ayi NEDJIMI
Ayi NEDJIMI
Ayi NEDJIMI
Follow
May 23
LLM Token Counting and Cost Optimization: A Practical Guide
#
python
#
ai
#
llm
#
tutorial
1
 reaction
Comments
Add Comment
5 min read
Generation 1 — Standalone Models (2018–2022)
Raghavendra Govindu
Raghavendra Govindu
Raghavendra Govindu
Follow
May 9
Generation 1 — Standalone Models (2018–2022)
#
ai
#
deeplearning
#
llm
#
nlp
Comments
Add Comment
5 min read
Why Most WordPress SEO Plugins Are Not Ready for AI Search Yet
AEO God Mode (Answer Engine Optimization for Wordpress)
AEO God Mode (Answer Engine Optimization for Wordpress)
AEO God Mode (Answer Engine Optimization for Wordpress)
Follow
May 10
Why Most WordPress SEO Plugins Are Not Ready for AI Search Yet
#
wordpress
#
seo
#
ai
#
llm
Comments
Add Comment
5 min read
A Survey of LLM-based Deep Search Agents Adaptive Path Planning via Weighted A* and Heuristic Rewards
24P-0507 Muhammad Uzair Shoaib
24P-0507 Muhammad Uzair Shoaib
24P-0507 Muhammad Uzair Shoaib
Follow
May 9
A Survey of LLM-based Deep Search Agents Adaptive Path Planning via Weighted A* and Heuristic Rewards
#
agents
#
ai
#
algorithms
#
llm
Comments
Add Comment
4 min read
Evaluating LLM code reviewers: an offline harness for precision, recall, and routing"
Prakhar Singh
Prakhar Singh
Prakhar Singh
Follow
May 13
Evaluating LLM code reviewers: an offline harness for precision, recall, and routing"
#
llm
#
codereview
#
evaluation
#
ai
2
 reactions
Comments
Add Comment
5 min read
đź‘‹
Sign in
for the ability to sort posts by
relevant
,
latest
, or
top
.
We're a place where coders share, stay up-to-date and grow their careers.
Log in
Create account