DEV Community
#llm
How to Evaluate LLM Output Quality Programmatically
Ayi NEDJIMI
Jun 16
#python #llm #ai #tutorial
1 reaction
5 min read
I let Claude and Codex argue about my code for a week. Here's what they caught.
Brian Mello
Jun 2
#ai #codereview #devtools #llm
5 min read
Claude Fable 5 is currently unavailable, here's what actually happened
Muhammad Moeed
Jun 16
#claude #ai #anthropic #llm
1 comment
2 min read
How to Cheat LLM Context: A Lightweight AI Doc Assistant Architecture
Piotr Zielinski
Jun 2
#ai #architecture #llm #rag
2 reactions
3 min read
I Feel Sorry for AI
Mark Huang
Jun 3
#discuss #ai #llm
1 reaction
9 min read
Skills + Dense-Mem: Making AI Workflows Learn From Experience
Mark Huang
Jun 2
#agents #ai #llm #rag
13 min read
JetBrains just open-sourced the missing piece of self-hosted AI pipelines
Andrew Kew
Jun 2
#ai #llm #programming #opensource
3 min read
Lum1104 — Understand-Anything
Muhammad Shoaib Syed
Jun 2
#ai #llm #opensource #tooling
3 min read
Hermes Agent: First Contact
Rob
Jun 2
#agents #llm #buildinginpublic #meta
5 min read
Enterprise AI doesn't need a better model. It needs smarter agent logic.
Andrew Kew
Jun 2
#ai #agents #llm #machinelearning
2 min read
I kept rewriting the same quiz + spaced-repetition code. So I packaged it into an API.
limack0
Jun 2
#showdev #api #learning #llm
1 min read
I built Huiyu Pi — a self-hosted AI coding agent that starts at ~80 tokens.
Huiyu
Jun 2
#showdev #agents #ai #llm
1 min read
GrepSeek Trains a Search Agent to Use Shell Commands: GRPO-Trained Shell-Command Search
pueding
Jun 2
#ai #agents #rag #llm
6 min read
GPT-5.6 Is Real (a Codex Log Says So) — Everything Else Is Made Up
tokenmixai
Jun 2
#openai #ai #llm #productivity
3 reactions
6 min read
Self-Hosted AI Risk Gate in 10 Minutes: Meet ITTE – Your Pre-Deploy Risk Brain with Self-Evolving Memory
LeonXx
Jun 2
#ai #cicd #llm #security
1 min read