DEV Community
#evals

All I Want for Christmas is Observable Multi-Modal Agentic Systems
Scarlett Attensil for LaunchDarkly
Dec 17 '25
#observability #ai #evals #agents
8 min read

LLM evaluation guide: When to add online evals to your AI application
Scarlett Attensil for LaunchDarkly
Dec 17 '25
#evals #agents #ai #observability
5 min read

From Prototype to Production: 10 Metrics for Reliable AI Agents
Navya Yadav
Nov 27 '25
#ai #aiops #evals #development
10 min read

Why Data Management Makes or Breaks Your AI Agent Evaluations
Navya Yadav
Nov 27 '25
#ai #evals #agents #observability
7 min read

AI Hallucinations in 2025: Causes, Impact, and Solutions for Trustworthy AI
Navya Yadav
Oct 27 '25
#aiops #hallucinations #llm #evals
5 reactions
6 min read

LLM evaluation: a quick overview of Stax
Thomas Ezan
Oct 23 '25
#ai #evals #genai
2 min read

Why Your AI Agent Is Failing (and How to Fix It)
Seth Rose
Aug 13 '25
#ai #agentaichallenge #promptengineering #evals
1 comment
2 min read

The Hidden Risks of Testing AI-Powered Features with Traditional Tools
Shashank Arora
Oct 24 '24
#evals #testing #ai
3 min read

HoloDeck Part 1: Why Building AI Agents Feels So Broken
Jeremiah Justin Barias
Dec 6 '25
#ai #agents #evals
3 min read