DEV Community
Evaluating LLMs, For Real Series' Articles
Breaking down the accuracy number: Building an LLM Eval Harness From Scratch
Suman Nath
Jun 26
#machinelearning #llm #python #ai
1 comment
4 min read
LLM-as-a-Judge: I Built One From Scratch, Then Checked It Against Humans
Suman Nath
Jun 29
#machinelearning #llm #python #ai
4 min read
A Better LLM Judge? The Rubric Made My Small Model Worse
Suman Nath
Jun 29
#machinelearning #llm #python #ai
5 min read