DEV Community

Discussion on: How to Evaluate LLM Applications

Collapse
 
guybuildingai profile image
Jeffrey Ip

There's actually a new model called Prometheus (huggingface.co/kaist-ai/prometheus...), claims to be on par with gpt-4 for evaluation!