DEV Community

[Comment from a deleted post]
 
𝚂𝚊𝚞𝚛𝚊𝚋𝚑 𝚁𝚊𝚒

Really liked this article, and thanks for including the section on "Other Approaches to Evaluation". I was thinking along the same lines: in what other ways can we explore the capabilities of current LLMs?

Will you be looking into other models as well, such as Mistral, Llama 2, and the like?

Jeffrey Ip

There's actually a new model called Prometheus (huggingface.co/kaist-ai/prometheus...) that claims to be on par with GPT-4 for evaluation!