Paperium

Originally published at paperium.net

Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking

