DEV Community

Cover image for On the Generalization of SFT: A Reinforcement Learning Perspective with RewardRectification
Paperium
Paperium

Posted on • Originally published at paperium.net

On the Generalization of SFT: A Reinforcement Learning Perspective with RewardRectification

{{ $json.postContent }}

Top comments (0)