Alert button

WARM: On the Benefits of Weight Averaged Reward Models

Jan 22, 2024
Alexandre Ramé, Nino Vieillard, Léonard Hussenot, Robert Dadashi, Geoffrey Cideron, Olivier Bachem, Johan Ferret

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: