Generalized Preference Optimization: A Unified Approach to Offline Alignment

Feb 08, 2024
Yunhao Tang, Zhaohan Daniel Guo, Zeyu Zheng, Daniele Calandriello, Rémi Munos, Mark Rowland, Pierre Harvey Richemond, Michal Valko, Bernardo Ávila Pires, Bilal Piot
