Towards Understanding the Influence of Reward Margin on Preference Model Performance

Apr 07, 2024
Bowen Qin, Duanyu Feng, Xi Yang
