Alert button

Aligning Crowd Feedback via Distributional Preference Reward Modeling

Feb 15, 2024
Dexun Li, Cong Zhang, Kuicai Dong, Derrick Goh Xin Deik, Ruiming Tang, Yong Liu

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: