Alert button

Robust Preference Optimization with Provable Noise Tolerance for LLMs

Apr 05, 2024
Xize Liang, Chao Chen, Jie Wang, Yue Wu, Zhihang Fu, Zhihao Shi, Feng Wu, Jieping Ye

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: