Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment

Feb 15, 2024
Rui Yang, Xiaoman Pan, Feng Luo, Shuang Qiu, Han Zhong, Dong Yu, Jianshu Chen
