Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards

Feb 29, 2024
Haoxiang Wang, Yong Lin, Wei Xiong, Rui Yang, Shizhe Diao, Shuang Qiu, Han Zhao, Tong Zhang
