Alert button

Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment

Apr 18, 2024
Zhaofeng Wu, Ananth Balashankar, Yoon Kim, Jacob Eisenstein, Ahmad Beirami

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: