Yatish Hosmane Revanasiddappa

Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation

Mar 13, 2026
Via arXiv