Picture for Liu Kang

Liu Kang

IRPO: Scaling the Bradley-Terry Model via Reinforcement Learning

Add code
Jan 02, 2026
Viaarxiv icon