Picture for Yangyang Zhou

Yangyang Zhou

RMGAP: Benchmarking the Generalization of Reward Models across Diverse Preferences

Add code
May 03, 2026
Viaarxiv icon