Picture for Ray Zhou

Ray Zhou

A Unifying Lens on Reward Uncertainty in RLHF

Add code
Jun 08, 2026
Viaarxiv icon