Picture for Leo Cheng

Leo Cheng

UBP2: Uncertainty-Balanced Preference Planning for Efficient Preference-based Reinforcement Learning

Add code
Jun 17, 2026
Viaarxiv icon