Picture for Seokhun Ju

Seokhun Ju

Policy-labeled Preference Learning: Is Preference Enough for RLHF?

Add code
May 13, 2025
Viaarxiv icon

Tractable and Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation

Add code
Jul 31, 2024
Viaarxiv icon