JunHyeok Oh

Rethinking DPO: The Role of Rejected Responses in Preference Misalignment

Jun 15, 2025, via arXiv

Prior-Guided Diffusion Planning for Offline Reinforcement Learning

May 16, 2025, via arXiv