Picture for JunHyeok Oh

JunHyeok Oh

Offline Reinforcement Learning with Penalized Action Noise Injection

Add code
Jul 03, 2025
Viaarxiv icon

Rethinking DPO: The Role of Rejected Responses in Preference Misalignment

Add code
Jun 15, 2025
Viaarxiv icon

Prior-Guided Diffusion Planning for Offline Reinforcement Learning

Add code
May 16, 2025
Viaarxiv icon