Picture for JunHyeok Oh

JunHyeok Oh

TABX: A High-Throughput Sandbox Battle Simulator for Multi-Agent Reinforcement Learning

Add code
Feb 02, 2026
Viaarxiv icon

Offline Reinforcement Learning with Penalized Action Noise Injection

Add code
Jul 03, 2025
Viaarxiv icon

Rethinking DPO: The Role of Rejected Responses in Preference Misalignment

Add code
Jun 15, 2025
Viaarxiv icon

Prior-Guided Diffusion Planning for Offline Reinforcement Learning

Add code
May 16, 2025
Viaarxiv icon