Hung T. C. Le

Active Advantage-Aligned Online Reinforcement Learning with Offline Data

Feb 11, 2025
Via arXiv