Picture for Yiqiang Lu

Yiqiang Lu

Ant Group, Shanghai, China

ROAD: Adaptive Data Mixing for Offline-to-Online Reinforcement Learning via Bi-Level Optimization

Add code
May 14, 2026
Viaarxiv icon