Policy Finetuning: Bridging Sample-Efficient Offline and Online Reinforcement Learning

Jun 09, 2021
Tengyang Xie, Nan Jiang, Huan Wang, Caiming Xiong, Yu Bai

View paper on arXiv