Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning

Apr 09, 2024
Xudong Yu, Chenjia Bai, Hongyi Guo, Changhong Wang, Zhen Wang
