Alert button

Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage

May 16, 2023
Jose Blanchet, Miao Lu, Tong Zhang, Han Zhong

Figure 1 for Double Pessimism is Provably Efficient for Distributionally Robust Offline Reinforcement Learning: Generic Algorithm and Robust Partial Coverage

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: