Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Jiaxing Song

Optimal Mixture Weights for Off-Policy Evaluation with Multiple Behavior Policies


Nov 29, 2020
Jinlin Lai, Lixin Zou, Jiaxing Song

* Offline Reinforcement Learning Workshop at Neural Information Processing Systems, 2020 

  Access Paper or Ask Questions