On Well-posedness and Minimax Optimal Rates of Nonparametric Q-function Estimation in Off-policy Evaluation



Xiaohong Chen, Zhengling Qi



Pessimistic Model Selection for Offline Deep Reinforcement Learning



Chao-Han Huck Yang, Zhengling Qi, Yifan Cui, Pin-Yu Chen

* Preprint. A non-archival, preliminary version was presented at the NeurIPS 2021 Offline Reinforcement Learning Workshop


Rejoinder: Learning Optimal Distributionally Robust Individualized Treatment Rules



Weibin Mo, Zhengling Qi, Yufeng Liu

* Journal of the American Statistical Association, 116:534, 699-707 (2021) 


Projected State-action Balancing Weights for Offline Reinforcement Learning



Jiayi Wang, Zhengling Qi, Raymond K. W. Wong



Proximal Learning for Individualized Treatment Regimes Under Unmeasured Confounding



Zhengling Qi, Rui Miao, Xiaoke Zhang



Robust Batch Policy Learning in Markov Decision Processes



Zhengling Qi, Peng Liao



Batch Policy Learning in Average Reward Markov Decision Processes



Peng Liao, Zhengling Qi, Susan Murphy



Learning Optimal Distributionally Robust Individualized Treatment Rules



Weibin Mo, Zhengling Qi, Yufeng Liu


