Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Policy evaluation from a single path: Multi-step methods, mixing and mis-specification


Nov 07, 2022
Yaqi Duan, Martin J. Wainwright

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Near-optimal Offline Reinforcement Learning with Linear Representation: Leveraging Variance Information with Pessimism


Mar 11, 2022
Ming Yin, Yaqi Duan, Mengdi Wang, Yu-Xiang Wang

Add code

* ICLR 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Adaptive and Robust Multi-task Learning


Feb 10, 2022
Yaqi Duan, Kaizheng Wang

Add code

* 60 pages, 2 figures 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Optimal policy evaluation using kernel-based temporal difference methods


Sep 24, 2021
Yaqi Duan, Mengdi Wang, Martin J. Wainwright

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

PU-Flow: a Point Cloud Upsampling Networkwith Normalizing Flows


Jul 13, 2021
Aihua Mao, Zihui Du, Junhui Hou, Yaqi Duan, Yong-jin Liu, Ying He

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Learning Good State and Action Representations via Tensor Decomposition


May 03, 2021
Chengzhuo Ni, Anru Zhang, Yaqi Duan, Mengdi Wang

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Risk Bounds and Rademacher Complexity in Batch Reinforcement Learning


Mar 25, 2021
Yaqi Duan, Chi Jin, Zhiyuan Li

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Bootstrapping Statistical Inference for Off-Policy Evaluation


Feb 09, 2021
Botao Hao, Xiang Ji, Yaqi Duan, Hao Lu, Csaba Szepesvári, Mengdi Wang

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient


Nov 08, 2020
Botao Hao, Yaqi Duan, Tor Lattimore, Csaba Szepesvári, Mengdi Wang

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Minimax-Optimal Off-Policy Evaluation with Linear Function Approximation


Feb 21, 2020
Yaqi Duan, Mengdi Wang

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
>>