Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

A Unified Framework for Alternating Offline Model Training and Policy Learning


Oct 12, 2022
Shentao Yang, Shujian Zhang, Yihao Feng, Mingyuan Zhou

* 36th Conference on Neural Information Processing Systems (NeurIPS 2022) 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Metric Residual Networks for Sample Efficient Goal-conditioned Reinforcement Learning


Aug 17, 2022
Bo Liu, Yihao Feng, Qiang Liu, Peter Stone

* Goal-conditioned reinforcement learning, neural architecture design 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning


Jun 14, 2022
Shentao Yang, Yihao Feng, Shujian Zhang, Mingyuan Zhou

* International Conference on Machine Learning (ICML) 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

A Regularized Implicit Policy for Offline Reinforcement Learning


Feb 19, 2022
Shentao Yang, Zhendong Wang, Huangjie Zheng, Yihao Feng, Mingyuan Zhou


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning


Jan 01, 2022
Ziyang Tang, Yihao Feng, Qiang Liu


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Unsupervised Out-of-Domain Detection via Pre-trained Transformers


Jun 02, 2021
Keyang Xu, Tongzheng Ren, Shikun Zhang, Yihao Feng, Caiming Xiong

* Accepted by ACL 2021. Code is available at https://github.com/rivercold/BERT-unsupervised-OOD 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Incremental Few-shot Text Classification with Multi-round New Classes: Formulation, Dataset and System


Apr 24, 2021
Congying Xia, Wenpeng Yin, Yihao Feng, Philip Yu

* 10 pages, accepted to NAACL 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Non-asymptotic Confidence Intervals of Off-policy Evaluation: Primal and Dual Bounds


Mar 09, 2021
Yihao Feng, Ziyang Tang, Na Zhang, Qiang Liu

* 33 Pages, 5 figures, extended version of a paper with the same title accepted by ICLR2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Transparent Interpretation with Knockouts


Nov 01, 2020
Xing Han, Yihao Feng, Na Zhang, Qiang Liu


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Off-Policy Interval Estimation with Lipschitz Value Iteration


Oct 29, 2020
Ziyang Tang, Yihao Feng, Na Zhang, Jian Peng, Qiang Liu

* To appear at NeurIPS 2020 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
>>