Learning Domain Invariant Representations in Goal-conditioned Block MDPs


Oct 28, 2021
Beining Han, Chongyi Zheng, Harris Chan, Keiran Paster, Michael R. Zhang, Jimmy Ba

* NeurIPS 2021
* 33 pages 


On the Estimation Bias in Double Q-Learning


Sep 29, 2021
Zhizhou Ren, Guangxiang Zhu, Hao Hu, Beining Han, Jianglun Chen, Chongjie Zhang

* Thirty-Fifth Conference on Neural Information Processing Systems (NeurIPS 2021) 


Off-Policy Reinforcement Learning with Delayed Rewards


Jun 22, 2021
Beining Han, Zhizhou Ren, Zuofan Wu, Yuan Zhou, Jian Peng

* 24 pages 


Off-Policy Multi-Agent Decomposed Policy Gradients


Jul 24, 2020
Yihan Wang, Beining Han, Tonghan Wang, Heng Dong, Chongjie Zhang



Towards Understanding Linear Value Decomposition in Cooperative Multi-Agent Q-Learning


Jun 23, 2020
Jianhao Wang, Zhizhou Ren, Beining Han, Chongjie Zhang

