Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

State-Aware Proximal Pessimistic Algorithms for Offline Reinforcement Learning


Nov 28, 2022
Chen Chen, Hongyao Tang, Yi Ma, Chao Wang, Qianli Shen, Dong Li, Jianye Hao

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation


Oct 26, 2022
Pengyi Li, Hongyao Tang, Jianye Hao, Yan Zheng, Xian Fu, Zhaopeng Meng

Add code

* The paper has been accpeted by Deep Reinforcement Learning Workshop, NeurIPS 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Towards A Unified Policy Abstraction Theory and Representation Learning Approach in Markov Decision Processes


Sep 16, 2022
Min Zhang, Hongyao Tang, Jianye Hao, Yan Zheng

Add code

* Preprint version 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations


Apr 06, 2022
Tong Sang, Hongyao Tang, Yi Ma, Jianye Hao, Yan Zheng, Zhaopeng Meng, Boyan Li, Zhen Wang

Add code

* Preprint, work presented at the Generalizable Policy Learning in the Physical World Workshop (ICLR 2022) 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration


Mar 16, 2022
Pengyi Li, Hongyao Tang, Tianpei Yang, Xiaotian Hao, Tong Sang, Yan Zheng, Jianye Hao, Matthew E. Taylor, Zhen Wang

Add code

* A preliminary version has been accepted on the Cooperative AI Workshop at 35th Conference on Neural Information Processing Systems (NeurIPS 2021) 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

ED2: An Environment Dynamics Decomposition Framework for World Model Construction


Dec 06, 2021
Cong Wang, Tianpei Yang, Jianye Hao, Yan Zheng, Hongyao Tang, Fazl Barez, Jinyi Liu, Jiajie Peng, Haiyin Piao, Zhixiao Sun

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning


Nov 19, 2021
Tong Sang, Hongyao Tang, Jianye Hao, Yan Zheng, Zhaopeng Meng

Add code

* This paper is accepted by The 3rd International Conference on Distributed Artificial Intelligence (DAI 2021, Shanghai, China) 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Exploration in Deep Reinforcement Learning: A Comprehensive Survey


Sep 15, 2021
Tianpei Yang, Hongyao Tang, Chenjia Bai, Jinyi Liu, Jianye Hao, Zhaopeng Meng, Peng Liu

Add code

* Repolishment is made, revise some incorrect descriptions 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation


Sep 12, 2021
Boyan Li, Hongyao Tang, Yan Zheng, Jianye Hao, Pengyi Li, Zhen Wang, Zhaopeng Meng, Li Wang

Add code

* 15 pages, preprint 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Addressing Action Oscillations through Learning Policy Inertia


Mar 03, 2021
Chen Chen, Hongyao Tang, Jianye Hao, Wulong Liu, Zhaopeng Meng

Add code

* Accepted paper on AAAI 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
3
>>