Constrained Update Projection Approach to Safe Policy Optimization


Sep 15, 2022
Long Yang, Jiaming Ji, Juntao Dai, Linrui Zhang, Binbin Zhou, Pengfei Li, Yaodong Yang, Gang Pan

* Accepted by NeurIPS2022. arXiv admin note: substantial text overlap with arXiv:2202.07565; text overlap with arXiv:2002.06506 by other authors 

Penalized Proximal Policy Optimization for Safe Reinforcement Learning


May 24, 2022
Linrui Zhang, Li Shen, Long Yang, Shixiang Chen, Bo Yuan, Xueqian Wang, Dacheng Tao

* IJCAI2022 

A Review of Safe Reinforcement Learning: Methods, Theory and Applications


May 23, 2022
Shangding Gu, Long Yang, Yali Du, Guang Chen, Florian Walter, Jun Wang, Yaodong Yang, Alois Knoll


CUP: A Conservative Update Policy Algorithm for Safe Reinforcement Learning


Feb 15, 2022
Long Yang, Jiaming Ji, Juntao Dai, Yu Zhang, Pengfei Li, Gang Pan


Secure Hybrid Beamforming for IRS-Assisted Millimeter Wave Systems


Jan 09, 2022
Xuan Xue, Yongchao Wang, Long Yang, Jian Chen


Thompson Sampling for Unimodal Bandits


Jun 16, 2021
Long Yang, Zhao Li, Zehong Hu, Shasha Ruan, Shijian Li, Gang Pan, Hongyang Chen

* Some technical parts need to be improved. We will fix them and provide an updated version 

On Convergence of Gradient Expected Sarsa($\lambda$)


Dec 14, 2020
Long Yang, Gang Zheng, Yu Zhang, Qian Zheng, Pengfei Li, Gang Pan

* This submission has been accepted by AAAI2021 

Sample Complexity of Policy Gradient Finding Second-Order Stationary Points


Dec 02, 2020
Long Yang, Qian Zheng, Gang Pan

* This submission has been accepted by AAAI2021 

Gradient Q$(\sigma, \lambda)$: A Unified Algorithm with Function Approximation for Reinforcement Learning


Sep 06, 2019
Long Yang, Yu Zhang, Qian Zheng, Pengfei Li, Gang Pan


FiDi-RL: Incorporating Deep Reinforcement Learning with Finite-Difference Policy Search for Efficient Learning of Continuous Control


Jul 10, 2019
Longxiang Shi, Shijian Li, Longbing Cao, Long Yang, Gang Zheng, Gang Pan

* 9 pages 
