Deheng Ye

Tencent Inc

Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization

Feb 05, 2023
Zichuan Lin, Xiapeng Wu, Mingfei Sun, Deheng Ye, Qiang Fu, Wei Yang, Wei Liu

Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning

Jan 20, 2023
Haoxuan Pan, Deheng Ye, Xiaoming Duan, Qiang Fu, Wei Yang, Jianping He, Mingfei Sun

A Survey on Transformers in Reinforcement Learning

Jan 08, 2023
Wenzhe Li, Hao Luo, Zichuan Lin, Chongjie Zhang, Zongqing Lu, Deheng Ye

RLogist: Fast Observation Strategy on Whole-slide Images with Deep Reinforcement Learning

Dec 13, 2022
Boxuan Zhao, Jun Zhang, Deheng Ye, Jian Cao, Xiao Han, Qiang Fu, Wei Yang

Pretraining in Deep Reinforcement Learning: A Survey

Nov 08, 2022
Zhihui Xie, Zichuan Lin, Junyou Li, Shuai Li, Deheng Ye

Curriculum-based Asymmetric Multi-task Reinforcement Learning

Nov 07, 2022
Hanchi Huang, Deheng Ye, Li Shen, Wei Liu

Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation

Oct 19, 2022
Chengqian Gao, Ke Xu, Liu Liu, Deheng Ye, Peilin Zhao, Zhiqiang Xu

Honor of Kings Arena: an Environment for Generalization in Competitive Reinforcement Learning

Oct 09, 2022
Hua Wei, Jingxiao Chen, Xiyang Ji, Hongyang Qin, Minwen Deng, Siqin Li, Liang Wang, Weinan Zhang, Yong Yu, Lin Liu, Lanxiao Huang, Deheng Ye, Qiang Fu, Wei Yang

More Centralized Training, Still Decentralized Execution: Multi-Agent Conditional Policy Factorization

Sep 26, 2022
Jiangxing Wang, Deheng Ye, Zongqing Lu
