Shihong Deng

Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play

Mar 07, 2023
Wei Xi, Yongxin Zhang, Changnan Xiao, Xuefeng Huang, Shihong Deng, Haowei Liang, Jie Chen, Peng Sun

An Entropy Regularization Free Mechanism for Policy-based Reinforcement Learning

Jun 01, 2021
Changnan Xiao, Haosen Shi, Jiajun Fan, Shihong Deng

CASA: A Bridge Between Gradient of Policy Improvement and Policy Evaluation

May 27, 2021
Changnan Xiao, Haosen Shi, Jiajun Fan, Shihong Deng
