Yaodong Yang

Online Markov Decision Processes with Non-oblivious Strategic Adversary

Oct 08, 2021
Le Cong Dinh, David Henry Mguni, Long Tran-Thanh, Jun Wang, Yaodong Yang

Via arXiv

Multi-Agent Constrained Policy Optimisation

Oct 06, 2021
Shangding Gu, Jakub Grudzien Kuba, Muning Wen, Ruiqing Chen, Ziyan Wang, Zheng Tian, Jun Wang, Alois Knoll, Yaodong Yang

Via arXiv

Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning

Sep 23, 2021
Jakub Grudzien Kuba, Ruiqing Chen, Muning Wen, Ying Wen, Fanglei Sun, Jun Wang, Yaodong Yang

Via arXiv

Revisiting the Characteristics of Stochastic Gradient Noise and Dynamics

Sep 20, 2021
Yixin Wu, Rui Luo, Chen Zhang, Jun Wang, Yaodong Yang

Via arXiv

On the Complexity of Computing Markov Perfect Equilibrium in General-Sum Stochastic Games

Sep 04, 2021
Xiaotie Deng, Yuhao Li, David Henry Mguni, Jun Wang, Yaodong Yang

Via arXiv

Settling the Variance of Multi-Agent Policy Gradients

Aug 20, 2021
Jakub Grudzien Kuba, Muning Wen, Yaodong Yang, Linghui Meng, Shangding Gu, Haifeng Zhang, David Henry Mguni, Jun Wang

Via arXiv

A Game-Theoretic Approach to Multi-Agent Trust Region Optimization

Jun 12, 2021
Ying Wen, Hui Chen, Yaodong Yang, Zheng Tian, Minne Li, Xu Chen, Jun Wang

Via arXiv

Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games

Jun 10, 2021
Xiangyu Liu, Hangtian Jia, Ying Wen, Yaodong Yang, Yujing Hu, Yingfeng Chen, Changjie Fan, Zhipeng Hu

Via arXiv