Alert button
Picture for Yaodong Yang

Yaodong Yang

Alert button

Efficient Policy Space Response Oracles

Add code
Bookmark button
Alert button
Feb 17, 2022
Ming Zhou, Jingxiao Chen, Ying Wen, Weinan Zhang, Yaodong Yang, Yong Yu

Figure 1 for Efficient Policy Space Response Oracles
Figure 2 for Efficient Policy Space Response Oracles
Figure 3 for Efficient Policy Space Response Oracles
Figure 4 for Efficient Policy Space Response Oracles
Viaarxiv icon

Understanding Value Decomposition Algorithms in Deep Cooperative Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 16, 2022
Zehao Dou, Jakub Grudzien Kuba, Yaodong Yang

Viaarxiv icon

Settling the Communication Complexity for Distributed Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 10, 2022
Juliusz Krysztof Ziomek, Jun Wang, Yaodong Yang

Figure 1 for Settling the Communication Complexity for Distributed Offline Reinforcement Learning
Viaarxiv icon

Settling the Bias and Variance of Meta-Gradient Estimation for Meta-Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 31, 2021
Bo Liu, Xidong Feng, Haifeng Zhang, Jun Wang, Yaodong Yang

Figure 1 for Settling the Bias and Variance of Meta-Gradient Estimation for Meta-Reinforcement Learning
Figure 2 for Settling the Bias and Variance of Meta-Gradient Estimation for Meta-Reinforcement Learning
Figure 3 for Settling the Bias and Variance of Meta-Gradient Estimation for Meta-Reinforcement Learning
Figure 4 for Settling the Bias and Variance of Meta-Gradient Estimation for Meta-Reinforcement Learning
Viaarxiv icon

Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks

Add code
Bookmark button
Alert button
Dec 20, 2021
Linghui Meng, Muning Wen, Yaodong Yang, Chenyang Le, Xiyun Li, Weinan Zhang, Ying Wen, Haifeng Zhang, Jun Wang, Bo Xu

Figure 1 for Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
Figure 2 for Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
Figure 3 for Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
Figure 4 for Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks
Viaarxiv icon

Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Conquers All StarCraftII Tasks

Add code
Bookmark button
Alert button
Dec 06, 2021
Linghui Meng, Muning Wen, Yaodong Yang, Chenyang Le, Xiyun Li, Weinan Zhang, Ying Wen, Haifeng Zhang, Jun Wang, Bo Xu

Figure 1 for Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Conquers All StarCraftII Tasks
Figure 2 for Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Conquers All StarCraftII Tasks
Figure 3 for Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Conquers All StarCraftII Tasks
Figure 4 for Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Conquers All StarCraftII Tasks
Viaarxiv icon

A Game-Theoretic Approach for Improving Generalization Ability of TSP Solvers

Add code
Bookmark button
Alert button
Oct 29, 2021
Chenguang Wang, Yaodong Yang, Oliver Slumbers, Congying Han, Tiande Guo, Haifeng Zhang, Jun Wang

Figure 1 for A Game-Theoretic Approach for Improving Generalization Ability of TSP Solvers
Figure 2 for A Game-Theoretic Approach for Improving Generalization Ability of TSP Solvers
Figure 3 for A Game-Theoretic Approach for Improving Generalization Ability of TSP Solvers
Figure 4 for A Game-Theoretic Approach for Improving Generalization Ability of TSP Solvers
Viaarxiv icon

DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention

Add code
Bookmark button
Alert button
Oct 27, 2021
David Mguni, Joel Jennings, Taher Jafferjee, Aivar Sootla, Yaodong Yang, Changmin Yu, Usman Islam, Ziyan Wang, Jun Wang

Figure 1 for DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention
Figure 2 for DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention
Figure 3 for DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention
Figure 4 for DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention
Viaarxiv icon

Measuring the Non-Transitivity in Chess

Add code
Bookmark button
Alert button
Oct 22, 2021
Ricky Sanjaya, Jun Wang, Yaodong Yang

Figure 1 for Measuring the Non-Transitivity in Chess
Figure 2 for Measuring the Non-Transitivity in Chess
Figure 3 for Measuring the Non-Transitivity in Chess
Figure 4 for Measuring the Non-Transitivity in Chess
Viaarxiv icon