Alert button
Picture for Jakub Grudzien Kuba

Jakub Grudzien Kuba

Alert button

Mirror Learning: A Unifying Framework of Policy Optimisation

Add code
Bookmark button
Alert button
Jan 11, 2022
Jakub Grudzien Kuba, Christian Schroeder de Witt, Jakob Foerster

Figure 1 for Mirror Learning: A Unifying Framework of Policy Optimisation
Figure 2 for Mirror Learning: A Unifying Framework of Policy Optimisation
Figure 3 for Mirror Learning: A Unifying Framework of Policy Optimisation
Viaarxiv icon

Multi-Agent Constrained Policy Optimisation

Add code
Bookmark button
Alert button
Oct 06, 2021
Shangding Gu, Jakub Grudzien Kuba, Munning Wen, Ruiqing Chen, Ziyan Wang, Zheng Tian, Jun Wang, Alois Knoll, Yaodong Yang

Figure 1 for Multi-Agent Constrained Policy Optimisation
Figure 2 for Multi-Agent Constrained Policy Optimisation
Figure 3 for Multi-Agent Constrained Policy Optimisation
Figure 4 for Multi-Agent Constrained Policy Optimisation
Viaarxiv icon

Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Sep 23, 2021
Jakub Grudzien Kuba, Ruiqing Chen, Munning Wen, Ying Wen, Fanglei Sun, Jun Wang, Yaodong Yang

Figure 1 for Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Figure 2 for Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Figure 3 for Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Figure 4 for Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Viaarxiv icon

Settling the Variance of Multi-Agent Policy Gradients

Add code
Bookmark button
Alert button
Aug 20, 2021
Jakub Grudzien Kuba, Muning Wen, Yaodong Yang, Linghui Meng, Shangding Gu, Haifeng Zhang, David Henry Mguni, Jun Wang

Figure 1 for Settling the Variance of Multi-Agent Policy Gradients
Figure 2 for Settling the Variance of Multi-Agent Policy Gradients
Figure 3 for Settling the Variance of Multi-Agent Policy Gradients
Viaarxiv icon