Alert button
Picture for Mikayel Samvelyan

Mikayel Samvelyan

Alert button

Evolving Curricula with Regret-Based Environment Design

Mar 08, 2022
Jack Parker-Holder, Minqi Jiang, Michael Dennis, Mikayel Samvelyan, Jakob Foerster, Edward Grefenstette, Tim Rocktäschel

Figure 1 for Evolving Curricula with Regret-Based Environment Design
Figure 2 for Evolving Curricula with Regret-Based Environment Design
Figure 3 for Evolving Curricula with Regret-Based Environment Design
Figure 4 for Evolving Curricula with Regret-Based Environment Design
Viaarxiv icon

Generalization in Cooperative Multi-Agent Systems

Jan 31, 2022
Anuj Mahajan, Mikayel Samvelyan, Tarun Gupta, Benjamin Ellis, Mingfei Sun, Tim Rocktäschel, Shimon Whiteson

Figure 1 for Generalization in Cooperative Multi-Agent Systems
Figure 2 for Generalization in Cooperative Multi-Agent Systems
Figure 3 for Generalization in Cooperative Multi-Agent Systems
Figure 4 for Generalization in Cooperative Multi-Agent Systems
Viaarxiv icon

Reinforcement Learning in Factored Action Spaces using Tensor Decompositions

Oct 27, 2021
Anuj Mahajan, Mikayel Samvelyan, Lei Mao, Viktor Makoviychuk, Animesh Garg, Jean Kossaifi, Shimon Whiteson, Yuke Zhu, Animashree Anandkumar

Figure 1 for Reinforcement Learning in Factored Action Spaces using Tensor Decompositions
Figure 2 for Reinforcement Learning in Factored Action Spaces using Tensor Decompositions
Figure 3 for Reinforcement Learning in Factored Action Spaces using Tensor Decompositions
Figure 4 for Reinforcement Learning in Factored Action Spaces using Tensor Decompositions
Viaarxiv icon

MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research

Sep 27, 2021
Mikayel Samvelyan, Robert Kirk, Vitaly Kurin, Jack Parker-Holder, Minqi Jiang, Eric Hambro, Fabio Petroni, Heinrich Küttler, Edward Grefenstette, Tim Rocktäschel

Figure 1 for MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Figure 2 for MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Figure 3 for MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Figure 4 for MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Viaarxiv icon

Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning

May 31, 2021
Anuj Mahajan, Mikayel Samvelyan, Lei Mao, Viktor Makoviychuk, Animesh Garg, Jean Kossaifi, Shimon Whiteson, Yuke Zhu, Animashree Anandkumar

Figure 1 for Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning
Figure 2 for Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning
Figure 3 for Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning
Figure 4 for Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning
Viaarxiv icon

Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

Mar 19, 2020
Tabish Rashid, Mikayel Samvelyan, Christian Schroeder de Witt, Gregory Farquhar, Jakob Foerster, Shimon Whiteson

Figure 1 for Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Figure 2 for Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Figure 3 for Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Figure 4 for Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Viaarxiv icon

MAVEN: Multi-Agent Variational Exploration

Oct 16, 2019
Anuj Mahajan, Tabish Rashid, Mikayel Samvelyan, Shimon Whiteson

Figure 1 for MAVEN: Multi-Agent Variational Exploration
Figure 2 for MAVEN: Multi-Agent Variational Exploration
Figure 3 for MAVEN: Multi-Agent Variational Exploration
Figure 4 for MAVEN: Multi-Agent Variational Exploration
Viaarxiv icon

The StarCraft Multi-Agent Challenge

Feb 26, 2019
Mikayel Samvelyan, Tabish Rashid, Christian Schroeder de Witt, Gregory Farquhar, Nantas Nardelli, Tim G. J. Rudner, Chia-Man Hung, Philip H. S. Torr, Jakob Foerster, Shimon Whiteson

Figure 1 for The StarCraft Multi-Agent Challenge
Figure 2 for The StarCraft Multi-Agent Challenge
Figure 3 for The StarCraft Multi-Agent Challenge
Figure 4 for The StarCraft Multi-Agent Challenge
Viaarxiv icon

QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

Jun 06, 2018
Tabish Rashid, Mikayel Samvelyan, Christian Schroeder de Witt, Gregory Farquhar, Jakob Foerster, Shimon Whiteson

Figure 1 for QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Figure 2 for QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Figure 3 for QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Figure 4 for QMIX: Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Viaarxiv icon