Alert button
Picture for Remi Munos

Remi Munos

Alert button

Multiagent Evaluation under Incomplete Information

Add code
Bookmark button
Alert button
Oct 30, 2019
Mark Rowland, Shayegan Omidshafiei, Karl Tuyls, Julien Perolat, Michal Valko, Georgios Piliouras, Remi Munos

Figure 1 for Multiagent Evaluation under Incomplete Information
Figure 2 for Multiagent Evaluation under Incomplete Information
Figure 3 for Multiagent Evaluation under Incomplete Information
Figure 4 for Multiagent Evaluation under Incomplete Information
Viaarxiv icon

A Generalized Training Approach for Multiagent Learning

Add code
Bookmark button
Alert button
Sep 27, 2019
Paul Muller, Shayegan Omidshafiei, Mark Rowland, Karl Tuyls, Julien Perolat, Siqi Liu, Daniel Hennes, Luke Marris, Marc Lanctot, Edward Hughes, Zhe Wang, Guy Lever, Nicolas Heess, Thore Graepel, Remi Munos

Figure 1 for A Generalized Training Approach for Multiagent Learning
Figure 2 for A Generalized Training Approach for Multiagent Learning
Figure 3 for A Generalized Training Approach for Multiagent Learning
Figure 4 for A Generalized Training Approach for Multiagent Learning
Viaarxiv icon

Neural Replicator Dynamics

Add code
Bookmark button
Alert button
Jun 01, 2019
Shayegan Omidshafiei, Daniel Hennes, Dustin Morrill, Remi Munos, Julien Perolat, Marc Lanctot, Audrunas Gruslys, Jean-Baptiste Lespiau, Karl Tuyls

Figure 1 for Neural Replicator Dynamics
Figure 2 for Neural Replicator Dynamics
Figure 3 for Neural Replicator Dynamics
Figure 4 for Neural Replicator Dynamics
Viaarxiv icon

The Termination Critic

Add code
Bookmark button
Alert button
Feb 26, 2019
Anna Harutyunyan, Will Dabney, Diana Borsa, Nicolas Heess, Remi Munos, Doina Precup

Figure 1 for The Termination Critic
Figure 2 for The Termination Critic
Figure 3 for The Termination Critic
Figure 4 for The Termination Critic
Viaarxiv icon

The Uncertainty Bellman Equation and Exploration

Add code
Bookmark button
Alert button
Oct 22, 2018
Brendan O'Donoghue, Ian Osband, Remi Munos, Volodymyr Mnih

Figure 1 for The Uncertainty Bellman Equation and Exploration
Figure 2 for The Uncertainty Bellman Equation and Exploration
Figure 3 for The Uncertainty Bellman Equation and Exploration
Figure 4 for The Uncertainty Bellman Equation and Exploration
Viaarxiv icon

Actor-Critic Policy Optimization in Partially Observable Multiagent Environments

Add code
Bookmark button
Alert button
Oct 21, 2018
Sriram Srinivasan, Marc Lanctot, Vinicius Zambaldi, Julien Perolat, Karl Tuyls, Remi Munos, Michael Bowling

Figure 1 for Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Figure 2 for Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Figure 3 for Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Figure 4 for Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
Viaarxiv icon

IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

Add code
Bookmark button
Alert button
Jun 28, 2018
Lasse Espeholt, Hubert Soyer, Remi Munos, Karen Simonyan, Volodymir Mnih, Tom Ward, Yotam Doron, Vlad Firoiu, Tim Harley, Iain Dunning, Shane Legg, Koray Kavukcuoglu

Figure 1 for IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
Figure 2 for IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
Figure 3 for IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
Figure 4 for IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures
Viaarxiv icon

The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 19, 2018
Audrunas Gruslys, Will Dabney, Mohammad Gheshlaghi Azar, Bilal Piot, Marc Bellemare, Remi Munos

Figure 1 for The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Figure 2 for The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Figure 3 for The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Figure 4 for The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning
Viaarxiv icon

Maximum a Posteriori Policy Optimisation

Add code
Bookmark button
Alert button
Jun 14, 2018
Abbas Abdolmaleki, Jost Tobias Springenberg, Yuval Tassa, Remi Munos, Nicolas Heess, Martin Riedmiller

Figure 1 for Maximum a Posteriori Policy Optimisation
Figure 2 for Maximum a Posteriori Policy Optimisation
Figure 3 for Maximum a Posteriori Policy Optimisation
Figure 4 for Maximum a Posteriori Policy Optimisation
Viaarxiv icon