Alert button
Picture for Guy Lever

Guy Lever

Alert button

Replay across Experiments: A Natural Extension of Off-Policy RL

Nov 28, 2023
Dhruva Tirumala, Thomas Lampe, Jose Enrique Chen, Tuomas Haarnoja, Sandy Huang, Guy Lever, Ben Moran, Tim Hertweck, Leonard Hasenclever, Martin Riedmiller, Nicolas Heess, Markus Wulfmeier

Viaarxiv icon

Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning

Apr 26, 2023
Tuomas Haarnoja, Ben Moran, Guy Lever, Sandy H. Huang, Dhruva Tirumala, Markus Wulfmeier, Jan Humplik, Saran Tunyasuvunakool, Noah Y. Siegel, Roland Hafner, Michael Bloesch, Kristian Hartikainen, Arunkumar Byravan, Leonard Hasenclever, Yuval Tassa, Fereshteh Sadeghi, Nathan Batchelor, Federico Casarini, Stefano Saliceti, Charles Game, Neil Sreendra, Kushal Patel, Marlon Gwira, Andrea Huber, Nicole Hurley, Francesco Nori, Raia Hadsell, Nicolas Heess

Figure 1 for Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Figure 2 for Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Figure 3 for Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Figure 4 for Learning Agile Soccer Skills for a Bipedal Robot with Deep Reinforcement Learning
Viaarxiv icon

Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments

Sep 22, 2022
Ian Gemp, Thomas Anthony, Yoram Bachrach, Avishkar Bhoopchand, Kalesha Bullard, Jerome Connor, Vibhavari Dasagi, Bart De Vylder, Edgar Duenez-Guzman, Romuald Elie, Richard Everett, Daniel Hennes, Edward Hughes, Mina Khan, Marc Lanctot, Kate Larson, Guy Lever, Siqi Liu, Luke Marris, Kevin R. McKee, Paul Muller, Julien Perolat, Florian Strub, Andrea Tacchetti, Eugene Tarassov, Zhe Wang, Karl Tuyls

Viaarxiv icon

From Motor Control to Team Play in Simulated Humanoid Football

May 25, 2021
Siqi Liu, Guy Lever, Zhe Wang, Josh Merel, S. M. Ali Eslami, Daniel Hennes, Wojciech M. Czarnecki, Yuval Tassa, Shayegan Omidshafiei, Abbas Abdolmaleki, Noah Y. Siegel, Leonard Hasenclever, Luke Marris, Saran Tunyasuvunakool, H. Francis Song, Markus Wulfmeier, Paul Muller, Tuomas Haarnoja, Brendan D. Tracey, Karl Tuyls, Thore Graepel, Nicolas Heess

Figure 1 for From Motor Control to Team Play in Simulated Humanoid Football
Figure 2 for From Motor Control to Team Play in Simulated Humanoid Football
Figure 3 for From Motor Control to Team Play in Simulated Humanoid Football
Figure 4 for From Motor Control to Team Play in Simulated Humanoid Football
Viaarxiv icon

Biases for Emergent Communication in Multi-agent Reinforcement Learning

Dec 11, 2019
Tom Eccles, Yoram Bachrach, Guy Lever, Angeliki Lazaridou, Thore Graepel

Figure 1 for Biases for Emergent Communication in Multi-agent Reinforcement Learning
Figure 2 for Biases for Emergent Communication in Multi-agent Reinforcement Learning
Figure 3 for Biases for Emergent Communication in Multi-agent Reinforcement Learning
Figure 4 for Biases for Emergent Communication in Multi-agent Reinforcement Learning
Viaarxiv icon

A Generalized Training Approach for Multiagent Learning

Sep 27, 2019
Paul Muller, Shayegan Omidshafiei, Mark Rowland, Karl Tuyls, Julien Perolat, Siqi Liu, Daniel Hennes, Luke Marris, Marc Lanctot, Edward Hughes, Zhe Wang, Guy Lever, Nicolas Heess, Thore Graepel, Remi Munos

Figure 1 for A Generalized Training Approach for Multiagent Learning
Figure 2 for A Generalized Training Approach for Multiagent Learning
Figure 3 for A Generalized Training Approach for Multiagent Learning
Figure 4 for A Generalized Training Approach for Multiagent Learning
Viaarxiv icon

Emergent Coordination Through Competition

Feb 21, 2019
Siqi Liu, Guy Lever, Josh Merel, Saran Tunyasuvunakool, Nicolas Heess, Thore Graepel

Figure 1 for Emergent Coordination Through Competition
Figure 2 for Emergent Coordination Through Competition
Figure 3 for Emergent Coordination Through Competition
Figure 4 for Emergent Coordination Through Competition
Viaarxiv icon

Human-level performance in first-person multiplayer games with population-based deep reinforcement learning

Jul 03, 2018
Max Jaderberg, Wojciech M. Czarnecki, Iain Dunning, Luke Marris, Guy Lever, Antonio Garcia Castaneda, Charles Beattie, Neil C. Rabinowitz, Ari S. Morcos, Avraham Ruderman, Nicolas Sonnerat, Tim Green, Louise Deason, Joel Z. Leibo, David Silver, Demis Hassabis, Koray Kavukcuoglu, Thore Graepel

Viaarxiv icon

Value-Decomposition Networks For Cooperative Multi-Agent Learning

Jun 16, 2017
Peter Sunehag, Guy Lever, Audrunas Gruslys, Wojciech Marian Czarnecki, Vinicius Zambaldi, Max Jaderberg, Marc Lanctot, Nicolas Sonnerat, Joel Z. Leibo, Karl Tuyls, Thore Graepel

Figure 1 for Value-Decomposition Networks For Cooperative Multi-Agent Learning
Figure 2 for Value-Decomposition Networks For Cooperative Multi-Agent Learning
Viaarxiv icon

Nesterov's Accelerated Gradient and Momentum as approximations to Regularised Update Descent

Jul 11, 2016
Aleksandar Botev, Guy Lever, David Barber

Figure 1 for Nesterov's Accelerated Gradient and Momentum as approximations to Regularised Update Descent
Figure 2 for Nesterov's Accelerated Gradient and Momentum as approximations to Regularised Update Descent
Figure 3 for Nesterov's Accelerated Gradient and Momentum as approximations to Regularised Update Descent
Viaarxiv icon