Alert button
Picture for Max Jaderberg

Max Jaderberg

Alert button

Open-ended Learning in Symmetric Zero-sum Games

Add code
Bookmark button
Alert button
Jan 23, 2019
David Balduzzi, Marta Garnelo, Yoram Bachrach, Wojciech M. Czarnecki, Julien Perolat, Max Jaderberg, Thore Graepel

Figure 1 for Open-ended Learning in Symmetric Zero-sum Games
Figure 2 for Open-ended Learning in Symmetric Zero-sum Games
Figure 3 for Open-ended Learning in Symmetric Zero-sum Games
Figure 4 for Open-ended Learning in Symmetric Zero-sum Games
Viaarxiv icon

Human-level performance in first-person multiplayer games with population-based deep reinforcement learning

Add code
Bookmark button
Alert button
Jul 03, 2018
Max Jaderberg, Wojciech M. Czarnecki, Iain Dunning, Luke Marris, Guy Lever, Antonio Garcia Castaneda, Charles Beattie, Neil C. Rabinowitz, Ari S. Morcos, Avraham Ruderman, Nicolas Sonnerat, Tim Green, Louise Deason, Joel Z. Leibo, David Silver, Demis Hassabis, Koray Kavukcuoglu, Thore Graepel

Viaarxiv icon

Unsupervised Learning of 3D Structure from Images

Add code
Bookmark button
Alert button
Jun 19, 2018
Danilo Jimenez Rezende, S. M. Ali Eslami, Shakir Mohamed, Peter Battaglia, Max Jaderberg, Nicolas Heess

Figure 1 for Unsupervised Learning of 3D Structure from Images
Figure 2 for Unsupervised Learning of 3D Structure from Images
Figure 3 for Unsupervised Learning of 3D Structure from Images
Figure 4 for Unsupervised Learning of 3D Structure from Images
Viaarxiv icon

Mix&Match - Agent Curricula for Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 05, 2018
Wojciech Marian Czarnecki, Siddhant M. Jayakumar, Max Jaderberg, Leonard Hasenclever, Yee Whye Teh, Simon Osindero, Nicolas Heess, Razvan Pascanu

Figure 1 for Mix&Match - Agent Curricula for Reinforcement Learning
Figure 2 for Mix&Match - Agent Curricula for Reinforcement Learning
Figure 3 for Mix&Match - Agent Curricula for Reinforcement Learning
Figure 4 for Mix&Match - Agent Curricula for Reinforcement Learning
Viaarxiv icon

Population Based Training of Neural Networks

Add code
Bookmark button
Alert button
Nov 28, 2017
Max Jaderberg, Valentin Dalibard, Simon Osindero, Wojciech M. Czarnecki, Jeff Donahue, Ali Razavi, Oriol Vinyals, Tim Green, Iain Dunning, Karen Simonyan, Chrisantha Fernando, Koray Kavukcuoglu

Figure 1 for Population Based Training of Neural Networks
Figure 2 for Population Based Training of Neural Networks
Figure 3 for Population Based Training of Neural Networks
Figure 4 for Population Based Training of Neural Networks
Viaarxiv icon

Sobolev Training for Neural Networks

Add code
Bookmark button
Alert button
Jul 26, 2017
Wojciech Marian Czarnecki, Simon Osindero, Max Jaderberg, Grzegorz Świrszcz, Razvan Pascanu

Figure 1 for Sobolev Training for Neural Networks
Figure 2 for Sobolev Training for Neural Networks
Figure 3 for Sobolev Training for Neural Networks
Figure 4 for Sobolev Training for Neural Networks
Viaarxiv icon

Decoupled Neural Interfaces using Synthetic Gradients

Add code
Bookmark button
Alert button
Jul 03, 2017
Max Jaderberg, Wojciech Marian Czarnecki, Simon Osindero, Oriol Vinyals, Alex Graves, David Silver, Koray Kavukcuoglu

Figure 1 for Decoupled Neural Interfaces using Synthetic Gradients
Figure 2 for Decoupled Neural Interfaces using Synthetic Gradients
Figure 3 for Decoupled Neural Interfaces using Synthetic Gradients
Figure 4 for Decoupled Neural Interfaces using Synthetic Gradients
Viaarxiv icon

Grounded Language Learning in a Simulated 3D World

Add code
Bookmark button
Alert button
Jun 26, 2017
Karl Moritz Hermann, Felix Hill, Simon Green, Fumin Wang, Ryan Faulkner, Hubert Soyer, David Szepesvari, Wojciech Marian Czarnecki, Max Jaderberg, Denis Teplyashin, Marcus Wainwright, Chris Apps, Demis Hassabis, Phil Blunsom

Figure 1 for Grounded Language Learning in a Simulated 3D World
Figure 2 for Grounded Language Learning in a Simulated 3D World
Figure 3 for Grounded Language Learning in a Simulated 3D World
Figure 4 for Grounded Language Learning in a Simulated 3D World
Viaarxiv icon

Value-Decomposition Networks For Cooperative Multi-Agent Learning

Add code
Bookmark button
Alert button
Jun 16, 2017
Peter Sunehag, Guy Lever, Audrunas Gruslys, Wojciech Marian Czarnecki, Vinicius Zambaldi, Max Jaderberg, Marc Lanctot, Nicolas Sonnerat, Joel Z. Leibo, Karl Tuyls, Thore Graepel

Figure 1 for Value-Decomposition Networks For Cooperative Multi-Agent Learning
Figure 2 for Value-Decomposition Networks For Cooperative Multi-Agent Learning
Viaarxiv icon

FeUdal Networks for Hierarchical Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 06, 2017
Alexander Sasha Vezhnevets, Simon Osindero, Tom Schaul, Nicolas Heess, Max Jaderberg, David Silver, Koray Kavukcuoglu

Figure 1 for FeUdal Networks for Hierarchical Reinforcement Learning
Figure 2 for FeUdal Networks for Hierarchical Reinforcement Learning
Figure 3 for FeUdal Networks for Hierarchical Reinforcement Learning
Figure 4 for FeUdal Networks for Hierarchical Reinforcement Learning
Viaarxiv icon