Alert button
Picture for Joel Z. Leibo

Joel Z. Leibo

Alert button

Learning Reciprocity in Complex Sequential Social Dilemmas

Add code
Bookmark button
Alert button
Mar 19, 2019
Tom Eccles, Edward Hughes, János Kramár, Steven Wheelwright, Joel Z. Leibo

Figure 1 for Learning Reciprocity in Complex Sequential Social Dilemmas
Figure 2 for Learning Reciprocity in Complex Sequential Social Dilemmas
Figure 3 for Learning Reciprocity in Complex Sequential Social Dilemmas
Figure 4 for Learning Reciprocity in Complex Sequential Social Dilemmas
Viaarxiv icon

Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research

Add code
Bookmark button
Alert button
Mar 11, 2019
Joel Z. Leibo, Edward Hughes, Marc Lanctot, Thore Graepel

Figure 1 for Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research
Figure 2 for Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research
Viaarxiv icon

Malthusian Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 17, 2018
Joel Z. Leibo, Julien Perolat, Edward Hughes, Steven Wheelwright, Adam H. Marblestone, Edgar Duéñez-Guzmán, Peter Sunehag, Iain Dunning, Thore Graepel

Figure 1 for Malthusian Reinforcement Learning
Figure 2 for Malthusian Reinforcement Learning
Figure 3 for Malthusian Reinforcement Learning
Figure 4 for Malthusian Reinforcement Learning
Viaarxiv icon

Intrinsic Social Motivation via Causal Influence in Multi-Agent RL

Add code
Bookmark button
Alert button
Oct 19, 2018
Natasha Jaques, Angeliki Lazaridou, Edward Hughes, Caglar Gulcehre, Pedro A. Ortega, DJ Strouse, Joel Z. Leibo, Nando de Freitas

Figure 1 for Intrinsic Social Motivation via Causal Influence in Multi-Agent RL
Figure 2 for Intrinsic Social Motivation via Causal Influence in Multi-Agent RL
Figure 3 for Intrinsic Social Motivation via Causal Influence in Multi-Agent RL
Figure 4 for Intrinsic Social Motivation via Causal Influence in Multi-Agent RL
Viaarxiv icon

Inequity aversion improves cooperation in intertemporal social dilemmas

Add code
Bookmark button
Alert button
Sep 27, 2018
Edward Hughes, Joel Z. Leibo, Matthew G. Phillips, Karl Tuyls, Edgar A. Duéñez-Guzmán, Antonio García Castañeda, Iain Dunning, Tina Zhu, Kevin R. McKee, Raphael Koster, Heather Roff, Thore Graepel

Figure 1 for Inequity aversion improves cooperation in intertemporal social dilemmas
Figure 2 for Inequity aversion improves cooperation in intertemporal social dilemmas
Figure 3 for Inequity aversion improves cooperation in intertemporal social dilemmas
Figure 4 for Inequity aversion improves cooperation in intertemporal social dilemmas
Viaarxiv icon

Human-level performance in first-person multiplayer games with population-based deep reinforcement learning

Add code
Bookmark button
Alert button
Jul 03, 2018
Max Jaderberg, Wojciech M. Czarnecki, Iain Dunning, Luke Marris, Guy Lever, Antonio Garcia Castaneda, Charles Beattie, Neil C. Rabinowitz, Ari S. Morcos, Avraham Ruderman, Nicolas Sonnerat, Tim Green, Louise Deason, Joel Z. Leibo, David Silver, Demis Hassabis, Koray Kavukcuoglu, Thore Graepel

Viaarxiv icon

Unsupervised Predictive Memory in a Goal-Directed Agent

Add code
Bookmark button
Alert button
Mar 28, 2018
Greg Wayne, Chia-Chun Hung, David Amos, Mehdi Mirza, Arun Ahuja, Agnieszka Grabska-Barwinska, Jack Rae, Piotr Mirowski, Joel Z. Leibo, Adam Santoro, Mevlana Gemici, Malcolm Reynolds, Tim Harley, Josh Abramson, Shakir Mohamed, Danilo Rezende, David Saxton, Adam Cain, Chloe Hillier, David Silver, Koray Kavukcuoglu, Matt Botvinick, Demis Hassabis, Timothy Lillicrap

Figure 1 for Unsupervised Predictive Memory in a Goal-Directed Agent
Figure 2 for Unsupervised Predictive Memory in a Goal-Directed Agent
Figure 3 for Unsupervised Predictive Memory in a Goal-Directed Agent
Figure 4 for Unsupervised Predictive Memory in a Goal-Directed Agent
Viaarxiv icon

Kickstarting Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 10, 2018
Simon Schmitt, Jonathan J. Hudson, Augustin Zidek, Simon Osindero, Carl Doersch, Wojciech M. Czarnecki, Joel Z. Leibo, Heinrich Kuttler, Andrew Zisserman, Karen Simonyan, S. M. Ali Eslami

Figure 1 for Kickstarting Deep Reinforcement Learning
Figure 2 for Kickstarting Deep Reinforcement Learning
Figure 3 for Kickstarting Deep Reinforcement Learning
Figure 4 for Kickstarting Deep Reinforcement Learning
Viaarxiv icon

Psychlab: A Psychology Laboratory for Deep Reinforcement Learning Agents

Add code
Bookmark button
Alert button
Feb 04, 2018
Joel Z. Leibo, Cyprien de Masson d'Autume, Daniel Zoran, David Amos, Charles Beattie, Keith Anderson, Antonio García Castañeda, Manuel Sanchez, Simon Green, Audrunas Gruslys, Shane Legg, Demis Hassabis, Matthew M. Botvinick

Figure 1 for Psychlab: A Psychology Laboratory for Deep Reinforcement Learning Agents
Figure 2 for Psychlab: A Psychology Laboratory for Deep Reinforcement Learning Agents
Figure 3 for Psychlab: A Psychology Laboratory for Deep Reinforcement Learning Agents
Figure 4 for Psychlab: A Psychology Laboratory for Deep Reinforcement Learning Agents
Viaarxiv icon

Deep Q-learning from Demonstrations

Add code
Bookmark button
Alert button
Nov 22, 2017
Todd Hester, Matej Vecerik, Olivier Pietquin, Marc Lanctot, Tom Schaul, Bilal Piot, Dan Horgan, John Quan, Andrew Sendonaris, Gabriel Dulac-Arnold, Ian Osband, John Agapiou, Joel Z. Leibo, Audrunas Gruslys

Figure 1 for Deep Q-learning from Demonstrations
Figure 2 for Deep Q-learning from Demonstrations
Figure 3 for Deep Q-learning from Demonstrations
Viaarxiv icon