Alert button
Picture for Edward Hughes

Edward Hughes

Alert button

Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research

Add code
Bookmark button
Alert button
Mar 11, 2019
Joel Z. Leibo, Edward Hughes, Marc Lanctot, Thore Graepel

Figure 1 for Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research
Figure 2 for Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research
Viaarxiv icon

The Hanabi Challenge: A New Frontier for AI Research

Add code
Bookmark button
Alert button
Feb 01, 2019
Nolan Bard, Jakob N. Foerster, Sarath Chandar, Neil Burch, Marc Lanctot, H. Francis Song, Emilio Parisotto, Vincent Dumoulin, Subhodeep Moitra, Edward Hughes, Iain Dunning, Shibl Mourad, Hugo Larochelle, Marc G. Bellemare, Michael Bowling

Figure 1 for The Hanabi Challenge: A New Frontier for AI Research
Figure 2 for The Hanabi Challenge: A New Frontier for AI Research
Figure 3 for The Hanabi Challenge: A New Frontier for AI Research
Figure 4 for The Hanabi Challenge: A New Frontier for AI Research
Viaarxiv icon

Causal Reasoning from Meta-reinforcement Learning

Add code
Bookmark button
Alert button
Jan 23, 2019
Ishita Dasgupta, Jane Wang, Silvia Chiappa, Jovana Mitrovic, Pedro Ortega, David Raposo, Edward Hughes, Peter Battaglia, Matthew Botvinick, Zeb Kurth-Nelson

Figure 1 for Causal Reasoning from Meta-reinforcement Learning
Figure 2 for Causal Reasoning from Meta-reinforcement Learning
Figure 3 for Causal Reasoning from Meta-reinforcement Learning
Figure 4 for Causal Reasoning from Meta-reinforcement Learning
Viaarxiv icon

Malthusian Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 17, 2018
Joel Z. Leibo, Julien Perolat, Edward Hughes, Steven Wheelwright, Adam H. Marblestone, Edgar Duéñez-Guzmán, Peter Sunehag, Iain Dunning, Thore Graepel

Figure 1 for Malthusian Reinforcement Learning
Figure 2 for Malthusian Reinforcement Learning
Figure 3 for Malthusian Reinforcement Learning
Figure 4 for Malthusian Reinforcement Learning
Viaarxiv icon

Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 04, 2018
Jakob N. Foerster, Francis Song, Edward Hughes, Neil Burch, Iain Dunning, Shimon Whiteson, Matthew Botvinick, Michael Bowling

Figure 1 for Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Figure 2 for Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Figure 3 for Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Figure 4 for Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Viaarxiv icon

Intrinsic Social Motivation via Causal Influence in Multi-Agent RL

Add code
Bookmark button
Alert button
Oct 19, 2018
Natasha Jaques, Angeliki Lazaridou, Edward Hughes, Caglar Gulcehre, Pedro A. Ortega, DJ Strouse, Joel Z. Leibo, Nando de Freitas

Figure 1 for Intrinsic Social Motivation via Causal Influence in Multi-Agent RL
Figure 2 for Intrinsic Social Motivation via Causal Influence in Multi-Agent RL
Figure 3 for Intrinsic Social Motivation via Causal Influence in Multi-Agent RL
Figure 4 for Intrinsic Social Motivation via Causal Influence in Multi-Agent RL
Viaarxiv icon

Learning to Understand Goal Specifications by Modelling Reward

Add code
Bookmark button
Alert button
Oct 02, 2018
Dzmitry Bahdanau, Felix Hill, Jan Leike, Edward Hughes, Pushmeet Kohli, Edward Grefenstette

Figure 1 for Learning to Understand Goal Specifications by Modelling Reward
Figure 2 for Learning to Understand Goal Specifications by Modelling Reward
Figure 3 for Learning to Understand Goal Specifications by Modelling Reward
Figure 4 for Learning to Understand Goal Specifications by Modelling Reward
Viaarxiv icon

Inequity aversion improves cooperation in intertemporal social dilemmas

Add code
Bookmark button
Alert button
Sep 27, 2018
Edward Hughes, Joel Z. Leibo, Matthew G. Phillips, Karl Tuyls, Edgar A. Duéñez-Guzmán, Antonio García Castañeda, Iain Dunning, Tina Zhu, Kevin R. McKee, Raphael Koster, Heather Roff, Thore Graepel

Figure 1 for Inequity aversion improves cooperation in intertemporal social dilemmas
Figure 2 for Inequity aversion improves cooperation in intertemporal social dilemmas
Figure 3 for Inequity aversion improves cooperation in intertemporal social dilemmas
Figure 4 for Inequity aversion improves cooperation in intertemporal social dilemmas
Viaarxiv icon