Tom Eccles

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning

Jun 30, 2022
Julien Perolat, Bart de Vylder, Daniel Hennes, Eugene Tarassov, Florian Strub, Vincent de Boer, Paul Muller, Jerome T. Connor, Neil Burch, Thomas Anthony, Stephen McAleer, Romuald Elie, Sarah H. Cen, Zhe Wang, Audrunas Gruslys, Aleksandra Malysheva, Mina Khan, Sherjil Ozair, Finbarr Timbers, Toby Pohlen, Tom Eccles, Mark Rowland, Marc Lanctot, Jean-Baptiste Lespiau, Bilal Piot, Shayegan Omidshafiei, Edward Lockhart, Laurent Sifre, Nathalie Beauguerlange, Remi Munos, David Silver, Satinder Singh, Demis Hassabis, Karl Tuyls

A Generalist Agent

May 19, 2022
Scott Reed, Konrad Zolna, Emilio Parisotto, Sergio Gomez Colmenarejo, Alexander Novikov, Gabriel Barth-Maron, Mai Gimenez, Yury Sulsky, Jackie Kay, Jost Tobias Springenberg, Tom Eccles, Jake Bruce, Ali Razavi, Ashley Edwards, Nicolas Heess, Yutian Chen, Raia Hadsell, Oriol Vinyals, Mahyar Bordbar, Nando de Freitas

Human-Agent Cooperation in Bridge Bidding

Nov 28, 2020
Edward Lockhart, Neil Burch, Nolan Bard, Sebastian Borgeaud, Tom Eccles, Lucas Smaira, Ray Smith

Learning to Play No-Press Diplomacy with Best Response Policy Iteration

Jun 17, 2020
Thomas Anthony, Tom Eccles, Andrea Tacchetti, János Kramár, Ian Gemp, Thomas C. Hudson, Nicolas Porcel, Marc Lanctot, Julien Pérolat, Richard Everett, Satinder Singh, Thore Graepel, Yoram Bachrach

Learning to Resolve Alliance Dilemmas in Many-Player Zero-Sum Games

Feb 27, 2020
Edward Hughes, Thomas W. Anthony, Tom Eccles, Joel Z. Leibo, David Balduzzi, Yoram Bachrach

Biases for Emergent Communication in Multi-agent Reinforcement Learning

Dec 11, 2019
Tom Eccles, Yoram Bachrach, Guy Lever, Angeliki Lazaridou, Thore Graepel

Learning Reciprocity in Complex Sequential Social Dilemmas

Mar 19, 2019
Tom Eccles, Edward Hughes, János Kramár, Steven Wheelwright, Joel Z. Leibo

An investigation of model-free planning

Jan 11, 2019
Arthur Guez, Mehdi Mirza, Karol Gregor, Rishabh Kabra, Sébastien Racanière, Théophane Weber, David Raposo, Adam Santoro, Laurent Orseau, Tom Eccles, Greg Wayne, David Silver, Timothy Lillicrap
