H. Francis Song

From Motor Control to Team Play in Simulated Humanoid Football

May 25, 2021
Siqi Liu, Guy Lever, Zhe Wang, Josh Merel, S. M. Ali Eslami, Daniel Hennes, Wojciech M. Czarnecki, Yuval Tassa, Shayegan Omidshafiei, Abbas Abdolmaleki, Noah Y. Siegel, Leonard Hasenclever, Luke Marris, Saran Tunyasuvunakool, H. Francis Song, Markus Wulfmeier, Paul Muller, Tuomas Haarnoja, Brendan D. Tracey, Karl Tuyls, Thore Graepel, Nicolas Heess

A Distributional View on Multi-Objective Policy Optimization

May 15, 2020
Abbas Abdolmaleki, Sandy H. Huang, Leonard Hasenclever, Michael Neunert, H. Francis Song, Martina Zambelli, Murilo F. Martins, Nicolas Heess, Raia Hadsell, Martin Riedmiller

Stabilizing Transformers for Reinforcement Learning

Oct 13, 2019
Emilio Parisotto, H. Francis Song, Jack W. Rae, Razvan Pascanu, Caglar Gulcehre, Siddhant M. Jayakumar, Max Jaderberg, Raphael Lopez Kaufman, Aidan Clark, Seb Noury, Matthew M. Botvinick, Nicolas Heess, Raia Hadsell

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control

Sep 26, 2019
H. Francis Song, Abbas Abdolmaleki, Jost Tobias Springenberg, Aidan Clark, Hubert Soyer, Jack W. Rae, Seb Noury, Arun Ahuja, Siqi Liu, Dhruva Tirumala, Nicolas Heess, Dan Belov, Martin Riedmiller, Matthew M. Botvinick

The Hanabi Challenge: A New Frontier for AI Research

Feb 01, 2019
Nolan Bard, Jakob N. Foerster, Sarath Chandar, Neil Burch, Marc Lanctot, H. Francis Song, Emilio Parisotto, Vincent Dumoulin, Subhodeep Moitra, Edward Hughes, Iain Dunning, Shibl Mourad, Hugo Larochelle, Marc G. Bellemare, Michael Bowling

Relational Forward Models for Multi-Agent Learning

Sep 28, 2018
Andrea Tacchetti, H. Francis Song, Pedro A. M. Mediano, Vinicius Zambaldi, Neil C. Rabinowitz, Thore Graepel, Matthew Botvinick, Peter W. Battaglia

Machine Theory of Mind

Mar 12, 2018
Neil C. Rabinowitz, Frank Perbet, H. Francis Song, Chiyuan Zhang, S. M. Ali Eslami, Matthew Botvinick
