Shimon Whiteson

Generalized Off-Policy Actor-Critic

Mar 27, 2019
Shangtong Zhang, Wendelin Boehmer, Shimon Whiteson

The StarCraft Multi-Agent Challenge

Feb 26, 2019
Mikayel Samvelyan, Tabish Rashid, Christian Schroeder de Witt, Gregory Farquhar, Nantas Nardelli, Tim G. J. Rudner, Chia-Man Hung, Philip H. S. Torr, Jakob Foerster, Shimon Whiteson

Fast Efficient Hyperparameter Tuning for Policy Gradients

Feb 18, 2019
Supratik Paul, Vitaly Kurin, Shimon Whiteson

Stable Opponent Shaping in Differentiable Games

Nov 20, 2018
Alistair Letcher, Jakob Foerster, David Balduzzi, Tim Rocktäschel, Shimon Whiteson

Learning from Demonstration in the Wild

Nov 08, 2018
Feryal Behbahani, Kyriacos Shiarlis, Xi Chen, Vitaly Kurin, Sudhanshu Kasewa, Ciprian Stirbu, João Gomes, Supratik Paul, Frans A. Oliehoek, João Messias, Shimon Whiteson

Multi-Agent Common Knowledge Reinforcement Learning

Nov 05, 2018
Jakob N. Foerster, Christian A. Schroeder de Witt, Gregory Farquhar, Philip H. S. Torr, Wendelin Boehmer, Shimon Whiteson

Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning

Nov 04, 2018
Jakob N. Foerster, Francis Song, Edward Hughes, Neil Burch, Iain Dunning, Shimon Whiteson, Matthew Botvinick, Michael Bowling

VIREL: A Variational Inference Framework for Reinforcement Learning

Nov 03, 2018
Matthew Fellows, Anuj Mahajan, Tim G. J. Rudner, Shimon Whiteson

CAML: Fast Context Adaptation via Meta-Learning

Oct 12, 2018
Luisa M Zintgraf, Kyriacos Shiarlis, Vitaly Kurin, Katja Hofmann, Shimon Whiteson
