Alert button
Picture for Shimon Whiteson

Shimon Whiteson

Alert button

The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning

Jun 10, 2020
Maximilian Igl, Gregory Farquhar, Jelena Luketina, Wendelin Boehmer, Shimon Whiteson

Figure 1 for The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning
Figure 2 for The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning
Figure 3 for The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning
Figure 4 for The Impact of Non-stationarity on Generalisation in Deep Reinforcement Learning
Viaarxiv icon

AI-QMIX: Attention and Imagination for Dynamic Multi-Agent Reinforcement Learning

Jun 07, 2020
Shariq Iqbal, Christian A. Schroeder de Witt, Bei Peng, Wendelin Böhmer, Shimon Whiteson, Fei Sha

Figure 1 for AI-QMIX: Attention and Imagination for Dynamic Multi-Agent Reinforcement Learning
Figure 2 for AI-QMIX: Attention and Imagination for Dynamic Multi-Agent Reinforcement Learning
Figure 3 for AI-QMIX: Attention and Imagination for Dynamic Multi-Agent Reinforcement Learning
Figure 4 for AI-QMIX: Attention and Imagination for Dynamic Multi-Agent Reinforcement Learning
Viaarxiv icon

Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning

May 27, 2020
Shangtong Zhang, Bo Liu, Shimon Whiteson

Figure 1 for Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
Figure 2 for Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
Figure 3 for Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
Figure 4 for Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
Viaarxiv icon

Privileged Information Dropout in Reinforcement Learning

May 19, 2020
Pierre-Alexandre Kamienny, Kai Arulkumaran, Feryal Behbahani, Wendelin Boehmer, Shimon Whiteson

Figure 1 for Privileged Information Dropout in Reinforcement Learning
Figure 2 for Privileged Information Dropout in Reinforcement Learning
Figure 3 for Privileged Information Dropout in Reinforcement Learning
Figure 4 for Privileged Information Dropout in Reinforcement Learning
Viaarxiv icon

Maximizing Information Gain in Partially Observable Environments via Prediction Reward

May 11, 2020
Yash Satsangi, Sungsu Lim, Shimon Whiteson, Frans Oliehoek, Martha White

Figure 1 for Maximizing Information Gain in Partially Observable Environments via Prediction Reward
Figure 2 for Maximizing Information Gain in Partially Observable Environments via Prediction Reward
Figure 3 for Maximizing Information Gain in Partially Observable Environments via Prediction Reward
Figure 4 for Maximizing Information Gain in Partially Observable Environments via Prediction Reward
Viaarxiv icon

Per-Step Reward: A New Perspective for Risk-Averse Reinforcement Learning

Apr 22, 2020
Shangtong Zhang, Bo Liu, Shimon Whiteson

Figure 1 for Per-Step Reward: A New Perspective for Risk-Averse Reinforcement Learning
Figure 2 for Per-Step Reward: A New Perspective for Risk-Averse Reinforcement Learning
Figure 3 for Per-Step Reward: A New Perspective for Risk-Averse Reinforcement Learning
Figure 4 for Per-Step Reward: A New Perspective for Risk-Averse Reinforcement Learning
Viaarxiv icon

Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

Mar 19, 2020
Tabish Rashid, Mikayel Samvelyan, Christian Schroeder de Witt, Gregory Farquhar, Jakob Foerster, Shimon Whiteson

Figure 1 for Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Figure 2 for Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Figure 3 for Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Figure 4 for Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Viaarxiv icon

Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control

Mar 18, 2020
Christian Schroeder de Witt, Bei Peng, Pierre-Alexandre Kamienny, Philip Torr, Wendelin Böhmer, Shimon Whiteson

Figure 1 for Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control
Figure 2 for Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control
Figure 3 for Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control
Figure 4 for Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control
Viaarxiv icon