Alert button
Picture for Shimon Whiteson

Shimon Whiteson

Alert button

University of Oxford

Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control

Add code
Bookmark button
Alert button
Mar 14, 2020
Christian Schroeder de Witt, Bei Peng, Pierre-Alexandre Kamienny, Philip Torr, Wendelin Böhmer, Shimon Whiteson

Figure 1 for Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control
Figure 2 for Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control
Figure 3 for Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control
Figure 4 for Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control
Viaarxiv icon

Optimistic Exploration even with a Pessimistic Initialisation

Add code
Bookmark button
Alert button
Feb 26, 2020
Tabish Rashid, Bei Peng, Wendelin Böhmer, Shimon Whiteson

Figure 1 for Optimistic Exploration even with a Pessimistic Initialisation
Figure 2 for Optimistic Exploration even with a Pessimistic Initialisation
Figure 3 for Optimistic Exploration even with a Pessimistic Initialisation
Figure 4 for Optimistic Exploration even with a Pessimistic Initialisation
Viaarxiv icon

Reinforcement Learning Enhanced Quantum-inspired Algorithm for Combinatorial Optimization

Add code
Bookmark button
Alert button
Feb 14, 2020
Dmitrii Beloborodov, A. E. Ulanov, Jakob N. Foerster, Shimon Whiteson, A. I. Lvovsky

Figure 1 for Reinforcement Learning Enhanced Quantum-inspired Algorithm for Combinatorial Optimization
Figure 2 for Reinforcement Learning Enhanced Quantum-inspired Algorithm for Combinatorial Optimization
Figure 3 for Reinforcement Learning Enhanced Quantum-inspired Algorithm for Combinatorial Optimization
Figure 4 for Reinforcement Learning Enhanced Quantum-inspired Algorithm for Combinatorial Optimization
Viaarxiv icon

GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values

Add code
Bookmark button
Alert button
Feb 07, 2020
Shangtong Zhang, Bo Liu, Shimon Whiteson

Figure 1 for GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values
Figure 2 for GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values
Figure 3 for GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values
Figure 4 for GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values
Viaarxiv icon

Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework

Add code
Bookmark button
Alert button
Jan 23, 2020
Guangliang Li, Hamdi Dibeklioğlu, Shimon Whiteson, Hayley Hung

Figure 1 for Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework
Figure 2 for Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework
Figure 3 for Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework
Figure 4 for Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework
Viaarxiv icon

VIABLE: Fast Adaptation via Backpropagating Learned Loss

Add code
Bookmark button
Alert button
Nov 29, 2019
Leo Feng, Luisa Zintgraf, Bei Peng, Shimon Whiteson

Figure 1 for VIABLE: Fast Adaptation via Backpropagating Learned Loss
Figure 2 for VIABLE: Fast Adaptation via Backpropagating Learned Loss
Figure 3 for VIABLE: Fast Adaptation via Backpropagating Learned Loss
Figure 4 for VIABLE: Fast Adaptation via Backpropagating Learned Loss
Viaarxiv icon

Provably Convergent Off-Policy Actor-Critic with Function Approximation

Add code
Bookmark button
Alert button
Nov 11, 2019
Shangtong Zhang, Bo Liu, Hengshuai Yao, Shimon Whiteson

Figure 1 for Provably Convergent Off-Policy Actor-Critic with Function Approximation
Figure 2 for Provably Convergent Off-Policy Actor-Critic with Function Approximation
Figure 3 for Provably Convergent Off-Policy Actor-Critic with Function Approximation
Viaarxiv icon

VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning

Add code
Bookmark button
Alert button
Oct 18, 2019
Luisa Zintgraf, Kyriacos Shiarlis, Maximilian Igl, Sebastian Schulze, Yarin Gal, Katja Hofmann, Shimon Whiteson

Figure 1 for VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
Figure 2 for VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
Figure 3 for VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
Figure 4 for VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
Viaarxiv icon