Picture for Shimon Whiteson

Shimon Whiteson

University of Oxford

Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning

Add code
Mar 19, 2020
Figure 1 for Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Figure 2 for Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Figure 3 for Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Figure 4 for Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning
Viaarxiv icon

Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control

Add code
Mar 18, 2020
Figure 1 for Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control
Figure 2 for Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control
Figure 3 for Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control
Figure 4 for Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control
Viaarxiv icon

Optimistic Exploration even with a Pessimistic Initialisation

Add code
Feb 26, 2020
Figure 1 for Optimistic Exploration even with a Pessimistic Initialisation
Figure 2 for Optimistic Exploration even with a Pessimistic Initialisation
Figure 3 for Optimistic Exploration even with a Pessimistic Initialisation
Figure 4 for Optimistic Exploration even with a Pessimistic Initialisation
Viaarxiv icon

Reinforcement Learning Enhanced Quantum-inspired Algorithm for Combinatorial Optimization

Add code
Feb 14, 2020
Figure 1 for Reinforcement Learning Enhanced Quantum-inspired Algorithm for Combinatorial Optimization
Figure 2 for Reinforcement Learning Enhanced Quantum-inspired Algorithm for Combinatorial Optimization
Figure 3 for Reinforcement Learning Enhanced Quantum-inspired Algorithm for Combinatorial Optimization
Figure 4 for Reinforcement Learning Enhanced Quantum-inspired Algorithm for Combinatorial Optimization
Viaarxiv icon

GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values

Add code
Feb 07, 2020
Figure 1 for GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values
Figure 2 for GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values
Figure 3 for GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values
Figure 4 for GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values
Viaarxiv icon

Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework

Add code
Jan 23, 2020
Figure 1 for Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework
Figure 2 for Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework
Figure 3 for Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework
Figure 4 for Facial Feedback for Reinforcement Learning: A Case Study and Offline Analysis Using the TAMER Framework
Viaarxiv icon

VIABLE: Fast Adaptation via Backpropagating Learned Loss

Add code
Nov 29, 2019
Figure 1 for VIABLE: Fast Adaptation via Backpropagating Learned Loss
Figure 2 for VIABLE: Fast Adaptation via Backpropagating Learned Loss
Figure 3 for VIABLE: Fast Adaptation via Backpropagating Learned Loss
Figure 4 for VIABLE: Fast Adaptation via Backpropagating Learned Loss
Viaarxiv icon

Provably Convergent Off-Policy Actor-Critic with Function Approximation

Add code
Nov 11, 2019
Figure 1 for Provably Convergent Off-Policy Actor-Critic with Function Approximation
Figure 2 for Provably Convergent Off-Policy Actor-Critic with Function Approximation
Figure 3 for Provably Convergent Off-Policy Actor-Critic with Function Approximation
Viaarxiv icon

VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning

Add code
Oct 18, 2019
Figure 1 for VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
Figure 2 for VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
Figure 3 for VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
Figure 4 for VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning
Viaarxiv icon

MAVEN: Multi-Agent Variational Exploration

Add code
Oct 16, 2019
Figure 1 for MAVEN: Multi-Agent Variational Exploration
Figure 2 for MAVEN: Multi-Agent Variational Exploration
Figure 3 for MAVEN: Multi-Agent Variational Exploration
Figure 4 for MAVEN: Multi-Agent Variational Exploration
Viaarxiv icon