Picture for Shimon Whiteson

Shimon Whiteson

University of Oxford

Snowflake: Scaling GNNs to High-Dimensional Continuous Control via Parameter Freezing

Add code
Mar 01, 2021
Figure 1 for Snowflake: Scaling GNNs to High-Dimensional Continuous Control via Parameter Freezing
Figure 2 for Snowflake: Scaling GNNs to High-Dimensional Continuous Control via Parameter Freezing
Figure 3 for Snowflake: Scaling GNNs to High-Dimensional Continuous Control via Parameter Freezing
Figure 4 for Snowflake: Scaling GNNs to High-Dimensional Continuous Control via Parameter Freezing
Viaarxiv icon

Breaking the Deadly Triad with a Target Network

Add code
Feb 09, 2021
Figure 1 for Breaking the Deadly Triad with a Target Network
Figure 2 for Breaking the Deadly Triad with a Target Network
Figure 3 for Breaking the Deadly Triad with a Target Network
Figure 4 for Breaking the Deadly Triad with a Target Network
Viaarxiv icon

Deep Interactive Bayesian Reinforcement Learning via Meta-Learning

Add code
Jan 11, 2021
Figure 1 for Deep Interactive Bayesian Reinforcement Learning via Meta-Learning
Figure 2 for Deep Interactive Bayesian Reinforcement Learning via Meta-Learning
Figure 3 for Deep Interactive Bayesian Reinforcement Learning via Meta-Learning
Figure 4 for Deep Interactive Bayesian Reinforcement Learning via Meta-Learning
Viaarxiv icon

Average-Reward Off-Policy Policy Evaluation with Function Approximation

Add code
Jan 08, 2021
Figure 1 for Average-Reward Off-Policy Policy Evaluation with Function Approximation
Figure 2 for Average-Reward Off-Policy Policy Evaluation with Function Approximation
Figure 3 for Average-Reward Off-Policy Policy Evaluation with Function Approximation
Figure 4 for Average-Reward Off-Policy Policy Evaluation with Function Approximation
Viaarxiv icon

Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?

Add code
Nov 18, 2020
Figure 1 for Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?
Figure 2 for Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?
Figure 3 for Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?
Figure 4 for Is Independent Learning All You Need in the StarCraft Multi-Agent Challenge?
Viaarxiv icon

UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

Add code
Oct 06, 2020
Figure 1 for UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
Figure 2 for UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
Figure 3 for UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
Figure 4 for UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
Viaarxiv icon

My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control

Add code
Oct 05, 2020
Figure 1 for My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control
Figure 2 for My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control
Figure 3 for My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control
Figure 4 for My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control
Viaarxiv icon

RODE: Learning Roles to Decompose Multi-Agent Tasks

Add code
Oct 04, 2020
Figure 1 for RODE: Learning Roles to Decompose Multi-Agent Tasks
Figure 2 for RODE: Learning Roles to Decompose Multi-Agent Tasks
Figure 3 for RODE: Learning Roles to Decompose Multi-Agent Tasks
Figure 4 for RODE: Learning Roles to Decompose Multi-Agent Tasks
Viaarxiv icon

A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms

Add code
Oct 02, 2020
Figure 1 for A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Figure 2 for A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Figure 3 for A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Figure 4 for A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Viaarxiv icon

Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning

Add code
Oct 02, 2020
Figure 1 for Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning
Figure 2 for Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning
Figure 3 for Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning
Figure 4 for Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning
Viaarxiv icon