Picture for Shimon Whiteson

Shimon Whiteson

University of Oxford

The Waymo Open Sim Agents Challenge

Add code
May 19, 2023
Viaarxiv icon

Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning

Add code
Mar 19, 2023
Figure 1 for Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning
Figure 2 for Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning
Figure 3 for Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning
Figure 4 for Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning
Viaarxiv icon

Why Target Networks Stabilise Temporal Difference Methods

Add code
Feb 24, 2023
Figure 1 for Why Target Networks Stabilise Temporal Difference Methods
Figure 2 for Why Target Networks Stabilise Temporal Difference Methods
Figure 3 for Why Target Networks Stabilise Temporal Difference Methods
Figure 4 for Why Target Networks Stabilise Temporal Difference Methods
Viaarxiv icon

Universal Morphology Control via Contextual Modulation

Add code
Feb 22, 2023
Figure 1 for Universal Morphology Control via Contextual Modulation
Figure 2 for Universal Morphology Control via Contextual Modulation
Figure 3 for Universal Morphology Control via Contextual Modulation
Figure 4 for Universal Morphology Control via Contextual Modulation
Viaarxiv icon

Trust-Region-Free Policy Optimization for Stochastic Policies

Add code
Feb 15, 2023
Viaarxiv icon

A Survey of Meta-Reinforcement Learning

Add code
Jan 19, 2023
Figure 1 for A Survey of Meta-Reinforcement Learning
Figure 2 for A Survey of Meta-Reinforcement Learning
Figure 3 for A Survey of Meta-Reinforcement Learning
Figure 4 for A Survey of Meta-Reinforcement Learning
Viaarxiv icon

Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios

Add code
Dec 21, 2022
Figure 1 for Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Figure 2 for Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Figure 3 for Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Figure 4 for Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Viaarxiv icon

SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning

Add code
Dec 14, 2022
Figure 1 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 2 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 3 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 4 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Viaarxiv icon

Particle-Based Score Estimation for State Space Model Learning in Autonomous Driving

Add code
Dec 14, 2022
Viaarxiv icon

Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula

Add code
Dec 02, 2022
Figure 1 for Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula
Figure 2 for Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula
Figure 3 for Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula
Figure 4 for Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula
Viaarxiv icon