Picture for Shimon Whiteson

Shimon Whiteson

University of Oxford

Can Learned Optimization Make Reinforcement Learning Less Difficult?

Add code
Jul 09, 2024
Viaarxiv icon

A Bayesian Solution To The Imitation Gap

Add code
Jun 29, 2024
Viaarxiv icon

UniGen: Unified Modeling of Initial Agent States and Trajectories for Generating Autonomous Driving Scenarios

Add code
May 06, 2024
Figure 1 for UniGen: Unified Modeling of Initial Agent States and Trajectories for Generating Autonomous Driving Scenarios
Figure 2 for UniGen: Unified Modeling of Initial Agent States and Trajectories for Generating Autonomous Driving Scenarios
Figure 3 for UniGen: Unified Modeling of Initial Agent States and Trajectories for Generating Autonomous Driving Scenarios
Figure 4 for UniGen: Unified Modeling of Initial Agent States and Trajectories for Generating Autonomous Driving Scenarios
Viaarxiv icon

Policy-Guided Diffusion

Add code
Apr 09, 2024
Figure 1 for Policy-Guided Diffusion
Figure 2 for Policy-Guided Diffusion
Figure 3 for Policy-Guided Diffusion
Figure 4 for Policy-Guided Diffusion
Viaarxiv icon

SplAgger: Split Aggregation for Meta-Reinforcement Learning

Add code
Mar 08, 2024
Figure 1 for SplAgger: Split Aggregation for Meta-Reinforcement Learning
Figure 2 for SplAgger: Split Aggregation for Meta-Reinforcement Learning
Figure 3 for SplAgger: Split Aggregation for Meta-Reinforcement Learning
Figure 4 for SplAgger: Split Aggregation for Meta-Reinforcement Learning
Viaarxiv icon

Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control

Add code
Feb 09, 2024
Figure 1 for Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control
Figure 2 for Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control
Figure 3 for Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control
Figure 4 for Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control
Viaarxiv icon

Discovering Temporally-Aware Reinforcement Learning Algorithms

Add code
Feb 08, 2024
Viaarxiv icon

JaxMARL: Multi-Agent RL Environments in JAX

Add code
Nov 20, 2023
Figure 1 for JaxMARL: Multi-Agent RL Environments in JAX
Figure 2 for JaxMARL: Multi-Agent RL Environments in JAX
Figure 3 for JaxMARL: Multi-Agent RL Environments in JAX
Figure 4 for JaxMARL: Multi-Agent RL Environments in JAX
Viaarxiv icon

Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design

Add code
Oct 04, 2023
Figure 1 for Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design
Figure 2 for Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design
Figure 3 for Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design
Figure 4 for Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design
Viaarxiv icon

Recurrent Hypernetworks are Surprisingly Strong in Meta-RL

Add code
Sep 26, 2023
Figure 1 for Recurrent Hypernetworks are Surprisingly Strong in Meta-RL
Figure 2 for Recurrent Hypernetworks are Surprisingly Strong in Meta-RL
Figure 3 for Recurrent Hypernetworks are Surprisingly Strong in Meta-RL
Figure 4 for Recurrent Hypernetworks are Surprisingly Strong in Meta-RL
Viaarxiv icon