Alert button
Picture for Shimon Whiteson

Shimon Whiteson

Alert button

Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 19, 2023
Yat Long Lo, Christian Schroeder de Witt, Samuel Sokota, Jakob Nicolaus Foerster, Shimon Whiteson

Figure 1 for Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning
Figure 2 for Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning
Figure 3 for Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning
Figure 4 for Cheap Talk Discovery and Utilization in Multi-Agent Reinforcement Learning
Viaarxiv icon

Why Target Networks Stabilise Temporal Difference Methods

Add code
Bookmark button
Alert button
Feb 24, 2023
Mattie Fellows, Matthew J. A. Smith, Shimon Whiteson

Figure 1 for Why Target Networks Stabilise Temporal Difference Methods
Figure 2 for Why Target Networks Stabilise Temporal Difference Methods
Figure 3 for Why Target Networks Stabilise Temporal Difference Methods
Figure 4 for Why Target Networks Stabilise Temporal Difference Methods
Viaarxiv icon

Universal Morphology Control via Contextual Modulation

Add code
Bookmark button
Alert button
Feb 22, 2023
Zheng Xiong, Jacob Beck, Shimon Whiteson

Figure 1 for Universal Morphology Control via Contextual Modulation
Figure 2 for Universal Morphology Control via Contextual Modulation
Figure 3 for Universal Morphology Control via Contextual Modulation
Figure 4 for Universal Morphology Control via Contextual Modulation
Viaarxiv icon

Trust-Region-Free Policy Optimization for Stochastic Policies

Add code
Bookmark button
Alert button
Feb 15, 2023
Mingfei Sun, Benjamin Ellis, Anuj Mahajan, Sam Devlin, Katja Hofmann, Shimon Whiteson

Figure 1 for Trust-Region-Free Policy Optimization for Stochastic Policies
Figure 2 for Trust-Region-Free Policy Optimization for Stochastic Policies
Viaarxiv icon

A Survey of Meta-Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 19, 2023
Jacob Beck, Risto Vuorio, Evan Zheran Liu, Zheng Xiong, Luisa Zintgraf, Chelsea Finn, Shimon Whiteson

Figure 1 for A Survey of Meta-Reinforcement Learning
Figure 2 for A Survey of Meta-Reinforcement Learning
Figure 3 for A Survey of Meta-Reinforcement Learning
Figure 4 for A Survey of Meta-Reinforcement Learning
Viaarxiv icon

Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios

Add code
Bookmark button
Alert button
Dec 21, 2022
Yiren Lu, Justin Fu, George Tucker, Xinlei Pan, Eli Bronstein, Becca Roelofs, Benjamin Sapp, Brandyn White, Aleksandra Faust, Shimon Whiteson, Dragomir Anguelov, Sergey Levine

Figure 1 for Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Figure 2 for Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Figure 3 for Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Figure 4 for Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios
Viaarxiv icon

SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 14, 2022
Benjamin Ellis, Skander Moalla, Mikayel Samvelyan, Mingfei Sun, Anuj Mahajan, Jakob N. Foerster, Shimon Whiteson

Figure 1 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 2 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 3 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 4 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Viaarxiv icon

Particle-Based Score Estimation for State Space Model Learning in Autonomous Driving

Add code
Bookmark button
Alert button
Dec 14, 2022
Angad Singh, Omar Makhlouf, Maximilian Igl, Joao Messias, Arnaud Doucet, Shimon Whiteson

Figure 1 for Particle-Based Score Estimation for State Space Model Learning in Autonomous Driving
Figure 2 for Particle-Based Score Estimation for State Space Model Learning in Autonomous Driving
Figure 3 for Particle-Based Score Estimation for State Space Model Learning in Autonomous Driving
Figure 4 for Particle-Based Score Estimation for State Space Model Learning in Autonomous Driving
Viaarxiv icon

Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula

Add code
Bookmark button
Alert button
Dec 02, 2022
Eli Bronstein, Sirish Srinivasan, Supratik Paul, Aman Sinha, Matthew O'Kelly, Payam Nikdel, Shimon Whiteson

Figure 1 for Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula
Figure 2 for Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula
Figure 3 for Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula
Figure 4 for Embedding Synthetic Off-Policy Experience for Autonomous Driving via Zero-Shot Curricula
Viaarxiv icon

Hypernetworks in Meta-Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 20, 2022
Jacob Beck, Matthew Thomas Jackson, Risto Vuorio, Shimon Whiteson

Figure 1 for Hypernetworks in Meta-Reinforcement Learning
Figure 2 for Hypernetworks in Meta-Reinforcement Learning
Figure 3 for Hypernetworks in Meta-Reinforcement Learning
Figure 4 for Hypernetworks in Meta-Reinforcement Learning
Viaarxiv icon