Picture for Shimon Whiteson

Shimon Whiteson

University of Oxford

Hypernetworks in Meta-Reinforcement Learning

Add code
Oct 20, 2022
Figure 1 for Hypernetworks in Meta-Reinforcement Learning
Figure 2 for Hypernetworks in Meta-Reinforcement Learning
Figure 3 for Hypernetworks in Meta-Reinforcement Learning
Figure 4 for Hypernetworks in Meta-Reinforcement Learning
Viaarxiv icon

Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving

Add code
Oct 18, 2022
Figure 1 for Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving
Figure 2 for Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving
Figure 3 for Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving
Figure 4 for Hierarchical Model-Based Imitation Learning for Planning in Autonomous Driving
Viaarxiv icon

An Investigation of the Bias-Variance Tradeoff in Meta-Gradients

Add code
Sep 22, 2022
Figure 1 for An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
Figure 2 for An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
Figure 3 for An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
Figure 4 for An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
Viaarxiv icon

Generalized Beliefs for Cooperative AI

Add code
Jun 26, 2022
Figure 1 for Generalized Beliefs for Cooperative AI
Figure 2 for Generalized Beliefs for Cooperative AI
Figure 3 for Generalized Beliefs for Cooperative AI
Figure 4 for Generalized Beliefs for Cooperative AI
Viaarxiv icon

Symphony: Learning Realistic and Diverse Agents for Autonomous Driving Simulation

Add code
May 06, 2022
Figure 1 for Symphony: Learning Realistic and Diverse Agents for Autonomous Driving Simulation
Figure 2 for Symphony: Learning Realistic and Diverse Agents for Autonomous Driving Simulation
Figure 3 for Symphony: Learning Realistic and Diverse Agents for Autonomous Driving Simulation
Figure 4 for Symphony: Learning Realistic and Diverse Agents for Autonomous Driving Simulation
Viaarxiv icon

Generalization in Cooperative Multi-Agent Systems

Add code
Jan 31, 2022
Figure 1 for Generalization in Cooperative Multi-Agent Systems
Figure 2 for Generalization in Cooperative Multi-Agent Systems
Figure 3 for Generalization in Cooperative Multi-Agent Systems
Figure 4 for Generalization in Cooperative Multi-Agent Systems
Viaarxiv icon

Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO

Add code
Jan 31, 2022
Figure 1 for Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO
Figure 2 for Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO
Figure 3 for Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO
Figure 4 for Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO
Viaarxiv icon

You May Not Need Ratio Clipping in PPO

Add code
Jan 31, 2022
Figure 1 for You May Not Need Ratio Clipping in PPO
Figure 2 for You May Not Need Ratio Clipping in PPO
Figure 3 for You May Not Need Ratio Clipping in PPO
Figure 4 for You May Not Need Ratio Clipping in PPO
Viaarxiv icon

In Defense of the Unitary Scalarization for Deep Multi-Task Learning

Add code
Jan 20, 2022
Figure 1 for In Defense of the Unitary Scalarization for Deep Multi-Task Learning
Figure 2 for In Defense of the Unitary Scalarization for Deep Multi-Task Learning
Figure 3 for In Defense of the Unitary Scalarization for Deep Multi-Task Learning
Figure 4 for In Defense of the Unitary Scalarization for Deep Multi-Task Learning
Viaarxiv icon

Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency

Add code
Dec 11, 2021
Figure 1 for Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency
Figure 2 for Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency
Figure 3 for Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency
Figure 4 for Deterministic and Discriminative Imitation (D2-Imitation): Revisiting Adversarial Imitation for Sample Efficiency
Viaarxiv icon