Picture for Stuart Russell

Stuart Russell

Berkeley

Invariance in Policy Optimisation and Partial Identifiability in Reward Learning

Add code
Mar 14, 2022
Figure 1 for Invariance in Policy Optimisation and Partial Identifiability in Reward Learning
Figure 2 for Invariance in Policy Optimisation and Partial Identifiability in Reward Learning
Viaarxiv icon

Cross-Domain Imitation Learning via Optimal Transport

Add code
Oct 14, 2021
Figure 1 for Cross-Domain Imitation Learning via Optimal Transport
Figure 2 for Cross-Domain Imitation Learning via Optimal Transport
Figure 3 for Cross-Domain Imitation Learning via Optimal Transport
Figure 4 for Cross-Domain Imitation Learning via Optimal Transport
Viaarxiv icon

Detecting Modularity in Deep Neural Networks

Add code
Oct 13, 2021
Figure 1 for Detecting Modularity in Deep Neural Networks
Figure 2 for Detecting Modularity in Deep Neural Networks
Figure 3 for Detecting Modularity in Deep Neural Networks
Figure 4 for Detecting Modularity in Deep Neural Networks
Viaarxiv icon

Scalable Online Planning via Reinforcement Learning Fine-Tuning

Add code
Sep 30, 2021
Figure 1 for Scalable Online Planning via Reinforcement Learning Fine-Tuning
Figure 2 for Scalable Online Planning via Reinforcement Learning Fine-Tuning
Figure 3 for Scalable Online Planning via Reinforcement Learning Fine-Tuning
Figure 4 for Scalable Online Planning via Reinforcement Learning Fine-Tuning
Viaarxiv icon

Explore and Control with Adversarial Surprise

Add code
Jul 12, 2021
Figure 1 for Explore and Control with Adversarial Surprise
Figure 2 for Explore and Control with Adversarial Surprise
Figure 3 for Explore and Control with Adversarial Surprise
Figure 4 for Explore and Control with Adversarial Surprise
Viaarxiv icon

The MineRL BASALT Competition on Learning from Human Feedback

Add code
Jul 05, 2021
Figure 1 for The MineRL BASALT Competition on Learning from Human Feedback
Figure 2 for The MineRL BASALT Competition on Learning from Human Feedback
Viaarxiv icon

Learning the Preferences of Uncertain Humans with Inverse Decision Theory

Add code
Jun 19, 2021
Figure 1 for Learning the Preferences of Uncertain Humans with Inverse Decision Theory
Figure 2 for Learning the Preferences of Uncertain Humans with Inverse Decision Theory
Figure 3 for Learning the Preferences of Uncertain Humans with Inverse Decision Theory
Figure 4 for Learning the Preferences of Uncertain Humans with Inverse Decision Theory
Viaarxiv icon

MADE: Exploration via Maximizing Deviation from Explored Regions

Add code
Jun 18, 2021
Figure 1 for MADE: Exploration via Maximizing Deviation from Explored Regions
Figure 2 for MADE: Exploration via Maximizing Deviation from Explored Regions
Figure 3 for MADE: Exploration via Maximizing Deviation from Explored Regions
Figure 4 for MADE: Exploration via Maximizing Deviation from Explored Regions
Viaarxiv icon

Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism

Add code
Mar 22, 2021
Figure 1 for Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
Figure 2 for Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
Figure 3 for Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
Figure 4 for Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism
Viaarxiv icon

Clusterability in Neural Networks

Add code
Mar 04, 2021
Figure 1 for Clusterability in Neural Networks
Figure 2 for Clusterability in Neural Networks
Figure 3 for Clusterability in Neural Networks
Figure 4 for Clusterability in Neural Networks
Viaarxiv icon