Stuart Russell

An Empirical Investigation of Representation Learning for Imitation

May 16, 2022
Xin Chen, Sam Toyer, Cody Wild, Scott Emmons, Ian Fischer, Kuang-Huei Lee, Neel Alex, Steven H. Wang, Ping Luo, Stuart Russell, Pieter Abbeel, Rohin Shah

Estimating and Penalizing Induced Preference Shifts in Recommender Systems

Apr 25, 2022
Micah Carroll, Dylan Hadfield-Menell, Stuart Russell, Anca Dragan

Invariance in Policy Optimisation and Partial Identifiability in Reward Learning

Mar 14, 2022
Joar Skalse, Matthew Farrugia-Roberts, Stuart Russell, Alessandro Abate, Adam Gleave

Cross-Domain Imitation Learning via Optimal Transport

Oct 14, 2021
Arnaud Fickinger, Samuel Cohen, Stuart Russell, Brandon Amos

Detecting Modularity in Deep Neural Networks

Oct 13, 2021
Shlomi Hod, Stephen Casper, Daniel Filan, Cody Wild, Andrew Critch, Stuart Russell

Scalable Online Planning via Reinforcement Learning Fine-Tuning

Sep 30, 2021
Arnaud Fickinger, Hengyuan Hu, Brandon Amos, Stuart Russell, Noam Brown

Explore and Control with Adversarial Surprise

Jul 12, 2021
Arnaud Fickinger, Natasha Jaques, Samyak Parajuli, Michael Chang, Nicholas Rhinehart, Glen Berseth, Stuart Russell, Sergey Levine

The MineRL BASALT Competition on Learning from Human Feedback

Jul 05, 2021
Rohin Shah, Cody Wild, Steven H. Wang, Neel Alex, Brandon Houghton, William Guss, Sharada Mohanty, Anssi Kanervisto, Stephanie Milani, Nicholay Topin, Pieter Abbeel, Stuart Russell, Anca Dragan

Learning the Preferences of Uncertain Humans with Inverse Decision Theory

Jun 19, 2021
Cassidy Laidlaw, Stuart Russell
