Scott Niekum

Models of human preference for learning reward functions
Jun 05, 2022
W. Bradley Knox, Stephane Hatgis-Kessell, Serena Booth, Scott Niekum, Peter Stone, Alessandro Allievi

Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL
Jun 01, 2022
Wonjoon Goo, Scott Niekum

Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation?
Apr 23, 2022
Yuchen Cui, Scott Niekum, Abhinav Gupta, Vikash Kumar, Aravind Rajeswaran

A Ranking Game for Imitation Learning
Feb 07, 2022
Harshit Sikchi, Akanksha Saran, Wonjoon Goo, Scott Niekum

SOPE: Spectrum of Off-Policy Estimators
Dec 02, 2021
Christina J. Yuan, Yash Chandak, Stephen Giguere, Philip S. Thomas, Scott Niekum

You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL
Oct 05, 2021
Wonjoon Goo, Scott Niekum

Distributional Depth-Based Estimation of Object Articulation Models
Aug 12, 2021
Ajinkya Jain, Stephen Giguere, Rudolf Lioutikov, Scott Niekum

Robust Generative Adversarial Imitation Learning via Local Lipschitzness
Jun 30, 2021
Farzan Memarian, Abolfazl Hashemi, Scott Niekum, Ufuk Topcu