Picture for Scott Niekum

Scott Niekum

SOPE: Spectrum of Off-Policy Estimators

Add code
Dec 02, 2021
Figure 1 for SOPE: Spectrum of Off-Policy Estimators
Figure 2 for SOPE: Spectrum of Off-Policy Estimators
Figure 3 for SOPE: Spectrum of Off-Policy Estimators
Figure 4 for SOPE: Spectrum of Off-Policy Estimators
Viaarxiv icon

You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL

Add code
Oct 05, 2021
Figure 1 for You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL
Figure 2 for You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL
Figure 3 for You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL
Figure 4 for You Only Evaluate Once: a Simple Baseline Algorithm for Offline RL
Viaarxiv icon

Distributional Depth-Based Estimation of Object Articulation Models

Add code
Aug 12, 2021
Figure 1 for Distributional Depth-Based Estimation of Object Articulation Models
Figure 2 for Distributional Depth-Based Estimation of Object Articulation Models
Figure 3 for Distributional Depth-Based Estimation of Object Articulation Models
Figure 4 for Distributional Depth-Based Estimation of Object Articulation Models
Viaarxiv icon

Robust Generative Adversarial Imitation Learning via Local Lipschitzness

Add code
Jun 30, 2021
Figure 1 for Robust Generative Adversarial Imitation Learning via Local Lipschitzness
Figure 2 for Robust Generative Adversarial Imitation Learning via Local Lipschitzness
Viaarxiv icon

Zero-shot Task Adaptation using Natural Language

Add code
Jun 05, 2021
Figure 1 for Zero-shot Task Adaptation using Natural Language
Figure 2 for Zero-shot Task Adaptation using Natural Language
Figure 3 for Zero-shot Task Adaptation using Natural Language
Figure 4 for Zero-shot Task Adaptation using Natural Language
Viaarxiv icon

Adversarial Intrinsic Motivation for Reinforcement Learning

Add code
May 30, 2021
Figure 1 for Adversarial Intrinsic Motivation for Reinforcement Learning
Figure 2 for Adversarial Intrinsic Motivation for Reinforcement Learning
Figure 3 for Adversarial Intrinsic Motivation for Reinforcement Learning
Figure 4 for Adversarial Intrinsic Motivation for Reinforcement Learning
Viaarxiv icon

Universal Off-Policy Evaluation

Add code
Apr 26, 2021
Figure 1 for Universal Off-Policy Evaluation
Figure 2 for Universal Off-Policy Evaluation
Figure 3 for Universal Off-Policy Evaluation
Figure 4 for Universal Off-Policy Evaluation
Viaarxiv icon

Self-Supervised Online Reward Shaping in Sparse-Reward Environments

Add code
Mar 08, 2021
Figure 1 for Self-Supervised Online Reward Shaping in Sparse-Reward Environments
Viaarxiv icon

SCAPE: Learning Stiffness Control from Augmented Position Control Experiences

Add code
Feb 16, 2021
Figure 1 for SCAPE: Learning Stiffness Control from Augmented Position Control Experiences
Figure 2 for SCAPE: Learning Stiffness Control from Augmented Position Control Experiences
Figure 3 for SCAPE: Learning Stiffness Control from Augmented Position Control Experiences
Figure 4 for SCAPE: Learning Stiffness Control from Augmented Position Control Experiences
Viaarxiv icon

Value Alignment Verification

Add code
Dec 02, 2020
Figure 1 for Value Alignment Verification
Figure 2 for Value Alignment Verification
Figure 3 for Value Alignment Verification
Figure 4 for Value Alignment Verification
Viaarxiv icon