Picture for Scott Niekum

Scott Niekum

Predicting Future Actions of Reinforcement Learning Agents

Add code
Oct 29, 2024
Figure 1 for Predicting Future Actions of Reinforcement Learning Agents
Figure 2 for Predicting Future Actions of Reinforcement Learning Agents
Figure 3 for Predicting Future Actions of Reinforcement Learning Agents
Figure 4 for Predicting Future Actions of Reinforcement Learning Agents
Viaarxiv icon

SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions

Add code
Oct 24, 2024
Figure 1 for SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions
Figure 2 for SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions
Figure 3 for SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions
Figure 4 for SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions
Viaarxiv icon

Pareto-Optimal Learning from Preferences with Hidden Context

Add code
Jun 21, 2024
Figure 1 for Pareto-Optimal Learning from Preferences with Hidden Context
Figure 2 for Pareto-Optimal Learning from Preferences with Hidden Context
Figure 3 for Pareto-Optimal Learning from Preferences with Hidden Context
Figure 4 for Pareto-Optimal Learning from Preferences with Hidden Context
Viaarxiv icon

A Dual Approach to Imitation Learning from Observations with Offline Datasets

Add code
Jun 13, 2024
Figure 1 for A Dual Approach to Imitation Learning from Observations with Offline Datasets
Figure 2 for A Dual Approach to Imitation Learning from Observations with Offline Datasets
Figure 3 for A Dual Approach to Imitation Learning from Observations with Offline Datasets
Figure 4 for A Dual Approach to Imitation Learning from Observations with Offline Datasets
Viaarxiv icon

Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning

Add code
May 06, 2024
Figure 1 for Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning
Figure 2 for Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning
Figure 3 for Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning
Figure 4 for Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning
Viaarxiv icon

D2PO: Discriminator-Guided DPO with Response Evaluation Models

Add code
May 02, 2024
Figure 1 for D2PO: Discriminator-Guided DPO with Response Evaluation Models
Figure 2 for D2PO: Discriminator-Guided DPO with Response Evaluation Models
Figure 3 for D2PO: Discriminator-Guided DPO with Response Evaluation Models
Figure 4 for D2PO: Discriminator-Guided DPO with Response Evaluation Models
Viaarxiv icon

Automated Discovery of Functional Actual Causes in Complex Environments

Add code
Apr 16, 2024
Figure 1 for Automated Discovery of Functional Actual Causes in Complex Environments
Figure 2 for Automated Discovery of Functional Actual Causes in Complex Environments
Figure 3 for Automated Discovery of Functional Actual Causes in Complex Environments
Figure 4 for Automated Discovery of Functional Actual Causes in Complex Environments
Viaarxiv icon

Learning Action-based Representations Using Invariance

Add code
Mar 25, 2024
Viaarxiv icon

Score Models for Offline Goal-Conditioned Reinforcement Learning

Add code
Nov 03, 2023
Viaarxiv icon

Contrastive Preference Learning: Learning from Human Feedback without RL

Add code
Oct 24, 2023
Figure 1 for Contrastive Preference Learning: Learning from Human Feedback without RL
Figure 2 for Contrastive Preference Learning: Learning from Human Feedback without RL
Figure 3 for Contrastive Preference Learning: Learning from Human Feedback without RL
Figure 4 for Contrastive Preference Learning: Learning from Human Feedback without RL
Viaarxiv icon