Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback

Aug 11, 2021

Xiaofei Wang, Kimin Lee, Kourosh Hakhamaneshi, Pieter Abbeel, Michael Laskin

Figure 1 for Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback

Figure 2 for Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback

Figure 3 for Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback

Figure 4 for Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback

Share this with someone who'll enjoy it:

Abstract:A promising approach to solving challenging long-horizon tasks has been to extract behavior priors (skills) by fitting generative models to large offline datasets of demonstrations. However, such generative models inherit the biases of the underlying data and result in poor and unusable skills when trained on imperfect demonstration data. To better align skill extraction with human intent we present Skill Preferences (SkiP), an algorithm that learns a model over human preferences and uses it to extract human-aligned skills from offline data. After extracting human-preferred skills, SkiP also utilizes human feedback to solve down-stream tasks with RL. We show that SkiP enables a simulated kitchen robot to solve complex multi-step manipulation tasks and substantially outperforms prior leading RL algorithms with human preferences as well as leading skill extraction algorithms without human preferences.

* 8 pages,6 figures. for associated code and video, see http://sites.google.com/view/skill-pref

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback

Paper and Code