Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Learning a Universal Human Prior for Dexterous Manipulation from Human Preference

Apr 10, 2023

Zihan Ding, Yuanpei Chen, Allen Z. Ren, Shixiang Shane Gu, Hao Dong, Chi Jin

Figure 1 for Learning a Universal Human Prior for Dexterous Manipulation from Human Preference

Figure 2 for Learning a Universal Human Prior for Dexterous Manipulation from Human Preference

Figure 3 for Learning a Universal Human Prior for Dexterous Manipulation from Human Preference

Figure 4 for Learning a Universal Human Prior for Dexterous Manipulation from Human Preference

Share this with someone who'll enjoy it:

Abstract:Generating human-like behavior on robots is a great challenge especially in dexterous manipulation tasks with robotic hands. Even in simulation with no sample constraints, scripting controllers is intractable due to high degrees of freedom, and manual reward engineering can also be hard and lead to non-realistic motions. Leveraging the recent progress on Reinforcement Learning from Human Feedback (RLHF), we propose a framework to learn a universal human prior using direct human preference feedback over videos, for efficiently tuning the RL policy on 20 dual-hand robot manipulation tasks in simulation, without a single human demonstration. One task-agnostic reward model is trained through iteratively generating diverse polices and collecting human preference over the trajectories; it is then applied for regularizing the behavior of polices in the fine-tuning stage. Our method empirically demonstrates more human-like behaviors on robot hands in diverse tasks including even unseen tasks, indicating its generalization capability.

View paper on

Share this with someone who'll enjoy it:

Title:Learning a Universal Human Prior for Dexterous Manipulation from Human Preference

Paper and Code