Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information

Sep 29, 2018

Arjun Sharma, Mohit Sharma, Nicholas Rhinehart, Kris M. Kitani

Figure 1 for Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information

Figure 2 for Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information

Figure 3 for Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information

Figure 4 for Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information

Share this with someone who'll enjoy it:

Abstract:The use of imitation learning to learn a single policy for a complex task that has multiple modes or hierarchical structure can be challenging. In fact, previous work has shown that when the modes are known, learning separate policies for each mode or sub-task can greatly improve the performance of imitation learning. In this work, we discover the interaction between sub-tasks from their resulting state-action trajectory sequences using a directed graphical model. We propose a new algorithm based on the generative adversarial imitation learning framework which automatically learns sub-task policies from unsegmented demonstrations. Our approach maximizes the directed information flow in the graphical model between sub-task latent variables and their generated trajectories. We also show how our approach connects with the existing Options framework, which is commonly used to learn hierarchical policies.

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Directed-Info GAIL: Learning Hierarchical Policies from Unsegmented Demonstrations using Directed Information

Paper and Code