Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Implicit Generative Modeling for Efficient Exploration

Nov 19, 2019

Neale Ratzlaff, Qinxun Bai, Li Fuxin, Wei Xu

Figure 1 for Implicit Generative Modeling for Efficient Exploration

Figure 2 for Implicit Generative Modeling for Efficient Exploration

Figure 3 for Implicit Generative Modeling for Efficient Exploration

Figure 4 for Implicit Generative Modeling for Efficient Exploration

Share this with someone who'll enjoy it:

Abstract:Efficient exploration remains a challenging problem in reinforcement learning, especially for those tasks where rewards from environments are sparse. A commonly used approach for exploring such environments is to introduce some "intrinsic" reward. In this work, we focus on model uncertainty estimation as an intrinsic reward for efficient exploration. In particular, we introduce an implicit generative modeling approach to estimate a Bayesian uncertainty of the agent's belief of the environment dynamics. Each random draw from our generative model is a neural network that instantiates the dynamic function, hence multiple draws would approximate the posterior, and the variance in the future prediction based on this posterior is used as an intrinsic reward for exploration. We design a training algorithm for our generative model based on the amortized Stein Variational Gradient Descent. In experiments, we compare our implementation with state-of-the-art intrinsic reward-based exploration approaches, including two recent approaches based on an ensemble of dynamic models. In challenging exploration tasks, our implicit generative model consistently outperforms competing approaches regarding data efficiency in exploration.

* 14 pages, 9 figures

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Implicit Generative Modeling for Efficient Exploration

Paper and Code