Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Danilo Jimenez Rezende

Variational Inference with Normalizing Flows

Jun 14, 2016

Danilo Jimenez Rezende, Shakir Mohamed

Figure 1 for Variational Inference with Normalizing Flows

Figure 2 for Variational Inference with Normalizing Flows

Figure 3 for Variational Inference with Normalizing Flows

Figure 4 for Variational Inference with Normalizing Flows

Abstract:The choice of approximate posterior distribution is one of the core problems in variational inference. Most applications of variational inference employ simple families of posterior approximations in order to allow for efficient inference, focusing on mean-field or other simple structured approximations. This restriction has a significant impact on the quality of inferences made using variational methods. We introduce a new approach for specifying flexible, arbitrarily complex and scalable approximate posterior distributions. Our approximations are distributions constructed through a normalizing flow, whereby a simple initial density is transformed into a more complex one by applying a sequence of invertible transformations until a desired level of complexity is attained. We use this view of normalizing flows to develop categories of finite and infinitesimal flows and provide a unified view of approaches for constructing rich posterior approximations. We demonstrate that the theoretical advantages of having posteriors that better match the true posterior, combined with the scalability of amortized variational approaches, provides a clear improvement in performance and applicability of variational inference.

* Proceedings of the 32nd International Conference on Machine Learning

Via

Access Paper or Ask Questions

One-Shot Generalization in Deep Generative Models

May 25, 2016

Danilo Jimenez Rezende, Shakir Mohamed, Ivo Danihelka, Karol Gregor, Daan Wierstra

Figure 1 for One-Shot Generalization in Deep Generative Models

Figure 2 for One-Shot Generalization in Deep Generative Models

Figure 3 for One-Shot Generalization in Deep Generative Models

Figure 4 for One-Shot Generalization in Deep Generative Models

Abstract:Humans have an impressive ability to reason about new concepts and experiences from just a single example. In particular, humans have an ability for one-shot generalization: an ability to encounter a new concept, understand its structure, and then be able to generate compelling alternative variations of the concept. We develop machine learning systems with this important capacity by developing new deep generative models, models that combine the representational power of deep learning with the inferential power of Bayesian reasoning. We develop a class of sequential generative models that are built on the principles of feedback and attention. These two characteristics lead to generative models that are among the state-of-the art in density estimation and image generation. We demonstrate the one-shot generalization ability of our models using three tasks: unconditional sampling, generating new exemplars of a given concept, and generating new exemplars of a family of concepts. In all cases our models are able to generate compelling and diverse samples---having seen new examples just once---providing an important class of general-purpose models for one-shot machine learning.

* 8pgs, 1pg references, 1pg appendix, In Proceedings of the 33rd International Conference on Machine Learning, JMLR: W&CP volume 48, 2016

Via

Access Paper or Ask Questions

Towards Conceptual Compression

Apr 29, 2016

Karol Gregor, Frederic Besse, Danilo Jimenez Rezende, Ivo Danihelka, Daan Wierstra

Figure 1 for Towards Conceptual Compression

Figure 2 for Towards Conceptual Compression

Figure 3 for Towards Conceptual Compression

Figure 4 for Towards Conceptual Compression

Abstract:We introduce a simple recurrent variational auto-encoder architecture that significantly improves image modeling. The system represents the state-of-the-art in latent variable models for both the ImageNet and Omniglot datasets. We show that it naturally separates global conceptual information from lower level details, thus addressing one of the fundamentally desired properties of unsupervised learning. Furthermore, the possibility of restricting ourselves to storing only global information about an image allows us to achieve high quality 'conceptual compression'.

* 14 pages, 13 figures

Via

Access Paper or Ask Questions

Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning

Sep 29, 2015

Shakir Mohamed, Danilo Jimenez Rezende

Figure 1 for Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning

Figure 2 for Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning

Figure 3 for Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning

Figure 4 for Variational Information Maximisation for Intrinsically Motivated Reinforcement Learning

Abstract:The mutual information is a core statistical quantity that has applications in all areas of machine learning, whether this is in training of density models over multiple data modalities, in maximising the efficiency of noisy transmission channels, or when learning behaviour policies for exploration by artificial agents. Most learning algorithms that involve optimisation of the mutual information rely on the Blahut-Arimoto algorithm --- an enumerative algorithm with exponential complexity that is not suitable for modern machine learning applications. This paper provides a new approach for scalable optimisation of the mutual information by merging techniques from variational inference and deep learning. We develop our approach by focusing on the problem of intrinsically-motivated learning, where the mutual information forms the definition of a well-known internal drive known as empowerment. Using a variational lower bound on the mutual information, combined with convolutional networks for handling visual input streams, we develop a stochastic optimisation algorithm that allows for scalable information maximisation and empowerment-based reasoning directly from pixels to actions.

* Proceedings of the 29th Conference on Neural Information Processing Systems (NIPS 2015)

Via

Access Paper or Ask Questions

DRAW: A Recurrent Neural Network For Image Generation

May 20, 2015

Karol Gregor, Ivo Danihelka, Alex Graves, Danilo Jimenez Rezende, Daan Wierstra

Figure 1 for DRAW: A Recurrent Neural Network For Image Generation

Figure 2 for DRAW: A Recurrent Neural Network For Image Generation

Figure 3 for DRAW: A Recurrent Neural Network For Image Generation

Figure 4 for DRAW: A Recurrent Neural Network For Image Generation

Abstract:This paper introduces the Deep Recurrent Attentive Writer (DRAW) neural network architecture for image generation. DRAW networks combine a novel spatial attention mechanism that mimics the foveation of the human eye, with a sequential variational auto-encoding framework that allows for the iterative construction of complex images. The system substantially improves on the state of the art for generative models on MNIST, and, when trained on the Street View House Numbers dataset, it generates images that cannot be distinguished from real data with the naked eye.

Via

Access Paper or Ask Questions

Stochastic Backpropagation and Approximate Inference in Deep Generative Models

May 30, 2014

Danilo Jimenez Rezende, Shakir Mohamed, Daan Wierstra

Figure 1 for Stochastic Backpropagation and Approximate Inference in Deep Generative Models

Figure 2 for Stochastic Backpropagation and Approximate Inference in Deep Generative Models

Figure 3 for Stochastic Backpropagation and Approximate Inference in Deep Generative Models

Figure 4 for Stochastic Backpropagation and Approximate Inference in Deep Generative Models

Abstract:We marry ideas from deep neural networks and approximate Bayesian inference to derive a generalised class of deep, directed generative models, endowed with a new algorithm for scalable inference and learning. Our algorithm introduces a recognition model to represent approximate posterior distributions, and that acts as a stochastic encoder of the data. We develop stochastic back-propagation -- rules for back-propagation through stochastic variables -- and use this to develop an algorithm that allows for joint optimisation of the parameters of both the generative and recognition model. We demonstrate on several real-world data sets that the model generates realistic samples, provides accurate imputations of missing data and is a useful tool for high-dimensional data visualisation.

* Appears In Proceedings of the 31st International Conference on Machine Learning (ICML), JMLR: W\&CP volume 32, 2014

Via

Access Paper or Ask Questions