Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lars Maaløe

Auxiliary Deep Generative Models

Jun 16, 2016

Lars Maaløe, Casper Kaae Sønderby, Søren Kaae Sønderby, Ole Winther

Figure 1 for Auxiliary Deep Generative Models

Figure 2 for Auxiliary Deep Generative Models

Figure 3 for Auxiliary Deep Generative Models

Figure 4 for Auxiliary Deep Generative Models

Abstract:Deep generative models parameterized by neural networks have recently achieved state-of-the-art performance in unsupervised and semi-supervised learning. We extend deep generative models with auxiliary variables which improves the variational approximation. The auxiliary variables leave the generative model unchanged but make the variational distribution more expressive. Inspired by the structure of the auxiliary variable we also propose a model with two stochastic layers and skip connections. Our findings suggest that more expressive and properly specified deep generative models converge faster with better results. We show state-of-the-art performance within semi-supervised learning on MNIST, SVHN and NORB datasets.

* Proceedings of the 33rd International Conference on Machine Learning, New York, NY, USA, 2016, JMLR: Workshop and Conference Proceedings volume 48, Proceedings of the 33rd International Conference on Machine Learning, New York, NY, USA, 2016

Via

Access Paper or Ask Questions

Ladder Variational Autoencoders

May 27, 2016

Casper Kaae Sønderby, Tapani Raiko, Lars Maaløe, Søren Kaae Sønderby, Ole Winther

Figure 1 for Ladder Variational Autoencoders

Figure 2 for Ladder Variational Autoencoders

Figure 3 for Ladder Variational Autoencoders

Figure 4 for Ladder Variational Autoencoders

Abstract:Variational Autoencoders are powerful models for unsupervised learning. However deep models with several layers of dependent stochastic variables are difficult to train which limits the improvements obtained using these highly expressive models. We propose a new inference model, the Ladder Variational Autoencoder, that recursively corrects the generative distribution by a data dependent approximate likelihood in a process resembling the recently proposed Ladder Network. We show that this model provides state of the art predictive log-likelihood and tighter log-likelihood lower bound compared to the purely bottom-up inference in layered Variational Autoencoders and other generative models. We provide a detailed analysis of the learned hierarchical latent representation and show that our new inference model is qualitatively different and utilizes a deeper more distributed hierarchy of latent variables. Finally, we observe that batch normalization and deterministic warm-up (gradually turning on the KL-term) are crucial for training variational models with many stochastic layers.

Via

Access Paper or Ask Questions

Recurrent Spatial Transformer Networks

Sep 17, 2015

Søren Kaae Sønderby, Casper Kaae Sønderby, Lars Maaløe, Ole Winther

Figure 1 for Recurrent Spatial Transformer Networks

Figure 2 for Recurrent Spatial Transformer Networks

Figure 3 for Recurrent Spatial Transformer Networks

Figure 4 for Recurrent Spatial Transformer Networks

Abstract:We integrate the recently proposed spatial transformer network (SPN) [Jaderberg et. al 2015] into a recurrent neural network (RNN) to form an RNN-SPN model. We use the RNN-SPN to classify digits in cluttered MNIST sequences. The proposed model achieves a single digit error of 1.5% compared to 2.9% for a convolutional networks and 2.0% for convolutional networks with SPN layers. The SPN outputs a zoomed, rotated and skewed version of the input image. We investigate different down-sampling factors (ratio of pixel in input and output) for the SPN and show that the RNN-SPN model is able to down-sample the input images without deteriorating performance. The down-sampling in RNN-SPN can be thought of as adaptive down-sampling that minimizes the information loss in the regions of interest. We attribute the superior performance of the RNN-SPN to the fact that it can attend to a sequence of regions of interest.

Via

Access Paper or Ask Questions