Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Yingzhen Li

Reinforcement Learning with Efficient Active Feature Acquisition

Nov 02, 2020
Haiyan Yin, Yingzhen Li, Sinno Jialin Pan, Cheng Zhang, Sebastian Tschiatschek

Figure 1 for Reinforcement Learning with Efficient Active Feature Acquisition

Figure 2 for Reinforcement Learning with Efficient Active Feature Acquisition

Figure 3 for Reinforcement Learning with Efficient Active Feature Acquisition

Figure 4 for Reinforcement Learning with Efficient Active Feature Acquisition

Solving real-life sequential decision making problems under partial observability involves an exploration-exploitation problem. To be successful, an agent needs to efficiently gather valuable information about the state of the world for making rewarding decisions. However, in real-life, acquiring valuable information is often highly costly, e.g., in the medical domain, information acquisition might correspond to performing a medical test on a patient. This poses a significant challenge for the agent to perform optimally for the task while reducing the cost for information acquisition. In this paper, we propose a model-based reinforcement learning framework that learns an active feature acquisition policy to solve the exploration-exploitation problem during its execution. Key to the success is a novel sequential variational auto-encoder that learns high-quality representations from partially observed states, which are then used by the policy to maximize the task reward in a cost efficient manner. We demonstrate the efficacy of our proposed framework in a control domain as well as using a medical simulator. In both tasks, our proposed method outperforms conventional baselines and results in policies with greater cost efficiency.

Via

Access Paper or Ask Questions

A Study on Efficiency in Continual Learning Inspired by Human Learning

Oct 28, 2020
Philip J. Ball, Yingzhen Li, Angus Lamb, Cheng Zhang

Figure 1 for A Study on Efficiency in Continual Learning Inspired by Human Learning

Figure 2 for A Study on Efficiency in Continual Learning Inspired by Human Learning

Figure 3 for A Study on Efficiency in Continual Learning Inspired by Human Learning

Figure 4 for A Study on Efficiency in Continual Learning Inspired by Human Learning

Humans are efficient continual learning systems; we continually learn new skills from birth with finite cells and resources. Our learning is highly optimized both in terms of capacity and time while not suffering from catastrophic forgetting. In this work we study the efficiency of continual learning systems, taking inspiration from human learning. In particular, inspired by the mechanisms of sleep, we evaluate popular pruning-based continual learning algorithms, using PackNet as a case study. First, we identify that weight freezing, which is used in continual learning without biological justification, can result in over $2\times$ as many weights being used for a given level of performance. Secondly, we note the similarity in human day and night time behaviors to the training and pruning phases respectively of PackNet. We study a setting where the pruning phase is given a time budget, and identify connections between iterative pruning and multiple sleep cycles in humans. We show there exists an optimal choice of iteration v.s. epochs given different tasks.

* Accepted at NeurIPS 2020 BabyMind Workshop

Via

Access Paper or Ask Questions

Hierarchical Sparse Variational Autoencoder for Text Encoding

Sep 25, 2020
Victor Prokhorov, Yingzhen Li, Ehsan Shareghi, Nigel Collier

Figure 1 for Hierarchical Sparse Variational Autoencoder for Text Encoding

Figure 2 for Hierarchical Sparse Variational Autoencoder for Text Encoding

Figure 3 for Hierarchical Sparse Variational Autoencoder for Text Encoding

Figure 4 for Hierarchical Sparse Variational Autoencoder for Text Encoding

In this paper we focus on unsupervised representation learning and propose a novel framework, Hierarchical Sparse Variational Autoencoder (HSVAE), that imposes sparsity on sentence representations via direct optimisation of Evidence Lower Bound (ELBO). Our experimental results illustrate that HSVAE is flexible and adapts nicely to the underlying characteristics of the corpus which is reflected by the level of sparsity and its distributional patterns.

Via

Access Paper or Ask Questions

Interpreting Spatially Infinite Generative Models

Jul 24, 2020
Chaochao Lu, Richard E. Turner, Yingzhen Li, Nate Kushman

Figure 1 for Interpreting Spatially Infinite Generative Models

Figure 2 for Interpreting Spatially Infinite Generative Models

Figure 3 for Interpreting Spatially Infinite Generative Models

Figure 4 for Interpreting Spatially Infinite Generative Models

Traditional deep generative models of images and other spatial modalities can only generate fixed sized outputs. The generated images have exactly the same resolution as the training images, which is dictated by the number of layers in the underlying neural network. Recent work has shown, however, that feeding spatial noise vectors into a fully convolutional neural network enables both generation of arbitrary resolution output images as well as training on arbitrary resolution training images. While this work has provided impressive empirical results, little theoretical interpretation was provided to explain the underlying generative process. In this paper we provide a firm theoretical interpretation for infinite spatial generation, by drawing connections to spatial stochastic processes. We use the resulting intuition to improve upon existing spatially infinite generative models to enable more efficient training through a model that we call an infinite generative adversarial network, or $\infty$-GAN. Experiments on world map generation, panoramic images and texture synthesis verify the ability of $\infty$-GAN to efficiently generate images of arbitrary size.

* ICML 2020 workshop on Human Interpretability in Machine Learning (WHI 2020)

Via

Access Paper or Ask Questions

Meta-Learning for Variational Inference

Jul 06, 2020
Ruqi Zhang, Yingzhen Li, Christopher De Sa, Sam Devlin, Cheng Zhang

Figure 1 for Meta-Learning for Variational Inference

Figure 2 for Meta-Learning for Variational Inference

Figure 3 for Meta-Learning for Variational Inference

Figure 4 for Meta-Learning for Variational Inference

Variational inference (VI) plays an essential role in approximate Bayesian inference due to its computational efficiency and broad applicability. Crucial to the performance of VI is the selection of the associated divergence measure, as VI approximates the intractable distribution by minimizing this divergence. In this paper we propose a meta-learning algorithm to learn the divergence metric suited for the task of interest, automating the design of VI methods. In addition, we learn the initialization of the variational parameters without additional cost when our method is deployed in the few-shot learning scenarios. We demonstrate our approach outperforms standard VI on Gaussian mixture distribution approximation, Bayesian neural network regression, image generation with variational autoencoders and recommender systems with a partial variational autoencoder.

Via

Access Paper or Ask Questions

Sliced Kernelized Stein Discrepancy

Jun 30, 2020
Wenbo Gong, Yingzhen Li, José Miguel Hernández-Lobato

Figure 1 for Sliced Kernelized Stein Discrepancy

Figure 2 for Sliced Kernelized Stein Discrepancy

Figure 3 for Sliced Kernelized Stein Discrepancy

Figure 4 for Sliced Kernelized Stein Discrepancy

Kernelized Stein discrepancy (KSD), though being extensively used in goodness-of-fit tests and model learning, suffers from the curse-of-dimensionality. We address this issue by proposing the sliced Stein discrepancy and its scalable and kernelized variants, which employs kernel-based test functions defined on the optimal onedimensional projections instead of the full input in high dimensions. When applied to goodness-of-fit tests, extensive experiments show the proposed discrepancy significantly outperforms KSD and various baselines in high dimensions. For model learning, we show its advantages by training an independent component analysis when compared with existing Stein discrepancy baselines. We further propose a novel particle inference method called sliced Stein variational gradient descent (S-SVGD) which alleviates the mode-collapse issue of SVGD in training variational autoencoders.

Via

Access Paper or Ask Questions

A Causal View on Robustness of Neural Networks

May 03, 2020
Cheng Zhang, Kun Zhang, Yingzhen Li

Figure 1 for A Causal View on Robustness of Neural Networks

Figure 2 for A Causal View on Robustness of Neural Networks

Figure 3 for A Causal View on Robustness of Neural Networks

Figure 4 for A Causal View on Robustness of Neural Networks

We present a causal view on the robustness of neural networks against input manipulations, which applies not only to traditional classification tasks but also to general measurement data. Based on this view, we design a deep causal manipulation augmented model (deep CAMA) which explicitly models possible manipulations on certain causes leading to changes in the observed effect. We further develop data augmentation and test-time fine-tuning methods to improve deep CAMA's robustness. When compared with discriminative deep neural networks, our proposed model shows superior robustness against unseen manipulations. As a by-product, our model achieves disentangled representation which separates the representation of manipulations from those of other latent causes.

Via

Access Paper or Ask Questions

Inverse Graphics GAN: Learning to Generate 3D Shapes from Unstructured 2D Data

Feb 28, 2020
Sebastian Lunz, Yingzhen Li, Andrew Fitzgibbon, Nate Kushman

Figure 1 for Inverse Graphics GAN: Learning to Generate 3D Shapes from Unstructured 2D Data

Figure 2 for Inverse Graphics GAN: Learning to Generate 3D Shapes from Unstructured 2D Data

Figure 3 for Inverse Graphics GAN: Learning to Generate 3D Shapes from Unstructured 2D Data

Figure 4 for Inverse Graphics GAN: Learning to Generate 3D Shapes from Unstructured 2D Data

Recent work has shown the ability to learn generative models for 3D shapes from only unstructured 2D images. However, training such models requires differentiating through the rasterization step of the rendering process, therefore past work has focused on developing bespoke rendering models which smooth over this non-differentiable process in various ways. Such models are thus unable to take advantage of the photo-realistic, fully featured, industrial renderers built by the gaming and graphics industry. In this paper we introduce the first scalable training technique for 3D generative models from 2D data which utilizes an off-the-shelf non-differentiable renderer. To account for the non-differentiability, we introduce a proxy neural renderer to match the output of the non-differentiable renderer. We further propose discriminator output matching to ensure that the neural renderer learns to smooth over the rasterization appropriately. We evaluate our model on images rendered from our generated 3D shapes, and show that our model can consistently learn to generate better shapes than existing models when trained with exclusively unstructured 2D images.

* 8 pages paper, 3 pages references, 18 pages appendix

Via

Access Paper or Ask Questions

Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck

Oct 28, 2019
Maximilian Igl, Kamil Ciosek, Yingzhen Li, Sebastian Tschiatschek, Cheng Zhang, Sam Devlin, Katja Hofmann

Figure 1 for Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck

Figure 2 for Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck

Figure 3 for Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck

Figure 4 for Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck

The ability for policies to generalize to new environments is key to the broad application of RL agents. A promising approach to prevent an agent's policy from overfitting to a limited set of training environments is to apply regularization techniques originally developed for supervised learning. However, there are stark differences between supervised learning and RL. We discuss those differences and propose modifications to existing regularization techniques in order to better adapt them to RL. In particular, we focus on regularization techniques relying on the injection of noise into the learned function, a family that includes some of the most widely used approaches such as Dropout and Batch Normalization. To adapt them to RL, we propose Selective Noise Injection (SNI), which maintains the regularizing effect the injected noise has, while mitigating the adverse effects it has on the gradient quality. Furthermore, we demonstrate that the Information Bottleneck (IB) is a particularly well suited regularization technique for RL as it is effective in the low-data regime encountered early on in training RL agents. Combining the IB with SNI, we significantly outperform current state of the art results, including on the recently proposed generalization benchmark Coinrun.

* Published at Neurips 2019

Via

Access Paper or Ask Questions

On the Importance of the Kullback-Leibler Divergence Term in Variational Autoencoders for Text Generation

Sep 30, 2019
Victor Prokhorov, Ehsan Shareghi, Yingzhen Li, Mohammad Taher Pilehvar, Nigel Collier

Figure 1 for On the Importance of the Kullback-Leibler Divergence Term in Variational Autoencoders for Text Generation

Figure 2 for On the Importance of the Kullback-Leibler Divergence Term in Variational Autoencoders for Text Generation

Figure 3 for On the Importance of the Kullback-Leibler Divergence Term in Variational Autoencoders for Text Generation

Figure 4 for On the Importance of the Kullback-Leibler Divergence Term in Variational Autoencoders for Text Generation

Variational Autoencoders (VAEs) are known to suffer from learning uninformative latent representation of the input due to issues such as approximated posterior collapse, or entanglement of the latent space. We impose an explicit constraint on the Kullback-Leibler (KL) divergence term inside the VAE objective function. While the explicit constraint naturally avoids posterior collapse, we use it to further understand the significance of the KL term in controlling the information transmitted through the VAE channel. Within this framework, we explore different properties of the estimated posterior distribution, and highlight the trade-off between the amount of information encoded in a latent code during training, and the generative capacity of the model.

* 10 pages; Accepted in 3rd Workshop on Neural Generation and Translation (WNGT 2019)

Via

Access Paper or Ask Questions