Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tuan Anh Le

Drawing out of Distribution with Neuro-Symbolic Generative Models

Jun 03, 2022

Yichao Liang, Joshua B. Tenenbaum, Tuan Anh Le, N. Siddharth

Figure 1 for Drawing out of Distribution with Neuro-Symbolic Generative Models

Figure 2 for Drawing out of Distribution with Neuro-Symbolic Generative Models

Figure 3 for Drawing out of Distribution with Neuro-Symbolic Generative Models

Figure 4 for Drawing out of Distribution with Neuro-Symbolic Generative Models

Abstract:Learning general-purpose representations from perceptual inputs is a hallmark of human intelligence. For example, people can write out numbers or characters, or even draw doodles, by characterizing these tasks as different instantiations of the same generic underlying process -- compositional arrangements of different forms of pen strokes. Crucially, learning to do one task, say writing, implies reasonable competence at another, say drawing, on account of this shared process. We present Drawing out of Distribution (DooD), a neuro-symbolic generative model of stroke-based drawing that can learn such general-purpose representations. In contrast to prior work, DooD operates directly on images, requires no supervision or expensive test-time inference, and performs unsupervised amortised inference with a symbolic stroke model that better enables both interpretability and generalization. We evaluate DooD on its ability to generalise across both data and tasks. We first perform zero-shot transfer from one dataset (e.g. MNIST) to another (e.g. Quickdraw), across five different datasets, and show that DooD clearly outperforms different baselines. An analysis of the learnt representations further highlights the benefits of adopting a symbolic stroke model. We then adopt a subset of the Omniglot challenge tasks, and evaluate its ability to generate new exemplars (both unconditionally and conditionally), and perform one-shot classification, showing that DooD matches the state of the art. Taken together, we demonstrate that DooD does indeed capture general-purpose representations across both data and task, and takes a further step towards building general and robust concept-learning systems.

* Submitted to NeurIPS 2022

Via

Access Paper or Ask Questions

Hybrid Memoised Wake-Sleep: Approximate Inference at the Discrete-Continuous Interface

Jul 04, 2021

Tuan Anh Le, Katherine M. Collins, Luke Hewitt, Kevin Ellis, Siddharth N, Samuel J. Gershman, Joshua B. Tenenbaum

Figure 1 for Hybrid Memoised Wake-Sleep: Approximate Inference at the Discrete-Continuous Interface

Figure 2 for Hybrid Memoised Wake-Sleep: Approximate Inference at the Discrete-Continuous Interface

Figure 3 for Hybrid Memoised Wake-Sleep: Approximate Inference at the Discrete-Continuous Interface

Figure 4 for Hybrid Memoised Wake-Sleep: Approximate Inference at the Discrete-Continuous Interface

Abstract:Modeling complex phenomena typically involves the use of both discrete and continuous variables. Such a setting applies across a wide range of problems, from identifying trends in time-series data to performing effective compositional scene understanding in images. Here, we propose Hybrid Memoised Wake-Sleep (HMWS), an algorithm for effective inference in such hybrid discrete-continuous models. Prior approaches to learning suffer as they need to perform repeated expensive inner-loop discrete inference. We build on a recent approach, Memoised Wake-Sleep (MWS), which alleviates part of the problem by memoising discrete variables, and extend it to allow for a principled and effective way to handle continuous variables by learning a separate recognition model used for importance-sampling based approximate inference and marginalization. We evaluate HMWS in the GP-kernel learning and 3D scene understanding domains, and show that it outperforms current state-of-the-art inference methods.

Via

Access Paper or Ask Questions

Learning Evolved Combinatorial Symbols with a Neuro-symbolic Generative Model

Apr 16, 2021

Matthias Hofer, Tuan Anh Le, Roger Levy, Josh Tenenbaum

Figure 1 for Learning Evolved Combinatorial Symbols with a Neuro-symbolic Generative Model

Figure 2 for Learning Evolved Combinatorial Symbols with a Neuro-symbolic Generative Model

Figure 3 for Learning Evolved Combinatorial Symbols with a Neuro-symbolic Generative Model

Figure 4 for Learning Evolved Combinatorial Symbols with a Neuro-symbolic Generative Model

Abstract:Humans have the ability to rapidly understand rich combinatorial concepts from limited data. Here we investigate this ability in the context of auditory signals, which have been evolved in a cultural transmission experiment to study the emergence of combinatorial structure in language. We propose a neuro-symbolic generative model which combines the strengths of previous approaches to concept learning. Our model performs fast inference drawing on neural network methods, while still retaining the interpretability and generalization from limited data seen in structured generative approaches. This model outperforms a purely neural network-based approach on classification as evaluated against both ground truth and human experimental classification preferences, and produces superior reproductions of observed signals as well. Our results demonstrate the power of flexible combined neural-symbolic architectures for human-like generalization in raw perceptual domains and offers a step towards developing precise computational models of inductive biases in language evolution.

Via

Access Paper or Ask Questions

Learning to learn generative programs with Memoised Wake-Sleep

Jul 06, 2020

Luke B. Hewitt, Tuan Anh Le, Joshua B. Tenenbaum

Figure 1 for Learning to learn generative programs with Memoised Wake-Sleep

Figure 2 for Learning to learn generative programs with Memoised Wake-Sleep

Figure 3 for Learning to learn generative programs with Memoised Wake-Sleep

Figure 4 for Learning to learn generative programs with Memoised Wake-Sleep

Abstract:We study a class of neuro-symbolic generative models in which neural networks are used both for inference and as priors over symbolic, data-generating programs. As generative models, these programs capture compositional structures in a naturally explainable form. To tackle the challenge of performing program induction as an 'inner-loop' to learning, we propose the Memoised Wake-Sleep (MWS) algorithm, which extends Wake Sleep by explicitly storing and reusing the best programs discovered by the inference network throughout training. We use MWS to learn accurate, explainable models in three challenging domains: stroke-based character modelling, cellular automata, and few-shot learning in a novel dataset of real-world string concepts.

Via

Access Paper or Ask Questions

Semi-supervised Sequential Generative Models

Jun 30, 2020

Michael Teng, Tuan Anh Le, Adam Scibior, Frank Wood

Figure 1 for Semi-supervised Sequential Generative Models

Figure 2 for Semi-supervised Sequential Generative Models

Figure 3 for Semi-supervised Sequential Generative Models

Figure 4 for Semi-supervised Sequential Generative Models

Abstract:We introduce a novel objective for training deep generative time-series models with discrete latent variables for which supervision is only sparsely available. This instance of semi-supervised learning is challenging for existing methods, because the exponential number of possible discrete latent configurations results in high variance gradient estimators. We first overcome this problem by extending the standard semi-supervised generative modeling objective with reweighted wake-sleep. However, we find that this approach still suffers when the frequency of available labels varies between training sequences. Finally, we introduce a unified objective inspired by teacher-forcing and show that this approach is robust to variable length supervision. We call the resulting method caffeinated wake-sleep (CWS) to emphasize its additional dependence on real data. We demonstrate its effectiveness with experiments on MNIST, handwriting, and fruit fly trajectory data.

* Accepted to Uncertainty in Artificial Intelligence 2020

Via

Access Paper or Ask Questions

Amortized Population Gibbs Samplers with Neural Sufficient Statistics

Nov 04, 2019

Hao Wu, Heiko Zimmermann, Eli Sennesh, Tuan Anh Le, Jan-Willem van de Meent

Figure 1 for Amortized Population Gibbs Samplers with Neural Sufficient Statistics

Figure 2 for Amortized Population Gibbs Samplers with Neural Sufficient Statistics

Figure 3 for Amortized Population Gibbs Samplers with Neural Sufficient Statistics

Figure 4 for Amortized Population Gibbs Samplers with Neural Sufficient Statistics

Abstract:We develop amortized population Gibbs (APG) samplers, a new class of autoencoding variational methods for deep probabilistic models. APG samplers construct high-dimensional proposals by iterating over updates to lower-dimensional blocks of variables. Each conditional update is a neural proposal, which we train by minimizing the inclusive KL divergence relative to the conditional posterior. To appropriately account for the size of the input data, we develop a new parameterization in terms of neural sufficient statistics, resulting in quasi-conjugate variational approximations. Experiments demonstrate that learned proposals converge to the known analytical conditional posterior in conjugate models, and that APG samplers can learn inference networks for highly-structured deep generative models when the conditional posteriors are intractable. Here APG samplers offer a path toward scaling up stochastic variational methods to models in which standard autoencoding architectures fail to produce accurate samples.

Via

Access Paper or Ask Questions

The Thermodynamic Variational Objective

Jun 28, 2019

Vaden Masrani, Tuan Anh Le, Frank Wood

Figure 1 for The Thermodynamic Variational Objective

Figure 2 for The Thermodynamic Variational Objective

Figure 3 for The Thermodynamic Variational Objective

Figure 4 for The Thermodynamic Variational Objective

Abstract:We introduce the thermodynamic variational objective (TVO) for learning in both continuous and discrete deep generative models. The TVO arises from a key connection between variational inference and thermodynamic integration that results in a tighter lower bound to the log marginal likelihood than the standard variational evidence lower bound (ELBO), while remaining as broadly applicable. We provide a computationally efficient gradient estimator for the TVO that applies to continuous, discrete, and non-reparameterizable distributions and show that the objective functions used in variational inference, variational autoencoders, wake sleep, and inference compilation are all special cases of the TVO. We evaluate the TVO for learning of discrete and continuous variational auto encoders, and find it achieves state of the art for learning in discrete variable models, and outperform VAEs on continuous variable models without using the reparameterization trick.

Via

Access Paper or Ask Questions

Imitation Learning of Factored Multi-agent Reactive Models

Mar 12, 2019

Michael Teng, Tuan Anh Le, Adam Scibior, Frank Wood

Figure 1 for Imitation Learning of Factored Multi-agent Reactive Models

Figure 2 for Imitation Learning of Factored Multi-agent Reactive Models

Figure 3 for Imitation Learning of Factored Multi-agent Reactive Models

Figure 4 for Imitation Learning of Factored Multi-agent Reactive Models

Abstract:We apply recent advances in deep generative modeling to the task of imitation learning from biological agents. Specifically, we apply variations of the variational recurrent neural network model to a multi-agent setting where we learn policies of individual uncoordinated agents acting based on their perceptual inputs and their hidden belief state. We learn stochastic policies for these agents directly from observational data, without constructing a reward function. An inference network learned jointly with the policy allows for efficient inference over the agent's belief state given a sequence of its current perceptual inputs and the prior actions it performed, which lets us extrapolate observed sequences of behavior into the future while maintaining uncertainty estimates over future trajectories. We test our approach on a dataset of flies interacting in a 2D environment, where we demonstrate better predictive performance than existing approaches which learn deterministic policies with recurrent neural networks. We further show that the uncertainty estimates over future trajectories we obtain are well calibrated, which makes them useful for a variety of downstream processing tasks.

Via

Access Paper or Ask Questions

Tighter Variational Bounds are Not Necessarily Better

Jun 25, 2018

Tom Rainforth, Adam R. Kosiorek, Tuan Anh Le, Chris J. Maddison, Maximilian Igl, Frank Wood, Yee Whye Teh

Figure 1 for Tighter Variational Bounds are Not Necessarily Better

Figure 2 for Tighter Variational Bounds are Not Necessarily Better

Figure 3 for Tighter Variational Bounds are Not Necessarily Better

Figure 4 for Tighter Variational Bounds are Not Necessarily Better

Abstract:We provide theoretical and empirical evidence that using tighter evidence lower bounds (ELBOs) can be detrimental to the process of learning an inference network by reducing the signal-to-noise ratio of the gradient estimator. Our results call into question common implicit assumptions that tighter ELBOs are better variational objectives for simultaneous model learning and inference amortization schemes. Based on our insights, we introduce three new algorithms: the partially importance weighted auto-encoder (PIWAE), the multiply importance weighted auto-encoder (MIWAE), and the combination importance weighted auto-encoder (CIWAE), each of which includes the standard importance weighted auto-encoder (IWAE) as a special case. We show that each can deliver improvements over IWAE, even when performance is measured by the IWAE target itself. Furthermore, our results suggest that PIWAE may be able to deliver simultaneous improvements in the training of both the inference and generative networks.

* To appear at ICML 2018

Via

Access Paper or Ask Questions

Deep Variational Reinforcement Learning for POMDPs

Jun 06, 2018

Maximilian Igl, Luisa Zintgraf, Tuan Anh Le, Frank Wood, Shimon Whiteson

Figure 1 for Deep Variational Reinforcement Learning for POMDPs

Figure 2 for Deep Variational Reinforcement Learning for POMDPs

Figure 3 for Deep Variational Reinforcement Learning for POMDPs

Figure 4 for Deep Variational Reinforcement Learning for POMDPs

Abstract:Many real-world sequential decision making problems are partially observable by nature, and the environment model is typically unknown. Consequently, there is great need for reinforcement learning methods that can tackle such problems given only a stream of incomplete and noisy observations. In this paper, we propose deep variational reinforcement learning (DVRL), which introduces an inductive bias that allows an agent to learn a generative model of the environment and perform inference in that model to effectively aggregate the available information. We develop an n-step approximation to the evidence lower bound (ELBO), allowing the model to be trained jointly with the policy. This ensures that the latent state representation is suitable for the control task. In experiments on Mountain Hike and flickering Atari we show that our method outperforms previous approaches relying on recurrent neural networks to encode the past.

Via

Access Paper or Ask Questions