Arnaud Doucet

CMLA

Diffusion Schrödinger Bridge with Applications to Score-Based Generative Modeling

Jun 16, 2021

Valentin De Bortoli, James Thornton, Jeremy Heng, Arnaud Doucet

Abstract: Progressively applying Gaussian noise transforms complex data distributions to approximately Gaussian. Reversing this dynamic defines a generative model. When the forward noising process is given by a Stochastic Differential Equation (SDE), Song et al. (2021) demonstrate how the time-inhomogeneous drift of the associated reverse-time SDE may be estimated using score-matching. A limitation of this approach is that the forward-time SDE must be run for a sufficiently long time for the final distribution to be approximately Gaussian. In contrast, solving the Schrödinger Bridge problem (SB), i.e. an entropy-regularized optimal transport problem on path spaces, yields diffusions which generate samples from the data distribution in finite time. We present Diffusion SB (DSB), an original approximation of the Iterative Proportional Fitting (IPF) procedure to solve the SB problem, and provide theoretical analysis along with generative modeling experiments. The first DSB iteration recovers the methodology proposed by Song et al. (2021), with the flexibility of using shorter time intervals, as subsequent DSB iterations reduce the discrepancy between the final-time marginal of the forward (resp. backward) SDE with respect to the prior (resp. data) distribution. Beyond generative modeling, DSB offers a widely applicable computational optimal transport tool as the continuous state-space analogue of the popular Sinkhorn algorithm (Cuturi, 2013).

* 57 pages, 17 figures
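
The abstract describes DSB as the continuous state-space analogue of the Sinkhorn algorithm. For orientation only, here is a minimal sketch of discrete Sinkhorn, i.e. IPF on a finite cost matrix. It is not the DSB method itself. The marginals `a` and `b`, the support points, the squared-distance cost, the regularization `eps` and the iteration count are all illustrative assumptions.

```python
import math


def sinkhorn(a, b, cost, eps=1.0, num_iters=500):
    """Entropy-regularized OT between discrete marginals a and b.

    Alternately rescales rows and columns of the Gibbs kernel
    K = exp(-cost / eps) so that the coupling matches each marginal in turn.
    """
    n, m = len(a), len(b)
    K = [[math.exp(-cost[i][j] / eps) for j in range(m)] for i in range(n)]
    u = [1.0] * n
    v = [1.0] * m
    for _ in range(num_iters):
        # Fit the row marginal a, holding v fixed.
        u = [a[i] / sum(K[i][j] * v[j] for j in range(m)) for i in range(n)]
        # Fit the column marginal b, holding u fixed.
        v = [b[j] / sum(K[i][j] * u[i] for i in range(n)) for j in range(m)]
    return [[u[i] * K[i][j] * v[j] for j in range(m)] for i in range(n)]


# Toy problem (assumed for illustration): two histograms on the points 0, 1, 2.
a = [0.5, 0.3, 0.2]
b = [0.25, 0.25, 0.5]
points = [0.0, 1.0, 2.0]
cost = [[(x - y) ** 2 for y in points] for x in points]
plan = sinkhorn(a, b, cost)
```

Each half-step is a projection onto the set of couplings with one prescribed marginal, which is the alternating structure that IPF also uses on path space.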

On Instrumental Variable Regression for Deep Offline Policy Evaluation

May 21, 2021

Yutian Chen, Liyuan Xu, Caglar Gulcehre, Tom Le Paine, Arthur Gretton, Nando de Freitas, Arnaud Doucet

Abstract: We show that the popular reinforcement learning (RL) strategy of estimating the state-action value (Q-function) by minimizing the mean squared Bellman error leads to a regression problem with confounding, the inputs and output noise being correlated. Hence, direct minimization of the Bellman error can result in significantly biased Q-function estimates. We explain why fixing the target Q-network in Deep Q-Networks and Fitted Q Evaluation provides a way of overcoming this confounding, thus shedding new light on this popular but not well understood trick in the deep RL literature. An alternative approach to address confounding is to leverage techniques developed in the causality literature, notably instrumental variables (IV). We bring together here the literature on IV and RL by investigating whether IV approaches can lead to improved Q-function estimates. This paper analyzes and compares a wide range of recent IV methods in the context of offline policy evaluation (OPE), where the goal is to estimate the value of a policy using logged data only. By applying different IV techniques to OPE, we are not only able to recover previously proposed OPE methods such as model-based techniques but also to obtain competitive new techniques. We find empirically that state-of-the-art OPE methods are closely matched in performance by some IV methods such as AGMM, which were not developed for OPE. We open-source all our code and datasets at https://github.com/liyuan9988/IVOPEwithACME.

Invertible Flow Non Equilibrium sampling

Mar 17, 2021

Achille Thin, Yazid Janati, Sylvain Le Corff, Charles Ollion, Arnaud Doucet, Alain Durmus, Eric Moulines, Christian Robert

Abstract: Simultaneously sampling from a complex distribution with intractable normalizing constant and approximating expectations under this distribution is a notoriously challenging problem. We introduce a novel scheme, Invertible Flow Non Equilibrium Sampling (InFine), which departs from classical Sequential Monte Carlo (SMC) and Markov chain Monte Carlo (MCMC) approaches. InFine constructs unbiased estimators of expectations and in particular of normalizing constants by combining the orbits of a deterministic transform started from random initializations. When this transform is chosen as an appropriate integrator of a conformal Hamiltonian system, these orbits are optimization paths. InFine is also naturally suited to design new MCMC sampling schemes by selecting samples on the optimization paths. Additionally, InFine can be used to construct an Evidence Lower Bound (ELBO) leading to a new class of Variational AutoEncoders (VAE).

COIN: COmpression with Implicit Neural representations

Mar 03, 2021

Emilien Dupont, Adam Goliński, Milad Alizadeh, Yee Whye Teh, Arnaud Doucet

Abstract: We propose a new simple approach for image compression: instead of storing the RGB values for each pixel of an image, we store the weights of a neural network overfitted to the image. Specifically, to encode an image, we fit it with an MLP which maps pixel locations to RGB values. We then quantize and store the weights of this MLP as a code for the image. To decode the image, we simply evaluate the MLP at every pixel location. We found that this simple approach outperforms JPEG at low bit-rates, even without entropy coding or learning a distribution over weights. While our framework is not yet competitive with state-of-the-art compression methods, we show that it has various attractive properties which could make it a viable alternative to other neural data compression approaches.
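
As a rough illustration of the "quantize and store the weights" step, and of how the resulting bit-rate can be counted, the sketch below applies uniform scalar quantization to a flat list of weights. The abstract does not say which quantizer is used, so uniform quantization, the function names, the stand-in weights and the image size are all assumptions made for illustration.

```python
import math


def quantize(weights, bits):
    """Map each weight to an integer code in [0, 2**bits - 1] (uniform grid)."""
    lo, hi = min(weights), max(weights)
    levels = (1 << bits) - 1
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((w - lo) / scale) for w in weights]
    return codes, lo, scale


def dequantize(codes, lo, scale):
    """Recover approximate weights from integer codes."""
    return [lo + c * scale for c in codes]


def bits_per_pixel(num_weights, bits, height, width):
    """Bit-rate when the code for an image is its quantized weight vector."""
    return num_weights * bits / (height * width)


# Stand-in for the flattened weights of a small MLP (hypothetical values).
weights = [math.sin(i) for i in range(1000)]
codes, lo, scale = quantize(weights, bits=8)
restored = dequantize(codes, lo, scale)
max_err = max(abs(w - r) for w, r in zip(weights, restored))
rate = bits_per_pixel(len(weights), 8, height=100, width=100)
```

In this sketch the bit-rate depends only on the number of weights, the bits per weight and the pixel count, so a smaller network or coarser quantization lowers the rate at the cost of reconstruction quality.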

Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding

Feb 22, 2021

Yangjun Ruan, Karen Ullrich, Daniel Severo, James Townsend, Ashish Khisti, Arnaud Doucet, Alireza Makhzani, Chris J. Maddison

Abstract: Latent variable models have been successfully applied in lossless compression with the bits-back coding algorithm. However, bits-back suffers from an increase in the bitrate equal to the KL divergence between the approximate posterior and the true posterior. In this paper, we show how to remove this gap asymptotically by deriving bits-back coding algorithms from tighter variational bounds. The key idea is to exploit extended space representations of Monte Carlo estimators of the marginal likelihood. Naively applied, our schemes would require more initial bits than the standard bits-back coder, but we show how to drastically reduce this additional cost with couplings in the latent space. When parallel architectures can be exploited, our coders can achieve better rates than bits-back with little additional cost. We demonstrate improved lossless compression rates in a variety of settings, including entropy coding for lossy compression.
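
The KL gap mentioned in the abstract can be written out explicitly. The notation below is assumed here rather than taken from the abstract: x is the data, z the latent variable, p(x, z) the model, and q(z | x) the approximate posterior. The first identity is the standard decomposition of the negative ELBO. The second display is a K-sample importance-weighted bound, given as one standard example of a tighter variational bound built from a Monte Carlo estimator of the marginal likelihood.

```latex
% Expected net code length of standard bits-back (negative ELBO):
\mathbb{E}_{q(z \mid x)}\!\left[ -\log p(x, z) + \log q(z \mid x) \right]
  = -\log p(x) + \mathrm{KL}\!\left( q(z \mid x) \,\middle\|\, p(z \mid x) \right)

% K-sample importance-weighted bound (K = 1 recovers the ELBO):
\mathcal{L}_K
  = \mathbb{E}_{z_1, \dots, z_K \sim q(z \mid x)}
    \left[ \log \frac{1}{K} \sum_{k=1}^{K} \frac{p(x, z_k)}{q(z_k \mid x)} \right]
  \le \log p(x),
\qquad
\mathcal{L}_1 \le \mathcal{L}_2 \le \dots \le \log p(x)
```

Under mild conditions on the importance weights, the bound approaches log p(x) as K grows, which is the sense in which such a gap can be removed asymptotically.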

Differentiable Particle Filtering via Entropy-Regularized Optimal Transport

Feb 15, 2021

Adrien Corenflos, James Thornton, Arnaud Doucet, George Deligiannidis

Abstract: Particle Filtering (PF) methods are an established class of procedures for performing inference in non-linear state-space models. Resampling is a key ingredient of PF, necessary to obtain low variance likelihood and state estimates. However, traditional resampling methods result in PF-based loss functions being non-differentiable with respect to model and PF parameters. In a variational inference context, resampling also yields high variance gradient estimates of the PF-based evidence lower bound. By leveraging optimal transport ideas, we introduce a principled differentiable particle filter and provide convergence results. We demonstrate this novel method on a variety of applications.
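
To make the non-differentiability concrete, here is a minimal sketch of a traditional (multinomial) resampling step. It is not the optimal-transport resampler proposed in the paper. The function name, particle values and weights are hypothetical.

```python
import random


def multinomial_resample(particles, weights, rng):
    """Draw len(particles) ancestors with probability proportional to weights."""
    n = len(particles)
    # Ancestor indices are discrete random draws: the output copies particles
    # and does not vary smoothly with the weights.
    ancestors = rng.choices(range(n), weights=weights, k=n)
    return [particles[i] for i in ancestors], ancestors


rng = random.Random(0)
particles = [-1.0, 0.0, 2.5, 4.0]
weights = [0.1, 0.2, 0.6, 0.1]
resampled, ancestors = multinomial_resample(particles, weights, rng)
```

Because the ancestors are integers, the resampled set is piecewise constant in the weights, so a loss computed from it has no useful gradient with respect to the parameters that produced those weights.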

Annealed Flow Transport Monte Carlo

Feb 15, 2021

Michael Arbel, Alexander G. D. G. Matthews, Arnaud Doucet

Abstract: Annealed Importance Sampling (AIS) and its Sequential Monte Carlo (SMC) extensions are state-of-the-art methods for estimating normalizing constants of probability distributions. We propose here a novel Monte Carlo algorithm, Annealed Flow Transport (AFT), that builds upon AIS and SMC and combines them with normalizing flows (NF) for improved performance. This method transports a set of particles using not only importance sampling (IS), Markov chain Monte Carlo (MCMC) and resampling steps, as in SMC, but also NF, which are learned sequentially to push particles towards the successive annealed targets. We provide limit theorems for the resulting Monte Carlo estimates of the normalizing constant and expectations with respect to the target distribution. Additionally, we show that a continuous-time scaling limit of the population version of AFT is given by a Feynman-Kac measure which simplifies to the law of a controlled diffusion for expressive NF. We demonstrate experimentally the benefits and limitations of our methodology on a variety of applications.
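
For readers unfamiliar with the baseline that AFT builds on, the sketch below implements plain AIS on a one-dimensional toy problem, with no normalizing flows and no resampling, so it is not the AFT algorithm. The Gaussian initial distribution, the unnormalized Gaussian target, the geometric annealing path, the random-walk Metropolis moves and every parameter value are assumptions chosen so that the true normalizing constant is known in closed form.

```python
import math
import random


def log_prior(x):
    """Normalized N(0, 1) log-density (initial distribution, Z_0 = 1)."""
    return -0.5 * x * x - 0.5 * math.log(2 * math.pi)


def log_target_unnorm(x):
    """Unnormalized N(1, 0.5^2) log-density; true Z = 0.5 * sqrt(2 * pi)."""
    return -((x - 1.0) ** 2) / (2 * 0.25)


def log_annealed(x, beta):
    """Geometric path between the prior (beta = 0) and the target (beta = 1)."""
    return (1 - beta) * log_prior(x) + beta * log_target_unnorm(x)


def ais_estimate(num_particles=2000, num_temps=20, mh_steps=2, step=0.5, seed=0):
    rng = random.Random(seed)
    betas = [k / num_temps for k in range(num_temps + 1)]
    total = 0.0
    for _ in range(num_particles):
        x = rng.gauss(0.0, 1.0)
        log_w = 0.0
        for b_prev, b in zip(betas[:-1], betas[1:]):
            # Incremental importance weight between successive annealed targets.
            log_w += (b - b_prev) * (log_target_unnorm(x) - log_prior(x))
            # Random-walk Metropolis moves leaving the new annealed target invariant.
            for _ in range(mh_steps):
                prop = x + rng.gauss(0.0, step)
                log_acc = log_annealed(prop, b) - log_annealed(x, b)
                if rng.random() < math.exp(min(0.0, log_acc)):
                    x = prop
        total += math.exp(log_w)
    return total / num_particles


true_z = 0.5 * math.sqrt(2 * math.pi)
z_hat = ais_estimate()
```

The average of the weights is an unbiased estimate of the target's normalizing constant, which is the quantity the abstract says these methods are used to estimate.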

Generative Models as Distributions of Functions

Feb 09, 2021

Emilien Dupont, Yee Whye Teh, Arnaud Doucet

Abstract: Generative models are typically trained on grid-like data such as images. As a result, the size of these models usually scales directly with the underlying grid resolution. In this paper, we abandon discretized grids and instead parameterize individual data points by continuous functions. We then build generative models by learning distributions over such functions. By treating data points as functions, we can abstract away from the specific type of data we train on and construct models that scale independently of signal resolution and dimension. To train our model, we use an adversarial approach with a discriminator that acts directly on continuous signals. Through experiments on both images and 3D shapes, we demonstrate that our model can learn rich distributions of functions independently of data type and resolution.

Learning Deep Features in Instrumental Variable Regression

Nov 01, 2020

Liyuan Xu, Yutian Chen, Siddarth Srinivasan, Nando de Freitas, Arnaud Doucet, Arthur Gretton

Abstract: Instrumental variable (IV) regression is a standard strategy for learning causal relationships between confounded treatment and outcome variables from observational data by utilizing an instrumental variable, which affects the outcome only through the treatment. In classical IV regression, learning proceeds in two stages: stage 1 performs linear regression from the instrument to the treatment; and stage 2 performs linear regression from the treatment to the outcome, conditioned on the instrument. We propose a novel method, deep feature instrumental variable regression (DFIV), to address the case where relations between instruments, treatments, and outcomes may be nonlinear. In this case, deep neural nets are trained to define informative nonlinear features on the instruments and treatments. We propose an alternating training regime for these features to ensure good end-to-end performance when composing stages 1 and 2, thus obtaining highly flexible feature maps in a computationally efficient manner. DFIV outperforms recent state-of-the-art methods on challenging IV benchmarks, including settings involving high dimensional image data. DFIV also exhibits competitive performance in off-policy policy evaluation for reinforcement learning, which can be understood as an IV regression task.
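
The classical two-stage procedure described in the abstract can be sketched for a single scalar instrument and treatment. This is the linear baseline, not DFIV. The synthetic data-generating process, the variable names and the effect size of 2.0 are assumptions made so that the confounding bias is visible.

```python
import random


def mean(values):
    return sum(values) / len(values)


def cov(u, v):
    mu, mv = mean(u), mean(v)
    return sum((ui - mu) * (vi - mv) for ui, vi in zip(u, v)) / len(u)


def ols_slope(x, y):
    """Slope of the least-squares line of y on x."""
    return cov(x, y) / cov(x, x)


def two_stage_least_squares(z, x, y):
    # Stage 1: linear regression from the instrument z to the treatment x.
    a = ols_slope(z, x)
    mz, mx = mean(z), mean(x)
    x_hat = [mx + a * (zi - mz) for zi in z]
    # Stage 2: linear regression from the predicted treatment to the outcome y.
    return ols_slope(x_hat, y)


# Synthetic confounded data: u affects both x and y, z affects y only through x.
rng = random.Random(0)
n = 20000
true_effect = 2.0
z = [rng.gauss(0, 1) for _ in range(n)]
u = [rng.gauss(0, 1) for _ in range(n)]
x = [zi + ui + 0.1 * rng.gauss(0, 1) for zi, ui in zip(z, u)]
y = [true_effect * xi + 3.0 * ui + 0.1 * rng.gauss(0, 1) for xi, ui in zip(x, u)]

naive = ols_slope(x, y)                 # biased by the hidden confounder u
iv = two_stage_least_squares(z, x, y)   # close to true_effect
```

In this toy setup the direct regression overestimates the effect because `u` drives both treatment and outcome, while the two-stage estimate uses only the variation in `x` that is explained by `z`.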

Stable ResNet

Oct 24, 2020

Soufiane Hayou, Eugenio Clerico, Bobby He, George Deligiannidis, Arnaud Doucet, Judith Rousseau

Abstract: Deep ResNet architectures have achieved state-of-the-art performance on many tasks. While they solve the problem of vanishing gradients, they might suffer from exploding gradients as the depth becomes large (Yang et al. 2017). Moreover, recent results have shown that ResNet might lose expressivity as the depth goes to infinity (Yang et al. 2017, Hayou et al. 2019). To resolve these issues, we introduce a new class of ResNet architectures, called Stable ResNet, that have the property of stabilizing the gradient while ensuring expressivity in the infinite depth limit.

* 42 pages, 3 figures
