Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tom Rainforth

Deep Stochastic Processes via Functional Markov Transition Operators

May 24, 2023

Jin Xu, Emilien Dupont, Kaspar Märtens, Tom Rainforth, Yee Whye Teh

Abstract:We introduce Markov Neural Processes (MNPs), a new class of Stochastic Processes (SPs) which are constructed by stacking sequences of neural parameterised Markov transition operators in function space. We prove that these Markov transition operators can preserve the exchangeability and consistency of SPs. Therefore, the proposed iterative construction adds substantial flexibility and expressivity to the original framework of Neural Processes (NPs) without compromising consistency or adding restrictions. Our experiments demonstrate clear advantages of MNPs over baseline models on a variety of tasks.

* 18 pages, 5 figures

Via

Access Paper or Ask Questions

Prediction-Oriented Bayesian Active Learning

Apr 17, 2023

Freddie Bickford Smith, Andreas Kirsch, Sebastian Farquhar, Yarin Gal, Adam Foster, Tom Rainforth

Figure 1 for Prediction-Oriented Bayesian Active Learning

Figure 2 for Prediction-Oriented Bayesian Active Learning

Figure 3 for Prediction-Oriented Bayesian Active Learning

Figure 4 for Prediction-Oriented Bayesian Active Learning

Abstract:Information-theoretic approaches to active learning have traditionally focused on maximising the information gathered about the model parameters, most commonly by optimising the BALD score. We highlight that this can be suboptimal from the perspective of predictive performance. For example, BALD lacks a notion of an input distribution and so is prone to prioritise data of limited relevance. To address this we propose the expected predictive information gain (EPIG), an acquisition function that measures information gain in the space of predictions rather than parameters. We find that using EPIG leads to stronger predictive performance compared with BALD across a range of datasets and models, and thus provides an appealing drop-in replacement.

* Published at AISTATS 2023

Via

Access Paper or Ask Questions

Incorporating Unlabelled Data into Bayesian Neural Networks

Apr 04, 2023

Mrinank Sharma, Tom Rainforth, Yee Whye Teh, Vincent Fortuin

Abstract:We develop a contrastive framework for learning better prior distributions for Bayesian Neural Networks (BNNs) using unlabelled data. With this framework, we propose a practical BNN algorithm that offers the label-efficiency of self-supervised learning and the principled uncertainty estimates of Bayesian methods. Finally, we demonstrate the advantages of our approach for data-efficient learning in semi-supervised and low-budget active learning problems.

Via

Access Paper or Ask Questions

Modern Bayesian Experimental Design

Feb 28, 2023

Tom Rainforth, Adam Foster, Desi R Ivanova, Freddie Bickford Smith

Figure 1 for Modern Bayesian Experimental Design

Figure 2 for Modern Bayesian Experimental Design

Abstract:Bayesian experimental design (BED) provides a powerful and general framework for optimizing the design of experiments. However, its deployment often poses substantial computational challenges that can undermine its practical use. In this review, we outline how recent advances have transformed our ability to overcome these challenges and thus utilize BED effectively, before discussing some key areas for future development in the field.

Via

Access Paper or Ask Questions

CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design

Feb 27, 2023

Desi R. Ivanova, Joel Jennings, Tom Rainforth, Cheng Zhang, Adam Foster

Figure 1 for CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design

Figure 2 for CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design

Figure 3 for CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design

Figure 4 for CO-BED: Information-Theoretic Contextual Optimization via Bayesian Experimental Design

Abstract:We formalize the problem of contextual optimization through the lens of Bayesian experimental design and propose CO-BED -- a general, model-agnostic framework for designing contextual experiments using information-theoretic principles. After formulating a suitable information-based objective, we employ black-box variational methods to simultaneously estimate it and optimize the designs in a single stochastic gradient scheme. We further introduce a relaxation scheme to allow discrete actions to be accommodated. As a result, CO-BED provides a general and automated solution to a wide range of contextual optimization problems. We illustrate its effectiveness in a number of experiments, where CO-BED demonstrates competitive performance even when compared to bespoke, model-specific alternatives.

* 9 pages, 6 figures

Via

Access Paper or Ask Questions

Do Bayesian Neural Networks Need To Be Fully Stochastic?

Nov 11, 2022

Mrinank Sharma, Sebastian Farquhar, Eric Nalisnick, Tom Rainforth

Figure 1 for Do Bayesian Neural Networks Need To Be Fully Stochastic?

Figure 2 for Do Bayesian Neural Networks Need To Be Fully Stochastic?

Figure 3 for Do Bayesian Neural Networks Need To Be Fully Stochastic?

Figure 4 for Do Bayesian Neural Networks Need To Be Fully Stochastic?

Abstract:We investigate the efficacy of treating all the parameters in a Bayesian neural network stochastically and find compelling theoretical and empirical evidence that this standard construction may be unnecessary. To this end, we prove that expressive predictive distributions require only small amounts of stochasticity. In particular, partially stochastic networks with only $n$ stochastic biases are universal probabilistic predictors for $n$-dimensional predictive problems. In empirical investigations, we find no systematic benefit of full stochasticity across four different inference modalities and eight datasets; partially stochastic networks can match and sometimes even outperform fully stochastic networks, despite their reduced memory costs.

Via

Access Paper or Ask Questions

Learning Instance-Specific Data Augmentations

May 31, 2022

Ning Miao, Emile Mathieu, Yann Dubois, Tom Rainforth, Yee Whye Teh, Adam Foster, Hyunjik Kim

Figure 1 for Learning Instance-Specific Data Augmentations

Figure 2 for Learning Instance-Specific Data Augmentations

Figure 3 for Learning Instance-Specific Data Augmentations

Figure 4 for Learning Instance-Specific Data Augmentations

Abstract:Existing data augmentation methods typically assume independence between transformations and inputs: they use the same transformation distribution for all input instances. We explain why this can be problematic and propose InstaAug, a method for automatically learning input-specific augmentations from data. This is achieved by introducing an augmentation module that maps an input to a distribution over transformations. This is simultaneously trained alongside the base model in a fully end-to-end manner using only the training data. We empirically demonstrate that InstaAug learns meaningful augmentations for a wide range of transformation classes, which in turn provides better performance on supervised and self-supervised tasks compared with augmentations that assume input--transformation independence.

Via

Access Paper or Ask Questions

A Continuous Time Framework for Discrete Denoising Models

May 30, 2022

Andrew Campbell, Joe Benton, Valentin De Bortoli, Tom Rainforth, George Deligiannidis, Arnaud Doucet

Figure 1 for A Continuous Time Framework for Discrete Denoising Models

Figure 2 for A Continuous Time Framework for Discrete Denoising Models

Figure 3 for A Continuous Time Framework for Discrete Denoising Models

Figure 4 for A Continuous Time Framework for Discrete Denoising Models

Abstract:We provide the first complete continuous time framework for denoising diffusion models of discrete data. This is achieved by formulating the forward noising process and corresponding reverse time generative process as Continuous Time Markov Chains (CTMCs). The model can be efficiently trained using a continuous time version of the ELBO. We simulate the high dimensional CTMC using techniques developed in chemical physics and exploit our continuous time framework to derive high performance samplers that we show can outperform discrete time methods for discrete data. The continuous time treatment also enables us to derive a novel theoretical result bounding the error between the generated sample distribution and the true data distribution.

* 41 pages, 12 figures

Via

Access Paper or Ask Questions

Active Surrogate Estimators: An Active Learning Approach to Label-Efficient Model Evaluation

Feb 14, 2022

Jannik Kossen, Sebastian Farquhar, Yarin Gal, Tom Rainforth

Figure 1 for Active Surrogate Estimators: An Active Learning Approach to Label-Efficient Model Evaluation

Figure 2 for Active Surrogate Estimators: An Active Learning Approach to Label-Efficient Model Evaluation

Figure 3 for Active Surrogate Estimators: An Active Learning Approach to Label-Efficient Model Evaluation

Figure 4 for Active Surrogate Estimators: An Active Learning Approach to Label-Efficient Model Evaluation

Abstract:We propose Active Surrogate Estimators (ASEs), a new method for label-efficient model evaluation. Evaluating model performance is a challenging and important problem when labels are expensive. ASEs address this active testing problem using a surrogate-based estimation approach, whereas previous methods have focused on Monte Carlo estimates. ASEs actively learn the underlying surrogate, and we propose a novel acquisition strategy, XWING, that tailors this learning to the final estimation task. We find that ASEs offer greater label-efficiency than the current state-of-the-art when applied to challenging model evaluation problems for deep neural networks. We further theoretically analyze ASEs' errors.

Via

Access Paper or Ask Questions

Implicit Deep Adaptive Design: Policy-Based Experimental Design without Likelihoods

Nov 03, 2021

Desi R. Ivanova, Adam Foster, Steven Kleinegesse, Michael U. Gutmann, Tom Rainforth

Figure 1 for Implicit Deep Adaptive Design: Policy-Based Experimental Design without Likelihoods

Figure 2 for Implicit Deep Adaptive Design: Policy-Based Experimental Design without Likelihoods

Figure 3 for Implicit Deep Adaptive Design: Policy-Based Experimental Design without Likelihoods

Figure 4 for Implicit Deep Adaptive Design: Policy-Based Experimental Design without Likelihoods

Abstract:We introduce implicit Deep Adaptive Design (iDAD), a new method for performing adaptive experiments in real-time with implicit models. iDAD amortizes the cost of Bayesian optimal experimental design (BOED) by learning a design policy network upfront, which can then be deployed quickly at the time of the experiment. The iDAD network can be trained on any model which simulates differentiable samples, unlike previous design policy work that requires a closed form likelihood and conditionally independent experiments. At deployment, iDAD allows design decisions to be made in milliseconds, in contrast to traditional BOED approaches that require heavy computation during the experiment itself. We illustrate the applicability of iDAD on a number of experiments, and show that it provides a fast and effective mechanism for performing adaptive design with implicit models.

* 33 pages, 8 figures. Published as a conference paper at NeurIPS 2021

Via

Access Paper or Ask Questions