Ole Winther

Image Super-Resolution With Deep Variational Autoencoders

Mar 17, 2022

Darius Chira, Ilian Haralampiev, Ole Winther, Andrea Dittadi, Valentin Liévin

Abstract: Image super-resolution (SR) techniques are used to generate a high-resolution image from a low-resolution image. Until now, deep generative models such as autoregressive models and Generative Adversarial Networks (GANs) have proven to be effective at modelling high-resolution images. Models based on Variational Autoencoders (VAEs) have often been criticized for their feeble generative performance, but with new advancements such as VDVAE (very deep VAE), there is now strong evidence that deep VAEs have the potential to outperform current state-of-the-art models for high-resolution image generation. In this paper, we introduce VDVAE-SR, a new model that aims to exploit the most recent deep VAE methodologies to improve upon image super-resolution using transfer learning on pretrained VDVAEs. Through qualitative and quantitative evaluations, we show that the proposed model is competitive with other state-of-the-art methods.

DermX: an end-to-end framework for explainable automated dermatological diagnosis

Feb 14, 2022

Raluca Jalaboi, Frederik Faye, Mauricio Orbes-Arteaga, Dan Jørgensen, Ole Winther, Alfiia Galimzianova

Abstract: Dermatological diagnosis automation is essential in addressing the high prevalence of skin diseases and the critical shortage of dermatologists. Despite approaching expert-level diagnosis performance, convolutional neural network (ConvNet) adoption in clinical practice is impeded by their limited explainability, and by subjective, expensive explainability validations. We introduce DermX and DermX+, an end-to-end framework for explainable automated dermatological diagnosis. DermX is a clinically-inspired explainable dermatological diagnosis ConvNet, trained using DermXDB, a dataset of 554 images annotated by eight dermatologists with diagnoses and supporting explanations. DermX+ extends DermX with guided attention training for explanation attention maps. Both methods achieve near-expert diagnosis performance, with DermX, DermX+, and dermatologist F1 scores of 0.79, 0.79, and 0.87, respectively. We assess the explanation plausibility in terms of identification and localization, by comparing model-selected with dermatologist-selected explanations, and gradient-weighted class-activation maps with dermatologist explanation maps. Both DermX and DermX+ obtain an identification F1 score of 0.78. The localization F1 score is 0.39 for DermX and 0.35 for DermX+. Explanation faithfulness is assessed through contrasting samples, with DermX obtaining 0.53 faithfulness and DermX+ 0.25. These results show that explainability does not necessarily come at the expense of predictive power, as our high-performance models provide both plausible and faithful explanations for their diagnoses.

Hierarchical Few-Shot Generative Models

Oct 23, 2021

Giorgio Giannone, Ole Winther

Abstract: A few-shot generative model should be able to generate data from a distribution by only observing a limited set of examples. In few-shot learning, the model is trained on data from many sets from different distributions sharing some underlying properties, such as sets of characters from different alphabets or sets of images of different types of objects. We study a latent variables approach that extends the Neural Statistician to a fully hierarchical approach with an attention-based point to set-level aggregation. We extend the previous work to iterative data sampling, likelihood-based model comparison, and adaptation-free out-of-distribution generalization. Our results show that the hierarchical formulation better captures the intrinsic variability within the sets in the small data regime. With this work we generalize deep latent variable approaches to few-shot learning, taking a step towards large-scale few-shot generation with a formulation that can readily work with current state-of-the-art deep generative models.

* 5th Workshop on Meta-Learning at NeurIPS 2021

Calibrated Uncertainty for Molecular Property Prediction using Ensembles of Message Passing Neural Networks

Jul 13, 2021

Jonas Busk, Peter Bjørn Jørgensen, Arghya Bhowmik, Mikkel N. Schmidt, Ole Winther, Tejs Vegge

Abstract: Data-driven methods based on machine learning have the potential to accelerate analysis of atomic structures. However, machine learning models can produce overconfident predictions and it is therefore crucial to detect and handle uncertainty carefully. Here, we extend a message passing neural network designed specifically for predicting properties of molecules and materials with a calibrated probabilistic predictive distribution. The method presented in this paper differs from the previous work by considering both aleatoric and epistemic uncertainty in a unified framework, and by re-calibrating the predictive distribution on unseen data. Through computer experiments, we show that our approach results in accurate models for predicting molecular formation energies with calibrated uncertainty in and out of the training data distribution on two public molecular benchmark datasets, QM9 and PC9. The proposed method provides a general framework for training and evaluating neural network ensemble models that are able to produce accurate predictions of properties of molecules with calibrated uncertainty.
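
As a rough illustration of how an ensemble can separate the two kinds of uncertainty named in the abstract, the sketch below combines per-member Gaussian predictions by moment matching. This is the common deep-ensemble recipe and is an assumption here; the abstract does not give the paper's exact formulation, and the function name and example numbers are hypothetical. The re-calibration step is not shown.

```python
from statistics import fmean, pvariance


def ensemble_predictive(means, variances):
    """Combine per-member Gaussian predictions (assumed setup, not the paper's code).

    means:     predicted mean from each ensemble member
    variances: predicted noise variance from each ensemble member
    """
    mean = fmean(means)
    aleatoric = fmean(variances)   # average of the members' predicted noise variances
    epistemic = pvariance(means)   # spread of the members' means (disagreement)
    return mean, aleatoric, epistemic, aleatoric + epistemic


# Hypothetical predictions from three ensemble members for one molecule.
member_means = [1.0, 1.2, 0.8]
member_vars = [0.04, 0.05, 0.03]
mean, aleatoric, epistemic, total = ensemble_predictive(member_means, member_vars)
```

The total variance is the variance of an equally weighted mixture of the member Gaussians, so it grows both when members predict noisy targets and when they disagree with each other.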

Representation Learning for Out-Of-Distribution Generalization in Reinforcement Learning

Jul 12, 2021

Andrea Dittadi, Frederik Träuble, Manuel Wüthrich, Felix Widmaier, Peter Gehler, Ole Winther, Francesco Locatello, Olivier Bachem, Bernhard Schölkopf, Stefan Bauer

Abstract: Learning data representations that are useful for various downstream tasks is a cornerstone of artificial intelligence. While existing methods are typically evaluated on downstream tasks such as classification or generative image quality, we propose to assess representations through their usefulness in downstream control tasks, such as reaching or pushing objects. By training over 10,000 reinforcement learning policies, we extensively evaluate to what extent different representation properties affect out-of-distribution (OOD) generalization. Finally, we demonstrate zero-shot transfer of these policies from simulation to the real world, without any domain randomization or fine-tuning. This paper aims to establish the first systematic characterization of the usefulness of learned representations for real-world OOD downstream tasks.

Generalization and Robustness Implications in Object-Centric Learning

Jul 01, 2021

Andrea Dittadi, Samuele Papa, Michele De Vita, Bernhard Schölkopf, Ole Winther, Francesco Locatello

Abstract: The idea behind object-centric representation learning is that natural scenes can better be modeled as compositions of objects and their relations as opposed to distributed representations. This inductive bias can be injected into neural networks to potentially improve systematic generalization and learning efficiency of downstream tasks in scenes with multiple objects. In this paper, we train state-of-the-art unsupervised models on five common multi-object datasets and evaluate segmentation accuracy and downstream object property prediction. In addition, we study systematic generalization and robustness by investigating the settings where either single objects are out-of-distribution -- e.g., having unseen colors, textures, and shapes -- or global properties of the scene are altered -- e.g., by occlusions, cropping, or increasing the number of objects. From our experimental study, we find object-centric representations to be generally useful for downstream tasks and robust to shifts in the data distribution, especially if shifts affect single objects.

On the Transfer of Disentangled Representations in Realistic Settings

Oct 27, 2020

Andrea Dittadi, Frederik Träuble, Francesco Locatello, Manuel Wüthrich, Vaibhav Agrawal, Ole Winther, Stefan Bauer, Bernhard Schölkopf

Abstract: Learning meaningful representations that disentangle the underlying structure of the data generating process is considered to be of key importance in machine learning. While disentangled representations were found to be useful for diverse tasks such as abstract reasoning and fair classification, their scalability and real-world impact remain questionable. We introduce a new high-resolution dataset with 1M simulated images and over 1,800 annotated real-world images of the same robotic setup. In contrast to previous work, this new dataset exhibits correlations and a complex underlying structure, and allows evaluating transfer to unseen simulated and real-world settings where the encoder i) remains in distribution or ii) is out of distribution. We propose new architectures in order to scale disentangled representation learning to realistic high-resolution settings and conduct a large-scale empirical study of disentangled representations on this dataset. We observe that disentanglement is a good predictor for out-of-distribution (OOD) task performance.

Optimal Variance Control of the Score Function Gradient Estimator for Importance Weighted Bounds

Aug 05, 2020

Valentin Liévin, Andrea Dittadi, Anders Christensen, Ole Winther

Abstract: This paper introduces novel results for the score function gradient estimator of the importance weighted variational bound (IWAE). We prove that in the limit of large $K$ (number of importance samples) one can choose the control variate such that the signal-to-noise ratio (SNR) of the estimator grows as $\sqrt{K}$. This is in contrast to the standard pathwise gradient estimator where the SNR decreases as $1/\sqrt{K}$. Based on our theoretical findings we develop a novel control variate that extends VIMCO. Empirically, for the training of both continuous and discrete generative models, the proposed method yields superior variance reduction, resulting in an SNR for IWAE that increases with $K$ without relying on the reparameterization trick. The novel estimator is competitive with state-of-the-art reparameterization-free gradient estimators such as Reweighted Wake-Sleep (RWS) and the thermodynamic variational objective (TVO) when training generative models.
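
For readers unfamiliar with the bound itself, the sketch below computes a single-sample-set estimate of the importance weighted bound, $\log \frac{1}{K} \sum_{k=1}^{K} w_k$, from $K$ log importance weights $\log w_k = \log p(x, z_k) - \log q(z_k \mid x)$. It only evaluates the bound; it does not implement the gradient estimator or control variate the abstract describes. The function name and the example log weights are hypothetical.

```python
import math


def iwae_bound_estimate(log_weights):
    """Estimate log( (1/K) * sum_k w_k ) from K log importance weights.

    Subtracting the maximum before exponentiating keeps the sum
    numerically stable when the log weights are large in magnitude.
    """
    k = len(log_weights)
    m = max(log_weights)
    return m + math.log(sum(math.exp(lw - m) for lw in log_weights) / k)


# Hypothetical log importance weights for K = 4 samples.
log_w = [-10.2, -9.7, -11.5, -10.0]
bound = iwae_bound_estimate(log_w)

# With K = 1 the estimate reduces to the single log weight (the ELBO integrand).
single = iwae_bound_estimate([-10.2])
```

By Jensen's inequality the estimate is never smaller than the plain average of the log weights, which is why the bound tightens relative to the standard ELBO as $K$ grows.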

SurVAE Flows: Surjections to Bridge the Gap between VAEs and Flows

Jul 06, 2020

Didrik Nielsen, Priyank Jaini, Emiel Hoogeboom, Ole Winther, Max Welling

Abstract: Normalizing flows and variational autoencoders are powerful generative models that can represent complicated density functions. However, they both impose constraints on the models: Normalizing flows use bijective transformations to model densities whereas VAEs learn stochastic transformations that are non-invertible and thus typically do not provide tractable estimates of the marginal likelihood. In this paper, we introduce SurVAE Flows: A modular framework of composable transformations that encompasses VAEs and normalizing flows. SurVAE Flows bridge the gap between normalizing flows and VAEs with surjective transformations, wherein the transformations are deterministic in one direction -- thereby allowing exact likelihood computation, and stochastic in the reverse direction -- hence providing a lower bound on the corresponding likelihood. We show that several recently proposed methods, including dequantization and augmented normalizing flows, can be expressed as SurVAE Flows. Finally, we introduce common operations such as the max value, the absolute value, sorting and stochastic permutation as composable layers in SurVAE Flows.

Closing the Dequantization Gap: PixelCNN as a Single-Layer Flow

Feb 06, 2020

Didrik Nielsen, Ole Winther

Abstract: Flow models have recently made great progress at modeling quantized sensor data such as images and audio. Due to the continuous nature of flow models, dequantization is typically applied when using them for such quantized data. In this paper, we propose subset flows, a class of flows which can tractably transform subsets of the input space in one pass. As a result, they can be applied directly to quantized data without the need for dequantization. Based on this class of flows, we present a novel interpretation of several existing autoregressive models, including WaveNet and PixelCNN, as single-layer flow models defined through an invertible transformation between uniform noise and data samples. This interpretation suggests that these existing models 1) admit a latent representation of data and 2) can be stacked in multiple flow layers. We demonstrate this by exploring the latent space of a PixelCNN and by stacking PixelCNNs in multiple flow layers.
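
To make the dequantization step concrete, the sketch below shows uniform dequantization, the usual baseline in which noise drawn from [0, 1) is added to each integer value so that a continuous model can be fitted. This illustrates the step that subset flows are said to avoid, not the subset flows themselves; the function names and pixel values are hypothetical.

```python
import math
import random


def dequantize(pixels, rng):
    """Add uniform noise in [0, 1) to each integer pixel value."""
    return [p + rng.random() for p in pixels]


def quantize(values):
    """Map continuous values back to their integer bins by flooring."""
    return [math.floor(v) for v in values]


rng = random.Random(0)             # fixed seed so the example is repeatable
pixels = [0, 17, 128, 255]         # hypothetical 8-bit pixel intensities
continuous = dequantize(pixels, rng)
recovered = quantize(continuous)   # flooring undoes the added noise
```

Each dequantized value stays inside the unit-width bin of its original integer, so the continuous density integrated over that bin gives a probability for the discrete value.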
