Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lars Kai Hansen

IROF: a low resource evaluation metric for explanation methods

Mar 09, 2020

Laura Rieger, Lars Kai Hansen

Figure 1 for IROF: a low resource evaluation metric for explanation methods

Figure 2 for IROF: a low resource evaluation metric for explanation methods

Figure 3 for IROF: a low resource evaluation metric for explanation methods

Figure 4 for IROF: a low resource evaluation metric for explanation methods

Abstract:The adoption of machine learning in health care hinges on the transparency of the used algorithms, necessitating the need for explanation methods. However, despite a growing literature on explaining neural networks, no consensus has been reached on how to evaluate those explanation methods. We propose IROF, a new approach to evaluating explanation methods that circumvents the need for manual evaluation. Compared to other recent work, our approach requires several orders of magnitude less computational resources and no human input, making it accessible to lower resource groups and robust to human bias.

Via

Access Paper or Ask Questions

Towards Federated Learning: Robustness Analytics to Data Heterogeneity

Feb 12, 2020

Jia Qian, Xenofon Fafoutis, Lars Kai Hansen

Figure 1 for Towards Federated Learning: Robustness Analytics to Data Heterogeneity

Figure 2 for Towards Federated Learning: Robustness Analytics to Data Heterogeneity

Figure 3 for Towards Federated Learning: Robustness Analytics to Data Heterogeneity

Figure 4 for Towards Federated Learning: Robustness Analytics to Data Heterogeneity

Abstract:Federated Learning allows remote centralized server training models without to access the data stored in distributed (edge) devices. Most work assume the data generated from edge devices is identically and independently sampled from a common population distribution. However, such ideal sampling may not be realistic in many contexts where edge devices correspond to units in variable context. Also, models based on intrinsic agency, such as active sampling schemes, may lead to highly biased sampling. So an imminent question is how robust Federated Learning is to biased sampling? In this work, we investigate two such scenarios. First, we study Federated Learning of a classifier from data with edge device class distribution heterogeneity. Second, we study Federated Learning of a classifier with active sampling at the edge. We present evidence in both scenarios, that federated learning is robust to data heterogeneity.

Via

Access Paper or Ask Questions

Active Learning Solution on Distributed Edge Computing

Jun 25, 2019

Jia Qian, Sayantan Sengupta, Lars Kai Hansen

Figure 1 for Active Learning Solution on Distributed Edge Computing

Figure 2 for Active Learning Solution on Distributed Edge Computing

Figure 3 for Active Learning Solution on Distributed Edge Computing

Figure 4 for Active Learning Solution on Distributed Edge Computing

Abstract:Industry 4.0 becomes possible through the convergence between Operational and Information Technologies. All the requirements to realize the convergence is integrated on the Fog Platform. Fog Platform is introduced between the cloud server and edge devices when the unprecedented generation of data causes the burden of the cloud server, leading the ineligible latency. In this new paradigm, we divide the computation tasks and push it down to edge devices. Furthermore, local computing (at edge side) may improve privacy and trust. To address these problems, we present a new method, in which we decompose the data aggregation and processing, by dividing them between edge devices and fog nodes intelligently. We apply active learning on edge devices; and federated learning on the fog node which significantly reduces the data samples to train the model as well as the communication cost. To show the effectiveness of the proposed method, we implemented and evaluated its performance for an image classification task. In addition, we consider two settings: massively distributed and non-massively distributed and offer the corresponding solutions.

Via

Access Paper or Ask Questions

Phase transition in PCA with missing data: Reduced signal-to-noise ratio, not sample size!

May 02, 2019

Niels Bruun Ipsen, Lars Kai Hansen

Figure 1 for Phase transition in PCA with missing data: Reduced signal-to-noise ratio, not sample size!

Figure 2 for Phase transition in PCA with missing data: Reduced signal-to-noise ratio, not sample size!

Figure 3 for Phase transition in PCA with missing data: Reduced signal-to-noise ratio, not sample size!

Figure 4 for Phase transition in PCA with missing data: Reduced signal-to-noise ratio, not sample size!

Abstract:How does missing data affect our ability to learn signal structures? It has been shown that learning signal structure in terms of principal components is dependent on the ratio of sample size and dimensionality and that a critical number of observations is needed before learning starts (Biehl and Mietzner, 1993). Here we generalize this analysis to include missing data. Probabilistic principal component analysis is regularly used for estimating signal structures in datasets with missing data. Our analytic result suggests that the effect of missing data is to effectively reduce signal-to-noise ratio rather than - as generally believed - to reduce sample size. The theory predicts a phase transition in the learning curves and this is indeed found both in simulation data and in real datasets.

* Accepted to ICML 2019. This version is the submitted paper

Via

Access Paper or Ask Questions

Aggregating explainability methods for neural networks stabilizes explanations

Mar 01, 2019

Laura Rieger, Lars Kai Hansen

Figure 1 for Aggregating explainability methods for neural networks stabilizes explanations

Figure 2 for Aggregating explainability methods for neural networks stabilizes explanations

Figure 3 for Aggregating explainability methods for neural networks stabilizes explanations

Figure 4 for Aggregating explainability methods for neural networks stabilizes explanations

Abstract:Despite a growing literature on explaining neural networks, no consensus has been reached on how to explain a neural network decision or how to evaluate an explanation. In fact, most works rely on manually assessing the explanation to evaluate the quality of a method. This injects uncertainty in the explanation process along several dimensions: Which explanation method to apply? Who should we ask to evaluate it and which criteria should be used for the evaluation? Our contributions in this paper are twofold. First, we investigate schemes to combine explanation methods and reduce model uncertainty to obtain a single aggregated explanation. Our findings show that the aggregation is more robust, well-aligned with human explanations and can attribute relevance to a broader set of features (completeness). Second, we propose a novel way of evaluating explanation methods that circumvents the need for manual evaluation and is not reliant on the alignment of neural networks and humans decision processes.

Via

Access Paper or Ask Questions

Multi-View Bayesian Correlated Component Analysis

Feb 07, 2018

Simon Kamronn, Andreas Trier Poulsen, Lars Kai Hansen

Abstract:Correlated component analysis as proposed by Dmochowski et al. (2012) is a tool for investigating brain process similarity in the responses to multiple views of a given stimulus. Correlated components are identified under the assumption that the involved spatial networks are identical. Here we propose a hierarchical probabilistic model that can infer the level of universality in such multi-view data, from completely unrelated representations, corresponding to canonical correlation analysis, to identical representations as in correlated component analysis. This new model, which we denote Bayesian correlated component analysis, evaluates favourably against three relevant algorithms in simulated data. A well-established benchmark EEG dataset is used to further validate the new model and infer the variability of spatial representations across multiple subjects.

* Neural Computation, 27, (10):220730, 2015

Via

Access Paper or Ask Questions

Latent Space Oddity: on the Curvature of Deep Generative Models

Jan 31, 2018

Georgios Arvanitidis, Lars Kai Hansen, Søren Hauberg

Figure 1 for Latent Space Oddity: on the Curvature of Deep Generative Models

Figure 2 for Latent Space Oddity: on the Curvature of Deep Generative Models

Figure 3 for Latent Space Oddity: on the Curvature of Deep Generative Models

Figure 4 for Latent Space Oddity: on the Curvature of Deep Generative Models

Abstract:Deep generative models provide a systematic way to learn nonlinear data distributions, through a set of latent variables and a nonlinear "generator" function that maps latent points into the input space. The nonlinearity of the generator imply that the latent space gives a distorted view of the input space. Under mild conditions, we show that this distortion can be characterized by a stochastic Riemannian metric, and demonstrate that distances and interpolants are significantly improved under this metric. This in turn improves probability distributions, sampling algorithms and clustering in the latent space. Our geometric analysis further reveals that current generators provide poor variance estimates and we propose a new generator architecture with vastly improved variance estimates. Results are demonstrated on convolutional and fully connected variational autoencoders, but the formalism easily generalize to other deep generative models.

* Accepted at International Conference on Learning Representations (ICLR) 2018

Via

Access Paper or Ask Questions

Bayesian inference for spatio-temporal spike-and-slab priors

Dec 01, 2017

Michael Riis Andersen, Aki Vehtari, Ole Winther, Lars Kai Hansen

Figure 1 for Bayesian inference for spatio-temporal spike-and-slab priors

Figure 2 for Bayesian inference for spatio-temporal spike-and-slab priors

Figure 3 for Bayesian inference for spatio-temporal spike-and-slab priors

Figure 4 for Bayesian inference for spatio-temporal spike-and-slab priors

Abstract:In this work, we address the problem of solving a series of underdetermined linear inverse problems subject to a sparsity constraint. We generalize the spike-and-slab prior distribution to encode a priori correlation of the support of the solution in both space and time by imposing a transformed Gaussian process on the spike-and-slab probabilities. An expectation propagation (EP) algorithm for posterior inference under the proposed model is derived. For large scale problems, the standard EP algorithm can be prohibitively slow. We therefore introduce three different approximation schemes to reduce the computational complexity. Finally, we demonstrate the proposed model using numerical experiments based on both synthetic and real data sets.

* Journal of Machine Learning Research, 18(139):1-58, 2017
* 58 pages, 17 figures

Via

Access Paper or Ask Questions

Adaptive Smoothing in fMRI Data Processing Neural Networks

Oct 02, 2017

Albert Vilamala, Kristoffer Hougaard Madsen, Lars Kai Hansen

Figure 1 for Adaptive Smoothing in fMRI Data Processing Neural Networks

Figure 2 for Adaptive Smoothing in fMRI Data Processing Neural Networks

Figure 3 for Adaptive Smoothing in fMRI Data Processing Neural Networks

Figure 4 for Adaptive Smoothing in fMRI Data Processing Neural Networks

Abstract:Functional Magnetic Resonance Imaging (fMRI) relies on multi-step data processing pipelines to accurately determine brain activity; among them, the crucial step of spatial smoothing. These pipelines are commonly suboptimal, given the local optimisation strategy they use, treating each step in isolation. With the advent of new tools for deep learning, recent work has proposed to turn these pipelines into end-to-end learning networks. This change of paradigm offers new avenues to improvement as it allows for a global optimisation. The current work aims at benefitting from this paradigm shift by defining a smoothing step as a layer in these networks able to adaptively modulate the degree of smoothing required by each brain volume to better accomplish a given data analysis task. The viability is evaluated on real fMRI data where subjects did alternate between left and right finger tapping tasks.

* 4 pages, 3 figures, 1 table, IEEE 2017 International Workshop on Pattern Recognition in Neuroimaging (PRNI)

Via

Access Paper or Ask Questions

Towards end-to-end optimisation of functional image analysis pipelines

Oct 13, 2016

Albert Vilamala, Kristoffer Hougaard Madsen, Lars Kai Hansen

Figure 1 for Towards end-to-end optimisation of functional image analysis pipelines

Figure 2 for Towards end-to-end optimisation of functional image analysis pipelines

Figure 3 for Towards end-to-end optimisation of functional image analysis pipelines

Abstract:The study of neurocognitive tasks requiring accurate localisation of activity often rely on functional Magnetic Resonance Imaging, a widely adopted technique that makes use of a pipeline of data processing modules, each involving a variety of parameters. These parameters are frequently set according to the local goal of each specific module, not accounting for the rest of the pipeline. Given recent success of neural network research in many different domains, we propose to convert the whole data pipeline into a deep neural network, where the parameters involved are jointly optimised by the network to best serve a common global goal. As a proof of concept, we develop a module able to adaptively apply the most suitable spatial smoothing to every brain volume for each specific neuroimaging task, and we validate its results in a standard brain decoding experiment.

* 7 pages, 2 figures

Via

Access Paper or Ask Questions