Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Max Welling

UC Irvine

Defending Variational Autoencoders from Adversarial Attacks with MCMC

Mar 18, 2022

Anna Kuzina, Max Welling, Jakub M. Tomczak

Figure 1 for Defending Variational Autoencoders from Adversarial Attacks with MCMC

Figure 2 for Defending Variational Autoencoders from Adversarial Attacks with MCMC

Figure 3 for Defending Variational Autoencoders from Adversarial Attacks with MCMC

Figure 4 for Defending Variational Autoencoders from Adversarial Attacks with MCMC

Abstract:Variational autoencoders (VAEs) are deep generative models used in various domains. VAEs can generate complex objects and provide meaningful latent representations, which can be further used in downstream tasks such as classification. As previous work has shown, one can easily fool VAEs to produce unexpected latent representations and reconstructions for a visually slightly modified input. Here, we examine several objective functions for adversarial attacks construction, suggest metrics assess the model robustness, and propose a solution to alleviate the effect of an attack. Our method utilizes the Markov Chain Monte Carlo (MCMC) technique in the inference step and is motivated by our theoretical analysis. Thus, we do not incorporate any additional costs during training or we do not decrease the performance on non-attacked inputs. We validate our approach on a variety of datasets (MNIST, Fashion MNIST, Color MNIST, CelebA) and VAE configurations ($\beta$-VAE, NVAE, TC-VAE) and show that it consistently improves the model robustness to adversarial attacks.

Via

Access Paper or Ask Questions

Neural RF SLAM for unsupervised positioning and mapping with channel state information

Mar 15, 2022

Shreya Kadambi, Arash Behboodi, Joseph B. Soriaga, Max Welling, Roohollah Amiri, Srinivas Yerramalli, Taesang Yoo

Figure 1 for Neural RF SLAM for unsupervised positioning and mapping with channel state information

Figure 2 for Neural RF SLAM for unsupervised positioning and mapping with channel state information

Figure 3 for Neural RF SLAM for unsupervised positioning and mapping with channel state information

Figure 4 for Neural RF SLAM for unsupervised positioning and mapping with channel state information

Abstract:We present a neural network architecture for jointly learning user locations and environment mapping up to isometry, in an unsupervised way, from channel state information (CSI) values with no location information. The model is based on an encoder-decoder architecture. The encoder network maps CSI values to the user location. The decoder network models the physics of propagation by parametrizing the environment using virtual anchors. It aims at reconstructing, from the encoder output and virtual anchor location, the set of time of flights (ToFs) that are extracted from CSI using super-resolution methods. The neural network task is set prediction and is accordingly trained end-to-end. The proposed model learns an interpretable latent, i.e., user location, by just enforcing a physics-based decoder. It is shown that the proposed model achieves sub-meter accuracy on synthetic ray tracing based datasets with single anchor SISO setup while recovering the environment map up to 4cm median error in a 2D environment and 15cm in a 3D environment

* Accepted at IEEE International Conference on Communications 2022. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other work

Via

Access Paper or Ask Questions

Lie Point Symmetry Data Augmentation for Neural PDE Solvers

Feb 15, 2022

Johannes Brandstetter, Max Welling, Daniel E. Worrall

Figure 1 for Lie Point Symmetry Data Augmentation for Neural PDE Solvers

Figure 2 for Lie Point Symmetry Data Augmentation for Neural PDE Solvers

Figure 3 for Lie Point Symmetry Data Augmentation for Neural PDE Solvers

Figure 4 for Lie Point Symmetry Data Augmentation for Neural PDE Solvers

Abstract:Neural networks are increasingly being used to solve partial differential equations (PDEs), replacing slower numerical solvers. However, a critical issue is that neural PDE solvers require high-quality ground truth data, which usually must come from the very solvers they are designed to replace. Thus, we are presented with a proverbial chicken-and-egg problem. In this paper, we present a method, which can partially alleviate this problem, by improving neural PDE solver sample complexity -- Lie point symmetry data augmentation (LPSDA). In the context of PDEs, it turns out that we are able to quantitatively derive an exhaustive list of data transformations, based on the Lie point symmetry group of the PDEs in question, something not possible in other application areas. We present this framework and demonstrate how it can easily be deployed to improve neural PDE solver sample complexity by an order of magnitude.

Via

Access Paper or Ask Questions

Message Passing Neural PDE Solvers

Feb 07, 2022

Johannes Brandstetter, Daniel Worrall, Max Welling

Figure 1 for Message Passing Neural PDE Solvers

Figure 2 for Message Passing Neural PDE Solvers

Figure 3 for Message Passing Neural PDE Solvers

Figure 4 for Message Passing Neural PDE Solvers

Abstract:The numerical solution of partial differential equations (PDEs) is difficult, having led to a century of research so far. Recently, there have been pushes to build neural--numerical hybrid solvers, which piggy-backs the modern trend towards fully end-to-end learned systems. Most works so far can only generalize over a subset of properties to which a generic solver would be faced, including: resolution, topology, geometry, boundary conditions, domain discretization regularity, dimensionality, etc. In this work, we build a solver, satisfying these properties, where all the components are based on neural message passing, replacing all heuristically designed components in the computation graph with backprop-optimized neural function approximators. We show that neural message passing solvers representationally contain some classical methods, such as finite differences, finite volumes, and WENO schemes. In order to encourage stability in training autoregressive models, we put forward a method that is based on the principle of zero-stability, posing stability as a domain adaptation problem. We validate our method on various fluid-like flow problems, demonstrating fast, stable, and accurate performance across different domain topologies, discretization, etc. in 1D and 2D. Our model outperforms state-of-the-art numerical solvers in the low resolution regime in terms of speed and accuracy.

* Published at ICLR 2022

Via

Access Paper or Ask Questions

Particle Dynamics for Learning EBMs

Nov 26, 2021

Kirill Neklyudov, Priyank Jaini, Max Welling

Figure 1 for Particle Dynamics for Learning EBMs

Figure 2 for Particle Dynamics for Learning EBMs

Abstract:Energy-based modeling is a promising approach to unsupervised learning, which yields many downstream applications from a single model. The main difficulty in learning energy-based models with the "contrastive approaches" is the generation of samples from the current energy function at each iteration. Many advances have been made to accomplish this subroutine cheaply. Nevertheless, all such sampling paradigms run MCMC targeting the current model, which requires infinitely long chains to generate samples from the true energy distribution and is problematic in practice. This paper proposes an alternative approach to getting these samples and avoiding crude MCMC sampling from the current model. We accomplish this by viewing the evolution of the modeling distribution as (i) the evolution of the energy function, and (ii) the evolution of the samples from this distribution along some vector field. We subsequently derive this time-dependent vector field such that the particles following this field are approximately distributed as the current density model. Thereby we match the evolution of the particles with the evolution of the energy function prescribed by the learning procedure. Importantly, unlike Monte Carlo sampling, our method targets to match the current distribution in a finite time. Finally, we demonstrate its effectiveness empirically compared to MCMC-based learning methods.

Via

Access Paper or Ask Questions

An Expectation-Maximization Perspective on Federated Learning

Nov 19, 2021

Christos Louizos, Matthias Reisser, Joseph Soriaga, Max Welling

Figure 1 for An Expectation-Maximization Perspective on Federated Learning

Figure 2 for An Expectation-Maximization Perspective on Federated Learning

Figure 3 for An Expectation-Maximization Perspective on Federated Learning

Figure 4 for An Expectation-Maximization Perspective on Federated Learning

Abstract:Federated learning describes the distributed training of models across multiple clients while keeping the data private on-device. In this work, we view the server-orchestrated federated learning process as a hierarchical latent variable model where the server provides the parameters of a prior distribution over the client-specific model parameters. We show that with simple Gaussian priors and a hard version of the well known Expectation-Maximization (EM) algorithm, learning in such a model corresponds to FedAvg, the most popular algorithm for the federated learning setting. This perspective on FedAvg unifies several recent works in the field and opens up the possibility for extensions through different choices for the hierarchical model. Based on this view, we further propose a variant of the hierarchical model that employs prior distributions to promote sparsity. By similarly using the hard-EM algorithm for learning, we obtain FedSparse, a procedure that can learn sparse neural networks in the federated learning setting. FedSparse reduces communication costs from client to server and vice-versa, as well as the computational costs for inference with the sparsified network - both of which are of great practical importance in federated learning.

Via

Access Paper or Ask Questions

Modeling Category-Selective Cortical Regions with Topographic Variational Autoencoders

Oct 25, 2021

T. Anderson Keller, Qinghe Gao, Max Welling

Figure 1 for Modeling Category-Selective Cortical Regions with Topographic Variational Autoencoders

Figure 2 for Modeling Category-Selective Cortical Regions with Topographic Variational Autoencoders

Figure 3 for Modeling Category-Selective Cortical Regions with Topographic Variational Autoencoders

Figure 4 for Modeling Category-Selective Cortical Regions with Topographic Variational Autoencoders

Abstract:Category-selectivity in the brain describes the observation that certain spatially localized areas of the cerebral cortex tend to respond robustly and selectively to stimuli from specific limited categories. One of the most well known examples of category-selectivity is the Fusiform Face Area (FFA), an area of the inferior temporal cortex in primates which responds preferentially to images of faces when compared with objects or other generic stimuli. In this work, we leverage the newly introduced Topographic Variational Autoencoder to model of the emergence of such localized category-selectivity in an unsupervised manner. Experimentally, we demonstrate our model yields spatially dense neural clusters selective to faces, bodies, and places through visualized maps of Cohen's d metric. We compare our model with related supervised approaches, namely the TDANN, and discuss both theoretical and empirical similarities. Finally, we show preliminary results suggesting that our model yields a nested spatial hierarchy of increasingly abstract categories, analogous to observations from the human ventral temporal cortex.

Via

Access Paper or Ask Questions

Multi-Agent MDP Homomorphic Networks

Oct 09, 2021

Elise van der Pol, Herke van Hoof, Frans A. Oliehoek, Max Welling

Figure 1 for Multi-Agent MDP Homomorphic Networks

Figure 2 for Multi-Agent MDP Homomorphic Networks

Figure 3 for Multi-Agent MDP Homomorphic Networks

Figure 4 for Multi-Agent MDP Homomorphic Networks

Abstract:This paper introduces Multi-Agent MDP Homomorphic Networks, a class of networks that allows distributed execution using only local information, yet is able to share experience between global symmetries in the joint state-action space of cooperative multi-agent systems. In cooperative multi-agent systems, complex symmetries arise between different configurations of the agents and their local observations. For example, consider a group of agents navigating: rotating the state globally results in a permutation of the optimal joint policy. Existing work on symmetries in single agent reinforcement learning can only be generalized to the fully centralized setting, because such approaches rely on the global symmetry in the full state-action spaces, and these can result in correspondences across agents. To encode such symmetries while still allowing distributed execution we propose a factorization that decomposes global symmetries into local transformations. Our proposed factorization allows for distributing the computation that enforces global symmetries over local agents and local interactions. We introduce a multi-agent equivariant policy network based on this factorization. We show empirically on symmetric multi-agent problems that distributed execution of globally symmetric policies improves data efficiency compared to non-equivariant baselines.

Via

Access Paper or Ask Questions

Geometric and Physical Quantities improve E(3) Equivariant Message Passing

Oct 06, 2021

Johannes Brandstetter, Rob Hesselink, Elise van der Pol, Erik Bekkers, Max Welling

Figure 1 for Geometric and Physical Quantities improve E(3) Equivariant Message Passing

Figure 2 for Geometric and Physical Quantities improve E(3) Equivariant Message Passing

Figure 3 for Geometric and Physical Quantities improve E(3) Equivariant Message Passing

Figure 4 for Geometric and Physical Quantities improve E(3) Equivariant Message Passing

Abstract:Including covariant information, such as position, force, velocity or spin is important in many tasks in computational physics and chemistry. We introduce Steerable E(3) Equivariant Graph Neural Networks (SEGNNs) that generalise equivariant graph networks, such that node and edge attributes are not restricted to invariant scalars, but can contain covariant information, such as vectors or tensors. This model, composed of steerable MLPs, is able to incorporate geometric and physical information in both the message and update functions. Through the definition of steerable node attributes, the MLPs provide a new class of activation functions for general use with steerable feature fields. We discuss ours and related work through the lens of equivariant non-linear convolutions, which further allows us to pin-point the successful components of SEGNNs: non-linear message aggregation improves upon classic linear (steerable) point convolutions; steerable messages improve upon recent equivariant graph networks that send invariant messages. We demonstrate the effectiveness of our method on several tasks in computational physics and chemistry and provide extensive ablation studies.

Via

Access Paper or Ask Questions

Neural Augmentation of Kalman Filter with Hypernetwork for Channel Tracking

Sep 26, 2021

Kumar Pratik, Rana Ali Amjad, Arash Behboodi, Joseph B. Soriaga, Max Welling

Figure 1 for Neural Augmentation of Kalman Filter with Hypernetwork for Channel Tracking

Figure 2 for Neural Augmentation of Kalman Filter with Hypernetwork for Channel Tracking

Figure 3 for Neural Augmentation of Kalman Filter with Hypernetwork for Channel Tracking

Figure 4 for Neural Augmentation of Kalman Filter with Hypernetwork for Channel Tracking

Abstract:We propose Hypernetwork Kalman Filter (HKF) for tracking applications with multiple different dynamics. The HKF combines generalization power of Kalman filters with expressive power of neural networks. Instead of keeping a bank of Kalman filters and choosing one based on approximating the actual dynamics, HKF adapts itself to each dynamics based on the observed sequence. Through extensive experiments on CDL-B channel model, we show that the HKF can be used for tracking the channel over a wide range of Doppler values, matching Kalman filter performance with genie Doppler information. At high Doppler values, it achieves around 2dB gain over genie Kalman filter. The HKF generalizes well to unseen Doppler, SNR values and pilot patterns unlike LSTM, which suffers from severe performance degradation.

* Accepted at IEEE Globecom 2021. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

Via

Access Paper or Ask Questions