Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jan Hendrik Metzen

Test-Time Adaptation to Distribution Shift by Confidence Maximization and Input Transformation

Jun 28, 2021
Chaithanya Kumar Mummadi, Robin Hutmacher, Kilian Rambach, Evgeny Levinkov, Thomas Brox, Jan Hendrik Metzen

Figure 1 for Test-Time Adaptation to Distribution Shift by Confidence Maximization and Input Transformation

Figure 2 for Test-Time Adaptation to Distribution Shift by Confidence Maximization and Input Transformation

Figure 3 for Test-Time Adaptation to Distribution Shift by Confidence Maximization and Input Transformation

Figure 4 for Test-Time Adaptation to Distribution Shift by Confidence Maximization and Input Transformation

Deep neural networks often exhibit poor performance on data that is unlikely under the train-time data distribution, for instance data affected by corruptions. Previous works demonstrate that test-time adaptation to data shift, for instance using entropy minimization, effectively improves performance on such shifted distributions. This paper focuses on the fully test-time adaptation setting, where only unlabeled data from the target distribution is required. This allows adapting arbitrary pretrained networks. Specifically, we propose a novel loss that improves test-time adaptation by addressing both premature convergence and instability of entropy minimization. This is achieved by replacing the entropy by a non-saturating surrogate and adding a diversity regularizer based on batch-wise entropy maximization that prevents convergence to trivial collapsed solutions. Moreover, we propose to prepend an input transformation module to the network that can partially undo test-time distribution shifts. Surprisingly, this preprocessing can be learned solely using the fully test-time adaptation loss in an end-to-end fashion without any target domain labels or source domain data. We show that our approach outperforms previous work in improving the robustness of publicly available pretrained image classifiers to common corruptions on such challenging benchmarks as ImageNet-C.

* 16 pages, 5 figures, 7 tables

Via

Access Paper or Ask Questions

Does enhanced shape bias improve neural network robustness to common corruptions?

Apr 20, 2021
Chaithanya Kumar Mummadi, Ranjitha Subramaniam, Robin Hutmacher, Julien Vitay, Volker Fischer, Jan Hendrik Metzen

Figure 1 for Does enhanced shape bias improve neural network robustness to common corruptions?

Figure 2 for Does enhanced shape bias improve neural network robustness to common corruptions?

Figure 3 for Does enhanced shape bias improve neural network robustness to common corruptions?

Figure 4 for Does enhanced shape bias improve neural network robustness to common corruptions?

Convolutional neural networks (CNNs) learn to extract representations of complex features, such as object shapes and textures to solve image recognition tasks. Recent work indicates that CNNs trained on ImageNet are biased towards features that encode textures and that these alone are sufficient to generalize to unseen test data from the same distribution as the training data but often fail to generalize to out-of-distribution data. It has been shown that augmenting the training data with different image styles decreases this texture bias in favor of increased shape bias while at the same time improving robustness to common corruptions, such as noise and blur. Commonly, this is interpreted as shape bias increasing corruption robustness. However, this relationship is only hypothesized. We perform a systematic study of different ways of composing inputs based on natural images, explicit edge information, and stylization. While stylization is essential for achieving high corruption robustness, we do not find a clear correlation between shape bias and robustness. We conclude that the data augmentation caused by style-variation accounts for the improved corruption robustness and increased shape bias is only a byproduct.

* 20 pages, 9 figures, 12 tables, accepted at ICLR 2021

Via

Access Paper or Ask Questions

Efficient Certified Defenses Against Patch Attacks on Image Classifiers

Feb 08, 2021
Jan Hendrik Metzen, Maksym Yatsura

Figure 1 for Efficient Certified Defenses Against Patch Attacks on Image Classifiers

Figure 2 for Efficient Certified Defenses Against Patch Attacks on Image Classifiers

Figure 3 for Efficient Certified Defenses Against Patch Attacks on Image Classifiers

Figure 4 for Efficient Certified Defenses Against Patch Attacks on Image Classifiers

Adversarial patches pose a realistic threat model for physical world attacks on autonomous systems via their perception component. Autonomous systems in safety-critical domains such as automated driving should thus contain a fail-safe fallback component that combines certifiable robustness against patches with efficient inference while maintaining high performance on clean inputs. We propose BagCert, a novel combination of model architecture and certification procedure that allows efficient certification. We derive a loss that enables end-to-end optimization of certified robustness against patches of different sizes and locations. On CIFAR10, BagCert certifies 10.000 examples in 43 seconds on a single GPU and obtains 86% clean and 60% certified accuracy against 5x5 patches.

* accepted at ICLR 2021

Via

Access Paper or Ask Questions

Meta Adversarial Training

Jan 27, 2021
Jan Hendrik Metzen, Nicole Finnie, Robin Hutmacher

Recently demonstrated physical-world adversarial attacks have exposed vulnerabilities in perception systems that pose severe risks for safety-critical applications such as autonomous driving. These attacks place adversarial artifacts in the physical world that indirectly cause the addition of universal perturbations to inputs of a model that can fool it in a variety of contexts. Adversarial training is the most effective defense against image-dependent adversarial attacks. However, tailoring adversarial training to universal perturbations is computationally expensive since the optimal universal perturbations depend on the model weights which change during training. We propose meta adversarial training (MAT), a novel combination of adversarial training with meta-learning, which overcomes this challenge by meta-learning universal perturbations along with model training. MAT requires little extra computation while continuously adapting a large set of perturbations to the current model. We present results for universal patch and universal perturbation attacks on image classification and traffic-light detection. MAT considerably increases robustness against universal patch attacks compared to prior work.

Via

Access Paper or Ask Questions

Increasing the Robustness of Semantic Segmentation Models with Painting-by-Numbers

Oct 12, 2020
Christoph Kamann, Burkhard Güssefeld, Robin Hutmacher, Jan Hendrik Metzen, Carsten Rother

Figure 1 for Increasing the Robustness of Semantic Segmentation Models with Painting-by-Numbers

Figure 2 for Increasing the Robustness of Semantic Segmentation Models with Painting-by-Numbers

Figure 3 for Increasing the Robustness of Semantic Segmentation Models with Painting-by-Numbers

Figure 4 for Increasing the Robustness of Semantic Segmentation Models with Painting-by-Numbers

For safety-critical applications such as autonomous driving, CNNs have to be robust with respect to unavoidable image corruptions, such as image noise. While previous works addressed the task of robust prediction in the context of full-image classification, we consider it for dense semantic segmentation. We build upon an insight from image classification that output robustness can be improved by increasing the network-bias towards object shapes. We present a new training schema that increases this shape bias. Our basic idea is to alpha-blend a portion of the RGB training images with faked images, where each class-label is given a fixed, randomly chosen color that is not likely to appear in real imagery. This forces the network to rely more strongly on shape cues. We call this data augmentation technique ``Painting-by-Numbers''. We demonstrate the effectiveness of our training schema for DeepLabv3+ with various network backbones, MobileNet-V2, ResNets, and Xception, and evaluate it on the Cityscapes dataset. With respect to our 16 different types of image corruptions and 5 different network backbones, we are in 74% better than training with clean data. For cases where we are worse than a model trained without our training schema, it is mostly only marginally worse. However, for some image corruptions such as images with noise, we see a considerable performance gain of up to 25%.

Via

Access Paper or Ask Questions

Adversarial and Natural Perturbations for General Robustness

Oct 03, 2020
Sadaf Gulshad, Jan Hendrik Metzen, Arnold Smeulders

Figure 1 for Adversarial and Natural Perturbations for General Robustness

Figure 2 for Adversarial and Natural Perturbations for General Robustness

Figure 3 for Adversarial and Natural Perturbations for General Robustness

Figure 4 for Adversarial and Natural Perturbations for General Robustness

In this paper we aim to explore the general robustness of neural network classifiers by utilizing adversarial as well as natural perturbations. Different from previous works which mainly focus on studying the robustness of neural networks against adversarial perturbations, we also evaluate their robustness on natural perturbations before and after robustification. After standardizing the comparison between adversarial and natural perturbations, we demonstrate that although adversarial training improves the performance of the networks against adversarial perturbations, it leads to drop in the performance for naturally perturbed samples besides clean samples. In contrast, natural perturbations like elastic deformations, occlusions and wave does not only improve the performance against natural perturbations, but also lead to improvement in the performance for the adversarial perturbations. Additionally they do not drop the accuracy on the clean images.

* Currently under review

Via

Access Paper or Ask Questions

Meta-Learning of Neural Architectures for Few-Shot Learning

Nov 25, 2019
Thomas Elsken, Benedikt Staffler, Jan Hendrik Metzen, Frank Hutter

Figure 1 for Meta-Learning of Neural Architectures for Few-Shot Learning

Figure 2 for Meta-Learning of Neural Architectures for Few-Shot Learning

Figure 3 for Meta-Learning of Neural Architectures for Few-Shot Learning

Figure 4 for Meta-Learning of Neural Architectures for Few-Shot Learning

The recent progress in neural architectures search (NAS) has allowed scaling the automated design of neural architectures to real-world domains such as object detection and semantic segmentation. However, one prerequisite for the application of NAS are large amounts of labeled data and compute resources. This renders its application challenging in few-shot learning scenarios, where many related tasks need to be learned, each with limited amounts of data and compute time. Thus, few-shot learning is typically done with a fixed neural architecture. To improve upon this, we propose MetaNAS, the first method which fully integrates NAS with gradient-based meta-learning. MetaNAS optimizes a meta-architecture along with the meta-weights during meta-training. During meta-testing, architectures can be adapted to a novel task with a few steps of the task optimizer, that is: task adaptation becomes computationally cheap and requires only little data per task. Moreover, MetaNAS is agnostic in that it can be used with arbitrary model-agnostic meta-learning algorithms and arbitrary gradient-based NAS methods. Empirical results on standard few-shot classification benchmarks show that MetaNAS with a combination of DARTS and REPTILE yields state-of-the-art results.

Via

Access Paper or Ask Questions

Understanding Misclassifications by Attributes

Oct 15, 2019
Sadaf Gulshad, Zeynep Akata, Jan Hendrik Metzen, Arnold Smeulders

Figure 1 for Understanding Misclassifications by Attributes

Figure 2 for Understanding Misclassifications by Attributes

Figure 3 for Understanding Misclassifications by Attributes

Figure 4 for Understanding Misclassifications by Attributes

In this paper, we aim to understand and explain the decisions of deep neural networks by studying the behavior of predicted attributes when adversarial examples are introduced. We study the changes in attributes for clean as well as adversarial images in both standard and adversarially robust networks. We propose a metric to quantify the robustness of an adversarially robust network against adversarial attacks. In a standard network, attributes predicted for adversarial images are consistent with the wrong class, while attributes predicted for the clean images are consistent with the true class. In an adversarially robust network, the attributes predicted for adversarial images classified correctly are consistent with the true class. Finally, we show that the ability to robustify a network varies for different datasets. For the fine grained dataset, it is higher as compared to the coarse-grained dataset. Additionally, the ability to robustify a network increases with the increase in adversarial noise.

* arXiv admin note: substantial text overlap with arXiv:1904.08279

Via

Access Paper or Ask Questions

Interpreting Adversarial Examples with Attributes

Apr 17, 2019
Sadaf Gulshad, Jan Hendrik Metzen, Arnold Smeulders, Zeynep Akata

Figure 1 for Interpreting Adversarial Examples with Attributes

Figure 2 for Interpreting Adversarial Examples with Attributes

Figure 3 for Interpreting Adversarial Examples with Attributes

Figure 4 for Interpreting Adversarial Examples with Attributes

Deep computer vision systems being vulnerable to imperceptible and carefully crafted noise have raised questions regarding the robustness of their decisions. We take a step back and approach this problem from an orthogonal direction. We propose to enable black-box neural networks to justify their reasoning both for clean and for adversarial examples by leveraging attributes, i.e. visually discriminative properties of objects. We rank attributes based on their class relevance, i.e. how the classification decision changes when the input is visually slightly perturbed, as well as image relevance, i.e. how well the attributes can be localized on both clean and perturbed images. We present comprehensive experiments for attribute prediction, adversarial example generation, adversarially robust learning, and their qualitative and quantitative analysis using predicted attributes on three benchmark datasets.

Via

Access Paper or Ask Questions

Defending against Universal Perturbations with Shared Adversarial Training

Dec 10, 2018
Chaithanya Kumar Mummadi, Thomas Brox, Jan Hendrik Metzen

Figure 1 for Defending against Universal Perturbations with Shared Adversarial Training

Figure 2 for Defending against Universal Perturbations with Shared Adversarial Training

Figure 3 for Defending against Universal Perturbations with Shared Adversarial Training

Figure 4 for Defending against Universal Perturbations with Shared Adversarial Training

Classifiers such as deep neural networks have been shown to be vulnerable against adversarial perturbations on problems with high-dimensional input space. While adversarial training improves the robustness of image classifiers against such adversarial perturbations, it leaves them sensitive to perturbations on a non-negligible fraction of the inputs. In this work, we show that adversarial training is more effective in preventing universal perturbations, where the same perturbation needs to fool a classifier on many inputs. Moreover, we investigate the trade-off between robustness against universal perturbations and performance on unperturbed data and propose an extension of adversarial training that handles this trade-off more gracefully. We present results for image classification and semantic segmentation to showcase that universal perturbations that fool a model hardened with adversarial training become clearly perceptible and show patterns of the target scene.

Via

Access Paper or Ask Questions