Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Dimitris Tsipras

Tony

From ImageNet to Image Classification: Contextualizing Progress on Benchmarks

May 22, 2020

Dimitris Tsipras, Shibani Santurkar, Logan Engstrom, Andrew Ilyas, Aleksander Madry

Figure 1 for From ImageNet to Image Classification: Contextualizing Progress on Benchmarks

Figure 2 for From ImageNet to Image Classification: Contextualizing Progress on Benchmarks

Figure 3 for From ImageNet to Image Classification: Contextualizing Progress on Benchmarks

Figure 4 for From ImageNet to Image Classification: Contextualizing Progress on Benchmarks

Abstract:Building rich machine learning datasets in a scalable manner often necessitates a crowd-sourced data collection pipeline. In this work, we use human studies to investigate the consequences of employing such a pipeline, focusing on the popular ImageNet dataset. We study how specific design choices in the ImageNet creation process impact the fidelity of the resulting dataset---including the introduction of biases that state-of-the-art models exploit. Our analysis pinpoints how a noisy data collection pipeline can lead to a systematic misalignment between the resulting benchmark and the real-world task it serves as a proxy for. Finally, our findings emphasize the need to augment our current model training and evaluation toolkit to take such misalignments into account. To facilitate further research, we release our refined ImageNet annotations at https://github.com/MadryLab/ImageNetMultiLabel.

Via

Access Paper or Ask Questions

Identifying Statistical Bias in Dataset Replication

May 19, 2020

Logan Engstrom, Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Jacob Steinhardt, Aleksander Madry

Figure 1 for Identifying Statistical Bias in Dataset Replication

Figure 2 for Identifying Statistical Bias in Dataset Replication

Figure 3 for Identifying Statistical Bias in Dataset Replication

Figure 4 for Identifying Statistical Bias in Dataset Replication

Abstract:Dataset replication is a useful tool for assessing whether improvements in test accuracy on a specific benchmark correspond to improvements in models' ability to generalize reliably. In this work, we present unintuitive yet significant ways in which standard approaches to dataset replication introduce statistical bias, skewing the resulting observations. We study ImageNet-v2, a replication of the ImageNet dataset on which models exhibit a significant (11-14%) drop in accuracy, even after controlling for a standard human-in-the-loop measure of data quality. We show that after correcting for the identified statistical bias, only an estimated $3.6\% \pm 1.5\%$ of the original $11.7\% \pm 1.0\%$ accuracy drop remains unaccounted for. We conclude with concrete recommendations for recognizing and avoiding bias in dataset replication. Code for our study is publicly available at http://github.com/MadryLab/dataset-replication-analysis .

Via

Access Paper or Ask Questions

Label-Consistent Backdoor Attacks

Dec 06, 2019

Alexander Turner, Dimitris Tsipras, Aleksander Madry

Figure 1 for Label-Consistent Backdoor Attacks

Figure 2 for Label-Consistent Backdoor Attacks

Figure 3 for Label-Consistent Backdoor Attacks

Figure 4 for Label-Consistent Backdoor Attacks

Abstract:Deep neural networks have been demonstrated to be vulnerable to backdoor attacks. Specifically, by injecting a small number of maliciously constructed inputs into the training set, an adversary is able to plant a backdoor into the trained model. This backdoor can then be activated during inference by a backdoor trigger to fully control the model's behavior. While such attacks are very effective, they crucially rely on the adversary injecting arbitrary inputs that are---often blatantly---mislabeled. Such samples would raise suspicion upon human inspection, potentially revealing the attack. Thus, for backdoor attacks to remain undetected, it is crucial that they maintain label-consistency---the condition that injected inputs are consistent with their labels. In this work, we leverage adversarial perturbations and generative models to execute efficient, yet label-consistent, backdoor attacks. Our approach is based on injecting inputs that appear plausible, yet are hard to classify, hence causing the model to rely on the (easier-to-learn) backdoor trigger.

Via

Access Paper or Ask Questions

Computer Vision with a Single (Robust) Classifier

Jun 06, 2019

Shibani Santurkar, Dimitris Tsipras, Brandon Tran, Andrew Ilyas, Logan Engstrom, Aleksander Madry

Figure 1 for Computer Vision with a Single (Robust) Classifier

Figure 2 for Computer Vision with a Single (Robust) Classifier

Figure 3 for Computer Vision with a Single (Robust) Classifier

Figure 4 for Computer Vision with a Single (Robust) Classifier

Abstract:We show that the basic classification framework alone can be used to tackle some of the most challenging computer vision tasks. In contrast to other state-of-the-art approaches, the toolkit we develop is rather minimal: it uses a single, off-the-shelf classifier for all these tasks. The crux of our approach is that we train this classifier to be adversarially robust. It turns out that adversarial robustness is precisely what we need to directly manipulate salient features of the input. Overall, our findings demonstrate the utility of robustness in the broader machine learning context. Code and models for our experiments can be found at https://git.io/robust-apps.

Via

Access Paper or Ask Questions

Learning Perceptually-Aligned Representations via Adversarial Robustness

Jun 03, 2019

Logan Engstrom, Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Brandon Tran, Aleksander Madry

Figure 1 for Learning Perceptually-Aligned Representations via Adversarial Robustness

Figure 2 for Learning Perceptually-Aligned Representations via Adversarial Robustness

Figure 3 for Learning Perceptually-Aligned Representations via Adversarial Robustness

Figure 4 for Learning Perceptually-Aligned Representations via Adversarial Robustness

Abstract:Many applications of machine learning require models that are human-aligned, i.e., that make decisions based on human-meaningful information about the input. We identify the pervasive brittleness of deep networks' learned representations as a fundamental barrier to attaining this goal. We then re-cast robust optimization as a tool for enforcing human priors on the features learned by deep neural networks. The resulting robust feature representations turn out to be significantly more aligned with human perception. We leverage these representations to perform input interpolation, feature manipulation, and sensitivity mapping, without any post-processing or human intervention after model training. Our code and models for reproducing these results is available at https://git.io/robust-reps.

Via

Access Paper or Ask Questions

Adversarial Examples Are Not Bugs, They Are Features

May 07, 2019

Andrew Ilyas, Shibani Santurkar, Dimitris Tsipras, Logan Engstrom, Brandon Tran, Aleksander Madry

Figure 1 for Adversarial Examples Are Not Bugs, They Are Features

Figure 2 for Adversarial Examples Are Not Bugs, They Are Features

Figure 3 for Adversarial Examples Are Not Bugs, They Are Features

Figure 4 for Adversarial Examples Are Not Bugs, They Are Features

Abstract:Adversarial examples have attracted significant attention in machine learning, but the reasons for their existence and pervasiveness remain unclear. We demonstrate that adversarial examples can be directly attributed to the presence of non-robust features: features derived from patterns in the data distribution that are highly predictive, yet brittle and incomprehensible to humans. After capturing these features within a theoretical framework, we establish their widespread existence in standard datasets. Finally, we present a simple setting where we can rigorously tie the phenomena we observe in practice to a misalignment between the (human-specified) notion of robustness and the inherent geometry of the data.

Via

Access Paper or Ask Questions

On Evaluating Adversarial Robustness

Feb 20, 2019

Nicholas Carlini, Anish Athalye, Nicolas Papernot, Wieland Brendel, Jonas Rauber, Dimitris Tsipras, Ian Goodfellow, Aleksander Madry, Alexey Kurakin

Abstract:Correctly evaluating defenses against adversarial examples has proven to be extremely difficult. Despite the significant amount of recent work attempting to design defenses that withstand adaptive attacks, few have succeeded; most papers that propose defenses are quickly shown to be incorrect. We believe a large contributing factor is the difficulty of performing security evaluations. In this paper, we discuss the methodological foundations, review commonly accepted best practices, and suggest new methods for evaluating defenses to adversarial examples. We hope that both researchers developing defenses as well as readers and reviewers who wish to understand the completeness of an evaluation consider our advice in order to avoid common pitfalls.

* Living document; source available at https://github.com/evaluating-adversarial-robustness/adv-eval-paper/

Via

Access Paper or Ask Questions

Are Deep Policy Gradient Algorithms Truly Policy Gradient Algorithms?

Dec 02, 2018

Andrew Ilyas, Logan Engstrom, Shibani Santurkar, Dimitris Tsipras, Firdaus Janoos, Larry Rudolph, Aleksander Madry

Figure 1 for Are Deep Policy Gradient Algorithms Truly Policy Gradient Algorithms?

Figure 2 for Are Deep Policy Gradient Algorithms Truly Policy Gradient Algorithms?

Figure 3 for Are Deep Policy Gradient Algorithms Truly Policy Gradient Algorithms?

Figure 4 for Are Deep Policy Gradient Algorithms Truly Policy Gradient Algorithms?

Abstract:We study how the behavior of deep policy gradient algorithms reflects the conceptual framework motivating their development. We propose a fine-grained analysis of state-of-the-art methods based on key aspects of this framework: gradient estimation, value prediction, optimization landscapes, and trust region enforcement. We find that from this perspective, the behavior of deep policy gradient algorithms often deviates from what their motivating framework would predict. Our analysis suggests first steps towards solidifying the foundations of these algorithms, and in particular indicates that we may need to move beyond the current benchmark-centric evaluation methodology.

Via

Access Paper or Ask Questions

How Does Batch Normalization Help Optimization?

Oct 27, 2018

Shibani Santurkar, Dimitris Tsipras, Andrew Ilyas, Aleksander Madry

Figure 1 for How Does Batch Normalization Help Optimization?

Figure 2 for How Does Batch Normalization Help Optimization?

Figure 3 for How Does Batch Normalization Help Optimization?

Figure 4 for How Does Batch Normalization Help Optimization?

Abstract:Batch Normalization (BatchNorm) is a widely adopted technique that enables faster and more stable training of deep neural networks (DNNs). Despite its pervasiveness, the exact reasons for BatchNorm's effectiveness are still poorly understood. The popular belief is that this effectiveness stems from controlling the change of the layers' input distributions during training to reduce the so-called "internal covariate shift". In this work, we demonstrate that such distributional stability of layer inputs has little to do with the success of BatchNorm. Instead, we uncover a more fundamental impact of BatchNorm on the training process: it makes the optimization landscape significantly smoother. This smoothness induces a more predictive and stable behavior of the gradients, allowing for faster training.

* To appear in NIPS'18

Via

Access Paper or Ask Questions

Robustness May Be at Odds with Accuracy

Oct 11, 2018

Dimitris Tsipras, Shibani Santurkar, Logan Engstrom, Alexander Turner, Aleksander Madry

Figure 1 for Robustness May Be at Odds with Accuracy

Figure 2 for Robustness May Be at Odds with Accuracy

Figure 3 for Robustness May Be at Odds with Accuracy

Figure 4 for Robustness May Be at Odds with Accuracy

Abstract:We show that there exists an inherent tension between the goal of adversarial robustness and that of standard generalization. Specifically, training robust models may not only be more resource-consuming, but also lead to a reduction of standard accuracy. We demonstrate that this trade-off between the standard accuracy of a model and its robustness to adversarial perturbations provably exists even in a fairly simple and natural setting. These findings also corroborate a similar phenomenon observed in practice. Further, we argue that this phenomenon is a consequence of robust classifiers learning fundamentally different feature representations than standard classifiers. These differences, in particular, seem to result in unexpected benefits: the representations learned by robust models tend to align better with salient data characteristics and human perception.

Via

Access Paper or Ask Questions