Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Detecting and Diagnosing Adversarial Images with Class-Conditional Capsule Reconstructions

Jul 05, 2019

Yao Qin, Nicholas Frosst, Sara Sabour, Colin Raffel, Garrison Cottrell, Geoffrey Hinton

Figure 1 for Detecting and Diagnosing Adversarial Images with Class-Conditional Capsule Reconstructions

Figure 2 for Detecting and Diagnosing Adversarial Images with Class-Conditional Capsule Reconstructions

Figure 3 for Detecting and Diagnosing Adversarial Images with Class-Conditional Capsule Reconstructions

Figure 4 for Detecting and Diagnosing Adversarial Images with Class-Conditional Capsule Reconstructions

Share this with someone who'll enjoy it:

Abstract:Adversarial examples raise questions about whether neural network models are sensitive to the same visual features as humans. Most of the proposed methods for mitigating adversarial examples have subsequently been defeated by stronger attacks. Motivated by these issues, we take a different approach and propose to instead detect adversarial examples based on class-conditional reconstructions of the input. Our method uses the reconstruction network proposed as part of Capsule Networks (CapsNets), but is general enough to be applied to standard convolutional networks. We find that adversarial or otherwise corrupted images result in much larger reconstruction errors than normal inputs, prompting a simple detection method by thresholding the reconstruction error. Based on these findings, we propose the Reconstructive Attack which seeks both to cause a misclassification and a low reconstruction error. While this attack produces undetected adversarial examples, we find that for CapsNets the resulting perturbations can cause the images to appear visually more like the target class. This suggests that CapsNets utilize features that are more aligned with human perception and address the central issue raised by adversarial examples.

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Detecting and Diagnosing Adversarial Images with Class-Conditional Capsule Reconstructions

Paper and Code