Title: RAID: Randomized Adversarial-Input Detection for Neural Networks

Feb 07, 2020

Hasan Ferit Eniser, Maria Christakis, Valentin Wüstholz

Abstract: In recent years, neural networks have become the default choice for image classification and many other learning tasks, even though they are vulnerable to so-called adversarial attacks. To increase their robustness against these attacks, numerous detection mechanisms have emerged that aim to automatically determine whether an input is adversarial. However, state-of-the-art detection mechanisms either rely on being tuned for each type of attack or do not generalize across different attack types. To alleviate these issues, we propose a novel technique for adversarial-image detection, RAID, that trains a secondary classifier to identify differences in neuron activation values between benign and adversarial inputs. Our technique is both more reliable and more effective than the state of the art when evaluated against six popular attacks. Moreover, a straightforward extension of RAID increases its robustness against detection-aware adversaries without affecting its effectiveness.
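
The core idea in the abstract, a secondary classifier trained on neuron activation values, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the one-layer toy network and its weights, the use of logistic regression as the secondary classifier, the constant input shift that stands in for a real adversarial attack, and all function names. It does not cover the randomized extension mentioned at the end of the abstract.

```python
import math
import random

# Hypothetical toy "network": one fixed hidden layer with ReLU.
# Rows are input dimensions, columns are hidden neurons.
# The values are illustrative only and are not taken from the paper.
HIDDEN_WEIGHTS = [
    [1.0, 0.0, 0.5],
    [0.0, 1.0, 0.5],
    [0.0, 1.0, 0.5],
    [0.0, 0.0, 0.5],
]

def hidden_activations(x):
    """Return the ReLU activations of the toy hidden layer for one input."""
    return [
        max(0.0, sum(x[i] * HIDDEN_WEIGHTS[i][j] for i in range(len(x))))
        for j in range(len(HIDDEN_WEIGHTS[0]))
    ]

def standardize(rows):
    """Scale each activation column to zero mean and unit variance."""
    cols = list(zip(*rows))
    means = [sum(c) / len(c) for c in cols]
    stds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c))
            for c, m in zip(cols, means)]
    return [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in rows]

def train_detector(features, labels, lr=0.1, epochs=200):
    """Fit a logistic-regression detector with plain gradient descent."""
    n, d = len(features), len(features[0])
    weights, bias = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for row, label in zip(features, labels):
            logit = sum(w * v for w, v in zip(weights, row)) + bias
            error = 1.0 / (1.0 + math.exp(-logit)) - label  # d(log-loss)/d(logit)
            grad_b += error
            for j in range(d):
                grad_w[j] += error * row[j]
        weights = [w - lr * g / n for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b / n
    return weights, bias

def is_flagged(row, weights, bias):
    """Return True if the detector labels the activation vector as adversarial."""
    return sum(w * v for w, v in zip(weights, row)) + bias > 0.0

def run_demo(seed=0, n_per_class=100):
    """Train the detector on toy data and return its training accuracy."""
    rng = random.Random(seed)
    benign = [[rng.random() for _ in range(4)] for _ in range(n_per_class)]
    # Stand-in for adversarial inputs: a crude constant shift, NOT a real attack.
    perturbed = [[v + 2.0 for v in x] for x in benign]
    labels = [0.0] * n_per_class + [1.0] * n_per_class
    features = standardize([hidden_activations(x) for x in benign + perturbed])
    weights, bias = train_detector(features, labels)
    hits = sum(is_flagged(r, weights, bias) == bool(y)
               for r, y in zip(features, labels))
    return hits / len(labels)
```

Calling `run_demo()` returns the fraction of toy inputs the detector labels correctly. Because the shift used here is far larger than a realistic adversarial perturbation, the separation is much easier than in the setting the paper evaluates, so the sketch shows only the shape of the pipeline: extract activations, label them, train a detector, and flag inputs.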

* 10 pages of content plus 2 pages of bibliography. Submitted to ISSTA
