Toward Few-step Adversarial Training from a Frequency Perspective

Oct 13, 2020
Hans Shih-Han Wang, Cory Cornelius, Brandon Edwards, Jason Martin

We investigate adversarial-sample generation methods from a frequency-domain perspective and extend standard $l_{\infty}$ Projected Gradient Descent (PGD) to the frequency domain. The resulting method, which we call Spectral Projected Gradient Descent (SPGD), achieves a higher success rate than PGD during the early steps of the attack. With the number of attack steps held constant, adversarially training models with SPGD achieves greater adversarial accuracy than training with PGD. SPGD can therefore reduce the overhead of adversarial training by allowing adversarial examples to be generated in fewer steps. However, we also prove that SPGD is equivalent to a variant of the PGD ordinarily used for the $l_{\infty}$ threat model, one that omits the sign function ordinarily applied to the gradient. SPGD can therefore be performed without explicitly transforming into the frequency domain. Finally, we visualize the perturbations SPGD generates and find that they use both high- and low-frequency components, which suggests that removing either high-frequency or low-frequency components is not an effective defense.
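To make the difference between standard $l_{\infty}$ PGD and a sign-free variant concrete, the following is a minimal NumPy sketch of a single update step. It is an illustration only, not the authors' implementation: the function name `pgd_step`, its parameters, the `[0, 1]` pixel range, and the use of the raw, unnormalized gradient in the sign-free branch are all assumptions, since the abstract does not specify step-size scaling or normalization.

```python
import numpy as np


def pgd_step(x_adv, x_orig, grad, step_size, epsilon, use_sign=True):
    """One l_inf PGD ascent step (hypothetical helper, not from the paper).

    use_sign=True  -> standard l_inf PGD: step along sign(grad).
    use_sign=False -> sign-free variant: step along the raw gradient
                      (assumed unnormalized here for simplicity).
    """
    direction = np.sign(grad) if use_sign else grad
    x_new = x_adv + step_size * direction
    # Project back onto the l_inf ball of radius epsilon around x_orig.
    x_new = np.clip(x_new, x_orig - epsilon, x_orig + epsilon)
    # Keep values in the assumed valid input range [0, 1].
    return np.clip(x_new, 0.0, 1.0)


# Toy input and a made-up loss gradient, for illustration only.
x = np.array([0.5, 0.5, 0.5])
g = np.array([0.2, -0.01, 0.0])

signed = pgd_step(x, x, g, step_size=0.1, epsilon=0.05, use_sign=True)
raw = pgd_step(x, x, g, step_size=0.1, epsilon=0.05, use_sign=False)
```

In this toy case the signed step moves every coordinate with a nonzero gradient by the full step and is then clipped to the $\epsilon$-ball, while the sign-free step preserves the relative magnitudes of the gradient components. In practice a sign-free step would likely need a different step size or a normalization, which this sketch does not attempt to choose.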

* Proceedings of the 1st ACM Workshop on Security and Privacy on Artificial Intelligence (2020)
* 9 pages, 9 figures, SPAI'20, ACM ASIACCS 2020