Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Learning to Attack: Towards Textual Adversarial Attacking in Real-world Situations

Sep 19, 2020

Yuan Zang, Bairu Hou, Fanchao Qi, Zhiyuan Liu, Xiaojun Meng, Maosong Sun

Figure 1 for Learning to Attack: Towards Textual Adversarial Attacking in Real-world Situations

Figure 2 for Learning to Attack: Towards Textual Adversarial Attacking in Real-world Situations

Figure 3 for Learning to Attack: Towards Textual Adversarial Attacking in Real-world Situations

Figure 4 for Learning to Attack: Towards Textual Adversarial Attacking in Real-world Situations

Share this with someone who'll enjoy it:

Abstract:Adversarial attacking aims to fool deep neural networks with adversarial examples. In the field of natural language processing, various textual adversarial attack models have been proposed, varying in the accessibility to the victim model. Among them, the attack models that only require the output of the victim model are more fit for real-world situations of adversarial attacking. However, to achieve high attack performance, these models usually need to query the victim model too many times, which is neither efficient nor viable in practice. To tackle this problem, we propose a reinforcement learning based attack model, which can learn from attack history and launch attacks more efficiently. In experiments, we evaluate our model by attacking several state-of-the-art models on the benchmark datasets of multiple tasks including sentiment analysis, text classification and natural language inference. Experimental results demonstrate that our model consistently achieves both better attack performance and higher efficiency than recently proposed baseline methods. We also find our attack model can bring more robustness improvement to the victim model by adversarial training. All the code and data of this paper will be made public.

* work in progress, 10 pages, 6 figures

View paper on

Share this with someone who'll enjoy it:

Title:Learning to Attack: Towards Textual Adversarial Attacking in Real-world Situations

Paper and Code