Title: Contextualized Perturbation for Textual Adversarial Attack

Sep 16, 2020

Dianqi Li, Yizhe Zhang, Hao Peng, Liqun Chen, Chris Brockett, Ming-Ting Sun, Bill Dolan

Abstract: Adversarial examples expose the vulnerabilities of natural language processing (NLP) models, and can be used to evaluate and improve their robustness. Existing techniques of generating such examples are typically driven by local heuristic rules that are agnostic to the context, often resulting in unnatural and ungrammatical outputs. This paper presents CLARE, a ContextuaLized AdversaRial Example generation model that produces fluent and grammatical outputs through a mask-then-infill procedure. CLARE builds on a pre-trained masked language model and modifies the inputs in a context-aware manner. We propose three contextualized perturbations, Replace, Insert and Merge, allowing for generating outputs of varied lengths. With a richer range of available strategies, CLARE is able to attack a victim model more efficiently with fewer edits. Extensive experiments and human evaluation demonstrate that CLARE outperforms the baselines in terms of attack success rate, textual similarity, fluency and grammaticality.
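
The mask-then-infill idea can be illustrated with a small sketch. This is not the authors' implementation: the function names, the `[MASK]` string, and the exact position conventions (mask the token at position i, insert a mask after position i, collapse the bigram at positions i and i+1) are assumptions made for illustration. The masked language model is left out; `infill` simply takes a candidate token that such a model might propose.

```python
# Illustrative sketch only; names and index conventions are assumptions,
# not taken from the CLARE code base.
MASK = "[MASK]"  # placeholder; the real mask string depends on the language model


def mask_replace(tokens, i):
    """Replace: mask the token at position i (length unchanged)."""
    return tokens[:i] + [MASK] + tokens[i + 1:]


def mask_insert(tokens, i):
    """Insert: add a mask after position i (length grows by one)."""
    return tokens[:i + 1] + [MASK] + tokens[i + 1:]


def mask_merge(tokens, i):
    """Merge: collapse the bigram at positions i and i+1 into one mask
    (length shrinks by one)."""
    if i + 1 >= len(tokens):
        raise ValueError("merge needs a token at position i + 1")
    return tokens[:i] + [MASK] + tokens[i + 2:]


def infill(masked_tokens, candidate):
    """Fill the single mask with a candidate token, e.g. one proposed
    by a masked language model given the surrounding context."""
    return [candidate if t == MASK else t for t in masked_tokens]


tokens = "the movie was really good".split()
replaced = mask_replace(tokens, 4)  # same length as the input
inserted = mask_insert(tokens, 2)   # one token longer
merged = mask_merge(tokens, 3)      # one token shorter
print(replaced)
print(inserted)
print(merged)
print(infill(replaced, "decent"))
```

In an attack of this kind, each masked variant would be filled with candidates from the masked language model and the filled sentences scored against the victim model; that search loop is omitted here because the abstract does not specify it.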
