Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?

Oct 05, 2020

Shayne Longpre, Yu Wang, Christopher DuBois

Figure 1 for How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?

Figure 2 for How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?

Figure 3 for How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?

Figure 4 for How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?

Share this with someone who'll enjoy it:

Abstract:Task-agnostic forms of data augmentation have proven widely effective in computer vision, even on pretrained models. In NLP similar results are reported most commonly for low data regimes, non-pretrained models, or situationally for pretrained models. In this paper we ask how effective these techniques really are when applied to pretrained transformers. Using two popular varieties of task-agnostic data augmentation (not tailored to any particular task), Easy Data Augmentation (Wei and Zou, 2019) and Back-Translation (Sennrichet al., 2015), we conduct a systematic examination of their effects across 5 classification tasks, 6 datasets, and 3 variants of modern pretrained transformers, including BERT, XLNet, and RoBERTa. We observe a negative result, finding that techniques which previously reported strong improvements for non-pretrained models fail to consistently improve performance for pretrained transformers, even when training data is limited. We hope this empirical analysis helps inform practitioners where data augmentation techniques may confer improvements.

* 2 tables; 1 figure; EMNLP Findings

View paper on

Share this with someone who'll enjoy it:

Title:How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?

Paper and Code