Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Learning to Annotate: Modularizing Data Augmentation for Text Classifiers with Natural Language Explanations

Nov 07, 2019

Ziqi Wang, Yujia Qin, Wenxuan Zhou, Jun Yan, Qinyuan Ye, Leonardo Neves, Zhiyuan Liu, Xiang Ren

Figure 1 for Learning to Annotate: Modularizing Data Augmentation for Text Classifiers with Natural Language Explanations

Figure 2 for Learning to Annotate: Modularizing Data Augmentation for Text Classifiers with Natural Language Explanations

Figure 3 for Learning to Annotate: Modularizing Data Augmentation for Text Classifiers with Natural Language Explanations

Figure 4 for Learning to Annotate: Modularizing Data Augmentation for Text Classifiers with Natural Language Explanations

Share this with someone who'll enjoy it:

Abstract:Deep neural networks usually require massive labeled data, which restricts their applications in scenarios where data annotation is expensive. Natural language (NL) explanations have been demonstrated very useful additional supervision, which can provide sufficient domain knowledge for generating more labeled data over new instances, while the annotation time only doubles. However, directly applying them for augmenting model learning encounters two challenges: (1) NL explanations are unstructured and inherently compositional. (2) NL explanations often have large numbers of linguistic variants, resulting in low recall and limited generalization ability. In this paper, we propose a novel Neural EXecution Tree (NEXT) framework to augment training data for text classification using NL explanations. After transforming NL explanations into executable logical forms by semantic parsing, NEXT generalizes different types of actions specified by the logical forms for labeling data instances, which substantially increases the coverage of each NL explanation. Experiments on two NLP tasks (relation extraction and sentiment analysis) demonstrate its superiority over baseline methods. Its extension to multi-hop question answering achieves performance gain with light annotation effort.

* 16 pages, 7 figures, 13 tables

View paper on

Share this with someone who'll enjoy it:

Title:Learning to Annotate: Modularizing Data Augmentation for Text Classifiers with Natural Language Explanations

Paper and Code