Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Label Semantic Aware Pre-training for Few-shot Text Classification

Apr 14, 2022

Aaron Mueller, Jason Krone, Salvatore Romeo, Saab Mansour, Elman Mansimov, Yi Zhang, Dan Roth

Figure 1 for Label Semantic Aware Pre-training for Few-shot Text Classification

Figure 2 for Label Semantic Aware Pre-training for Few-shot Text Classification

Figure 3 for Label Semantic Aware Pre-training for Few-shot Text Classification

Figure 4 for Label Semantic Aware Pre-training for Few-shot Text Classification

Share this with someone who'll enjoy it:

Abstract:In text classification tasks, useful information is encoded in the label names. Label semantic aware systems have leveraged this information for improved text classification performance during fine-tuning and prediction. However, use of label-semantics during pre-training has not been extensively explored. We therefore propose Label Semantic Aware Pre-training (LSAP) to improve the generalization and data efficiency of text classification systems. LSAP incorporates label semantics into pre-trained generative models (T5 in our case) by performing secondary pre-training on labeled sentences from a variety of domains. As domain-general pre-training requires large amounts of data, we develop a filtering and labeling pipeline to automatically create sentence-label pairs from unlabeled text. We perform experiments on intent (ATIS, Snips, TOPv2) and topic classification (AG News, Yahoo! Answers). LSAP obtains significant accuracy improvements over state-of-the-art models for few-shot text classification while maintaining performance comparable to state of the art in high-resource settings.

* Accepted at ACL 2022

View paper on

Share this with someone who'll enjoy it:

Title:Label Semantic Aware Pre-training for Few-shot Text Classification

Paper and Code