Title: Train No Evil: Selective Masking for Task-guided Pre-training

Apr 21, 2020

Yuxian Gu, Zhengyan Zhang, Xiaozhi Wang, Zhiyuan Liu, Maosong Sun

Abstract: Recently, pre-trained language models mostly follow the pre-training-then-fine-tuning paradigm and have achieved great performance on various downstream tasks. However, due to the aimlessness of pre-training and the small scale of in-domain supervised data for fine-tuning, these two-stage models typically cannot capture domain-specific and task-specific language patterns well. In this paper, we propose a selective masking task-guided pre-training method and add it between general pre-training and fine-tuning. In this stage, we train the masked language modeling task on in-domain unsupervised data, which enables our model to effectively learn domain-specific language patterns. To efficiently learn task-specific language patterns, we adopt a selective masking strategy instead of conventional random masking, which means we only mask the tokens that are important to the downstream task. Specifically, we define the importance of a token as its impact on the final classification result and use a neural model to learn the implicit selecting rules. Experimental results on two sentiment analysis tasks show that our method can achieve comparable or even better performance with less than 50% of the overall computation cost, which indicates our method is both effective and efficient. The source code will be released in the future.
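
The idea of masking only task-important tokens can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the exact scoring rule, so the example assumes a simple leave-one-out score (how much the predicted probability changes when a token is removed) and uses a hypothetical toy lexicon classifier (`TOY_WEIGHTS`, `positive_probability`) in place of a fine-tuned model. The threshold value is likewise an assumption.

```python
import math

# Hypothetical toy lexicon standing in for a fine-tuned sentiment classifier.
TOY_WEIGHTS = {"great": 2.0, "love": 1.5, "terrible": -2.0, "boring": -1.5}
MASK = "[MASK]"


def positive_probability(tokens):
    """Toy classifier: logistic function over the summed lexicon weights."""
    score = sum(TOY_WEIGHTS.get(tok, 0.0) for tok in tokens)
    return 1.0 / (1.0 + math.exp(-score))


def token_importance(tokens):
    """Assumed leave-one-out score: change in the prediction when a token is dropped."""
    base = positive_probability(tokens)
    scores = []
    for i in range(len(tokens)):
        without_i = tokens[:i] + tokens[i + 1:]
        scores.append(abs(base - positive_probability(without_i)))
    return scores


def selective_mask(tokens, threshold=0.05):
    """Mask only tokens whose importance exceeds the (assumed) threshold.

    Returns the masked input and the masked-language-modeling labels,
    where None marks positions that are not predicted.
    """
    scores = token_importance(tokens)
    inputs = [MASK if s > threshold else tok for tok, s in zip(tokens, scores)]
    labels = [tok if s > threshold else None for tok, s in zip(tokens, scores)]
    return inputs, labels


if __name__ == "__main__":
    inputs, labels = selective_mask("the movie was great".split())
    print(inputs)
    print(labels)
```

Under these assumptions, only the sentiment-bearing word is masked, whereas random masking would spend most of its masks on words such as "the" or "was" that carry little task signal.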

* 6 pages, 2 figures
