Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Parameter-Efficient Tuning by Manipulating Hidden States of Pretrained Language Models For Classification Tasks

Apr 13, 2022

Haoran Yang, Piji Li, Wai Lam

Figure 1 for Parameter-Efficient Tuning by Manipulating Hidden States of Pretrained Language Models For Classification Tasks

Figure 2 for Parameter-Efficient Tuning by Manipulating Hidden States of Pretrained Language Models For Classification Tasks

Figure 3 for Parameter-Efficient Tuning by Manipulating Hidden States of Pretrained Language Models For Classification Tasks

Figure 4 for Parameter-Efficient Tuning by Manipulating Hidden States of Pretrained Language Models For Classification Tasks

Share this with someone who'll enjoy it:

Abstract:Parameter-efficient tuning aims to distill knowledge for downstream tasks by optimizing a few introduced parameters while freezing the pretrained language models (PLMs). Continuous prompt tuning which prepends a few trainable vectors to the embeddings of input is one of these methods and has drawn much attention due to its effectiveness and efficiency. This family of methods can be illustrated as exerting nonlinear transformations of hidden states inside PLMs. However, a natural question is ignored: can the hidden states be directly used for classification without changing them? In this paper, we aim to answer this question by proposing a simple tuning method which only introduces three trainable vectors. Firstly, we integrate all layers hidden states using the introduced vectors. And then, we input the integrated hidden state(s) to a task-specific linear classifier to predict categories. This scheme is similar to the way ELMo utilises hidden states except that they feed the hidden states to LSTM-based models. Although our proposed tuning scheme is simple, it achieves comparable performance with prompt tuning methods like P-tuning and P-tuning v2, verifying that original hidden states do contain useful information for classification tasks. Moreover, our method has an advantage over prompt tuning in terms of time and the number of parameters.

View paper on

Share this with someone who'll enjoy it:

Title:Parameter-Efficient Tuning by Manipulating Hidden States of Pretrained Language Models For Classification Tasks

Paper and Code