Alert button

A Study on Knowledge Distillation from Weak Teacher for Scaling Up Pre-trained Language Models

Add code
Bookmark button
Alert button
May 26, 2023
Hayeon Lee, Rui Hou, Jongpil Kim, Davis Liang, Sung Ju Hwang, Alexander Min

Figure 1 for A Study on Knowledge Distillation from Weak Teacher for Scaling Up Pre-trained Language Models
Figure 2 for A Study on Knowledge Distillation from Weak Teacher for Scaling Up Pre-trained Language Models
Figure 3 for A Study on Knowledge Distillation from Weak Teacher for Scaling Up Pre-trained Language Models
Figure 4 for A Study on Knowledge Distillation from Weak Teacher for Scaling Up Pre-trained Language Models

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: