Distilling Transformers into Simple Neural Networks with Unlabeled Transfer Data

Oct 04, 2019
Subhabrata Mukherjee, Ahmed Hassan Awadallah

View paper on arXiv
