Alert button

Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

Feb 26, 2020
Zhuohan Li, Eric Wallace, Sheng Shen, Kevin Lin, Kurt Keutzer, Dan Klein, Joseph E. Gonzalez

Figure 1 for Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers
Figure 2 for Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers
Figure 3 for Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers
Figure 4 for Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: