2x Faster Language Model Pre-training via Masked Structural Growth

May 04, 2023
Yiqun Yao, Zheng Zhang, Jing Li, Yequan Wang

Figures 1-4 for 2x Faster Language Model Pre-training via Masked Structural Growth

View paper on arXiv