Accelerating Transformer Pre-Training with 2:4 Sparsity

Apr 02, 2024
Yuezhou Hu, Kang Zhao, Weiyu Huang, Jianfei Chen, Jun Zhu
