Picture for Qiujieli Qin

Qiujieli Qin

Exploring the Benefit of Activation Sparsity in Pre-training

Add code
Oct 04, 2024
Figure 1 for Exploring the Benefit of Activation Sparsity in Pre-training
Figure 2 for Exploring the Benefit of Activation Sparsity in Pre-training
Figure 3 for Exploring the Benefit of Activation Sparsity in Pre-training
Figure 4 for Exploring the Benefit of Activation Sparsity in Pre-training
Viaarxiv icon