Alert button

SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning

Dec 17, 2020
Hanrui Wang, Zhekai Zhang, Song Han

Figure 1 for SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
Figure 2 for SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
Figure 3 for SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
Figure 4 for SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: