ERNIE-SPARSE: Learning Hierarchical Efficient Transformer Through Regularized Self-Attention

Mar 23, 2022
Yang Liu, Jiaxiang Liu, Li Chen, Yuxiang Lu, Shikun Feng, Zhida Feng, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

Figures 1-4 for ERNIE-SPARSE: Learning Hierarchical Efficient Transformer Through Regularized Self-Attention
