Alert button

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Add code
Bookmark button
Alert button
Jul 05, 2023
Jiayu Ding, Shuming Ma, Li Dong, Xingxing Zhang, Shaohan Huang, Wenhui Wang, Furu Wei

Figure 1 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Figure 2 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Figure 3 for LongNet: Scaling Transformers to 1,000,000,000 Tokens
Figure 4 for LongNet: Scaling Transformers to 1,000,000,000 Tokens

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: