Alert button

Retentive Network: A Successor to Transformer for Large Language Models

Jul 19, 2023
Yutao Sun, Li Dong, Shaohan Huang, Shuming Ma, Yuqing Xia, Jilong Xue, Jianyong Wang, Furu Wei

Figure 1 for Retentive Network: A Successor to Transformer for Large Language Models
Figure 2 for Retentive Network: A Successor to Transformer for Large Language Models
Figure 3 for Retentive Network: A Successor to Transformer for Large Language Models
Figure 4 for Retentive Network: A Successor to Transformer for Large Language Models

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: