Alert button

Adding Recurrence to Pretrained Transformers for Improved Efficiency and Context Size

Aug 16, 2020
Davis Yoshida, Allyson Ettinger, Kevin Gimpel

Figure 1 for Adding Recurrence to Pretrained Transformers for Improved Efficiency and Context Size
Figure 2 for Adding Recurrence to Pretrained Transformers for Improved Efficiency and Context Size
Figure 3 for Adding Recurrence to Pretrained Transformers for Improved Efficiency and Context Size
Figure 4 for Adding Recurrence to Pretrained Transformers for Improved Efficiency and Context Size

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: