Alert button

Fast Transformer Decoding: One Write-Head is All You Need

Nov 06, 2019
Noam Shazeer

Figure 1 for Fast Transformer Decoding: One Write-Head is All You Need
Figure 2 for Fast Transformer Decoding: One Write-Head is All You Need
Figure 3 for Fast Transformer Decoding: One Write-Head is All You Need

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: