Faster Transformer Decoding: N-gram Masked Self-Attention

Jan 14, 2020
Ciprian Chelba, Mia Chen, Ankur Bapna, Noam Shazeer

View paper on arXiv