Tricks for Training Sparse Translation Models

Oct 15, 2021
Dheeru Dua, Shruti Bhosale, Vedanuj Goswami, James Cross, Mike Lewis, Angela Fan

Figures 1-4 for Tricks for Training Sparse Translation Models

View paper on arXiv