Alert button

MixUp Training Leads to Reduced Overfitting and Improved Calibration for the Transformer Architecture

Feb 22, 2021
Wancong Zhang, Ieshan Vaidya

Figure 1 for MixUp Training Leads to Reduced Overfitting and Improved Calibration for the Transformer Architecture
Figure 2 for MixUp Training Leads to Reduced Overfitting and Improved Calibration for the Transformer Architecture
Figure 3 for MixUp Training Leads to Reduced Overfitting and Improved Calibration for the Transformer Architecture
Figure 4 for MixUp Training Leads to Reduced Overfitting and Improved Calibration for the Transformer Architecture

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: