
Dynamically Adjusting Transformer Batch Size by Monitoring Gradient Direction Change

May 05, 2020
Hongfei Xu, Josef van Genabith, Deyi Xiong, Qiuhui Liu

Figures 1-4 for Dynamically Adjusting Transformer Batch Size by Monitoring Gradient Direction Change


View paper on arXiv
