Alert button

Accumulated Decoupled Learning: Mitigating Gradient Staleness in Inter-Layer Model Parallelization

Dec 03, 2020
Huiping Zhuang, Zhiping Lin, Kar-Ann Toh

Figure 1 for Accumulated Decoupled Learning: Mitigating Gradient Staleness in Inter-Layer Model Parallelization
Figure 2 for Accumulated Decoupled Learning: Mitigating Gradient Staleness in Inter-Layer Model Parallelization
Figure 3 for Accumulated Decoupled Learning: Mitigating Gradient Staleness in Inter-Layer Model Parallelization
Figure 4 for Accumulated Decoupled Learning: Mitigating Gradient Staleness in Inter-Layer Model Parallelization

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: