Shengwei Li

Towards Understanding the Generalizability of Delayed Stochastic Gradient Descent

Aug 18, 2023
Xiaoge Deng, Li Shen, Shengwei Li, Tao Sun, Dongsheng Li, Dacheng Tao

Via arXiv

Merak: An Efficient Distributed DNN Training Framework with Automated 3D Parallelism for Giant Foundation Models

Jun 21, 2022
Zhiquan Lai, Shengwei Li, Xudong Tang, Keshi Ge, Weijie Liu, Yabo Duan, Linbo Qiao, Dongsheng Li

Via arXiv

EmbRace: Accelerating Sparse Communication for Distributed Training of NLP Neural Networks

Oct 18, 2021
Shengwei Li, Zhiquan Lai, Dongsheng Li, Xiangyu Ye, Yabo Duan

Via arXiv