Youhe Jiang

Improving Automatic Parallel Training via Balanced Memory Workload Optimization

Jul 05, 2023
Yujie Wang, Youhe Jiang, Xupeng Miao, Fangcheng Fu, Xiaonan Nie, Bin Cui

Via arXiv

Galvatron: Efficient Transformer Training over Multiple GPUs Using Automatic Parallelism

Nov 25, 2022
Xupeng Miao, Yujie Wang, Youhe Jiang, Chunan Shi, Xiaonan Nie, Hailin Zhang, Bin Cui

Via arXiv