Picture for Le Qin

Le Qin

Partial Experts Checkpoint: Efficient Fault Tolerance for Sparse Mixture-of-Experts Model Training

Add code
Aug 08, 2024
Viaarxiv icon

Shortcut-connected Expert Parallelism for Accelerating Mixture-of-Experts

Add code
Apr 07, 2024
Figure 1 for Shortcut-connected Expert Parallelism for Accelerating Mixture-of-Experts
Figure 2 for Shortcut-connected Expert Parallelism for Accelerating Mixture-of-Experts
Figure 3 for Shortcut-connected Expert Parallelism for Accelerating Mixture-of-Experts
Figure 4 for Shortcut-connected Expert Parallelism for Accelerating Mixture-of-Experts
Viaarxiv icon