Picture for Shifang Xu

Shifang Xu

NVIDIA

Heterogeneous Parallelism for Multimodal Large Language Model Training

Add code
May 26, 2026
Viaarxiv icon

Scalable Training of Mixture-of-Experts Models with Megatron Core

Add code
Mar 10, 2026
Viaarxiv icon