Pingtian Li

NVIDIA

Scalable Training of Mixture-of-Experts Models with Megatron Core

Mar 10, 2026
Via arXiv