Picture for Chandler Zhou

Chandler Zhou

MoE Parallel Folding: Heterogeneous Parallelism Mappings for Efficient Large-Scale MoE Model Training with Megatron Core

Add code
Apr 21, 2025
Viaarxiv icon

Aligning Language Models with Offline Reinforcement Learning from Human Feedback

Add code
Aug 23, 2023
Viaarxiv icon