Shangchun Zhao

A Unified Sequence Parallelism Approach for Long Context Generative AI

May 15, 2024
Via arXiv

G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems

Jan 09, 2024
Via arXiv

An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training

Dec 19, 2023
Via arXiv