Picture for Shangchun Zhao

Shangchun Zhao

A Unified Sequence Parallelism Approach for Long Context Generative AI

Add code
May 15, 2024
Figure 1 for A Unified Sequence Parallelism Approach for Long Context Generative AI
Figure 2 for A Unified Sequence Parallelism Approach for Long Context Generative AI
Figure 3 for A Unified Sequence Parallelism Approach for Long Context Generative AI
Figure 4 for A Unified Sequence Parallelism Approach for Long Context Generative AI
Viaarxiv icon

G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems

Add code
Jan 09, 2024
Figure 1 for G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems
Figure 2 for G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems
Figure 3 for G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems
Figure 4 for G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems
Viaarxiv icon

An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training

Add code
Dec 19, 2023
Viaarxiv icon