Picture for Youshao Xiao

Youshao Xiao

AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes

Add code
Apr 15, 2024
Figure 1 for AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes
Figure 2 for AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes
Figure 3 for AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes
Figure 4 for AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes
Viaarxiv icon

AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster

Add code
Apr 15, 2024
Figure 1 for AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster
Figure 2 for AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster
Figure 3 for AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster
Figure 4 for AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster
Viaarxiv icon

G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems

Add code
Jan 09, 2024
Figure 1 for G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems
Figure 2 for G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems
Figure 3 for G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems
Figure 4 for G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems
Viaarxiv icon

An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training

Add code
Dec 19, 2023
Figure 1 for An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training
Figure 2 for An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training
Figure 3 for An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training
Figure 4 for An Adaptive Placement and Parallelism Framework for Accelerating RLHF Training
Viaarxiv icon

Rethinking Memory and Communication Cost for Efficient Large Language Model Training

Add code
Oct 09, 2023
Figure 1 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Figure 2 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Figure 3 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Figure 4 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Viaarxiv icon

An Effective and Efficient Time-aware Entity Alignment Framework via Two-aspect Three-view Label Propagation

Add code
Jul 12, 2023
Figure 1 for An Effective and Efficient Time-aware Entity Alignment Framework via Two-aspect Three-view Label Propagation
Figure 2 for An Effective and Efficient Time-aware Entity Alignment Framework via Two-aspect Three-view Label Propagation
Figure 3 for An Effective and Efficient Time-aware Entity Alignment Framework via Two-aspect Three-view Label Propagation
Figure 4 for An Effective and Efficient Time-aware Entity Alignment Framework via Two-aspect Three-view Label Propagation
Viaarxiv icon