Picture for Fanzhuang Meng

Fanzhuang Meng

AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster

Add code
Apr 15, 2024
Figure 1 for AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster
Figure 2 for AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster
Figure 3 for AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster
Figure 4 for AntBatchInfer: Elastic Batch Inference in the Kubernetes Cluster
Viaarxiv icon

Rethinking Memory and Communication Cost for Efficient Large Language Model Training

Add code
Oct 09, 2023
Figure 1 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Figure 2 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Figure 3 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Figure 4 for Rethinking Memory and Communication Cost for Efficient Large Language Model Training
Viaarxiv icon