Longteng Zhang

Dissecting the Runtime Performance of the Training, Fine-tuning, and Inference of Large Language Models

Nov 07, 2023
Longteng Zhang, Xiang Liu, Zeyu Li, Xinglin Pan, Peijie Dong, Ruibo Fan, Rui Guo, Xin Wang, Qiong Luo, Shaohuai Shi, Xiaowen Chu

FusionAI: Decentralized Training and Deploying LLMs with Massive Consumer-Level GPUs

Sep 03, 2023
Zhenheng Tang, Yuxin Wang, Xin He, Longteng Zhang, Xinglin Pan, Qiang Wang, Rongfei Zeng, Kaiyong Zhao, Shaohuai Shi, Bingsheng He, Xiaowen Chu

LoRA-FA: Memory-efficient Low-rank Adaptation for Large Language Models Fine-tuning

Aug 07, 2023
Longteng Zhang, Lin Zhang, Shaohuai Shi, Xiaowen Chu, Bo Li

Evaluation and Optimization of Gradient Compression for Distributed Deep Learning

Jun 15, 2023
Lin Zhang, Longteng Zhang, Shaohuai Shi, Xiaowen Chu, Bo Li
