
Haibin Lin

LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization

Mar 02, 2024
Juntao Zhao, Borui Wan, Yanghua Peng, Haibin Lin, Chuan Wu

Via arXiv

MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs

Feb 23, 2024
Ziheng Jiang, Haibin Lin, Yinmin Zhong, Qi Huang, Yangrui Chen, Zhi Zhang, Yanghua Peng, Xiang Li, Cong Xie, Shibiao Nong, Yulu Jia, Sun He, Hongmin Chen, Zhihao Bai, Qi Hou, Shipeng Yan, Ding Zhou, Yiyao Sheng, Zhuo Jiang, Haohan Xu, Haoran Wei, Zhang Zhang, Pengfei Nie, Leqi Zou, Sida Zhao, Liang Xiang, Zherui Liu, Zhe Li, Xiaoying Jia, Jianxi Ye, Xin Jin, Xin Liu

Via arXiv

CDMPP: A Device-Model Agnostic Framework for Latency Prediction of Tensor Programs

Nov 17, 2023
Hanpeng Hu, Junwei Su, Juntao Zhao, Yanghua Peng, Yibo Zhu, Haibin Lin, Chuan Wu

Via arXiv

LEMON: Lossless model expansion

Oct 12, 2023
Yite Wang, Jiahao Su, Hanlin Lu, Cong Xie, Tianyi Liu, Jianbo Yuan, Haibin Lin, Ruoyu Sun, Hongxia Yang

Via arXiv

ByteComp: Revisiting Gradient Compression in Distributed Training

Jun 06, 2022
Zhuang Wang, Haibin Lin, Yibo Zhu, T. S. Eugene Ng

Via arXiv

Espresso: Revisiting Gradient Compression from the System Perspective

May 28, 2022
Zhuang Wang, Haibin Lin, Yibo Zhu, T. S. Eugene Ng

Via arXiv

dPRO: A Generic Profiling and Optimization System for Expediting Distributed DNN Training

May 18, 2022
Hanpeng Hu, Chenyu Jiang, Yuchen Zhong, Yanghua Peng, Chuan Wu, Yibo Zhu, Haibin Lin, Chuanxiong Guo

Via arXiv

Compressed Communication for Distributed Training: Adaptive Methods and System

May 17, 2021
Yuchen Zhong, Cong Xie, Shuai Zheng, Haibin Lin

Via arXiv

CSER: Communication-efficient SGD with Error Reset

Jul 29, 2020
Cong Xie, Shuai Zheng, Oluwasanmi Koyejo, Indranil Gupta, Mu Li, Haibin Lin

Via arXiv