Alert button
Picture for Fangyu Wang

Fangyu Wang

Alert button

FlattenQuant: Breaking Through the Inference Compute-bound for Large Language Models with Per-tensor Quantization

Add code
Bookmark button
Alert button
Feb 28, 2024
Yi Zhang, Fei Yang, Shuang Peng, Fangyu Wang, Aimin Pan

Viaarxiv icon

Holmes: Towards Distributed Training Across Clusters with Heterogeneous NIC Environment

Add code
Bookmark button
Alert button
Dec 11, 2023
Fei Yang, Shuang Peng, Ning Sun, Fangyu Wang, Ke Tan, Fu Wu, Jiezhong Qiu, Aimin Pan

Viaarxiv icon