Alert button
Picture for Yifan Xiong

Yifan Xiong

Alert button

FP8-LM: Training FP8 Large Language Models

Add code
Bookmark button
Alert button
Oct 27, 2023
Houwen Peng, Kan Wu, Yixuan Wei, Guoshuai Zhao, Yuxiang Yang, Ze Liu, Yifan Xiong, Ziyue Yang, Bolin Ni, Jingcheng Hu, Ruihang Li, Miaosen Zhang, Chen Li, Jia Ning, Ruizhe Wang, Zheng Zhang, Shuguang Liu, Joe Chau, Han Hu, Peng Cheng

Figure 1 for FP8-LM: Training FP8 Large Language Models
Figure 2 for FP8-LM: Training FP8 Large Language Models
Figure 3 for FP8-LM: Training FP8 Large Language Models
Figure 4 for FP8-LM: Training FP8 Large Language Models
Viaarxiv icon

Tutel: Adaptive Mixture-of-Experts at Scale

Add code
Bookmark button
Alert button
Jun 07, 2022
Changho Hwang, Wei Cui, Yifan Xiong, Ziyue Yang, Ze Liu, Han Hu, Zilong Wang, Rafael Salas, Jithin Jose, Prabhat Ram, Joe Chau, Peng Cheng, Fan Yang, Mao Yang, Yongqiang Xiong

Figure 1 for Tutel: Adaptive Mixture-of-Experts at Scale
Figure 2 for Tutel: Adaptive Mixture-of-Experts at Scale
Figure 3 for Tutel: Adaptive Mixture-of-Experts at Scale
Figure 4 for Tutel: Adaptive Mixture-of-Experts at Scale
Viaarxiv icon