Alert button
Picture for Yonghao Zhuang

Yonghao Zhuang

Alert button

Toward Inference-optimal Mixture-of-Expert Large Language Models

Add code
Bookmark button
Alert button
Apr 03, 2024
Longfei Yun, Yonghao Zhuang, Yao Fu, Eric P Xing, Hao Zhang

Viaarxiv icon

LLM360: Towards Fully Transparent Open-Source LLMs

Add code
Bookmark button
Alert button
Dec 11, 2023
Zhengzhong Liu, Aurick Qiao, Willie Neiswanger, Hongyi Wang, Bowen Tan, Tianhua Tao, Junbo Li, Yuqi Wang, Suqi Sun, Omkar Pangarkar, Richard Fan, Yi Gu, Victor Miller, Yonghao Zhuang, Guowei He, Haonan Li, Fajri Koto, Liping Tang, Nikhil Ranjan, Zhiqiang Shen, Xuguang Ren, Roberto Iriondo, Cun Mu, Zhiting Hu, Mark Schulze, Preslav Nakov, Tim Baldwin, Eric P. Xing

Figure 1 for LLM360: Towards Fully Transparent Open-Source LLMs
Figure 2 for LLM360: Towards Fully Transparent Open-Source LLMs
Figure 3 for LLM360: Towards Fully Transparent Open-Source LLMs
Figure 4 for LLM360: Towards Fully Transparent Open-Source LLMs
Viaarxiv icon

Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs

Add code
Bookmark button
Alert button
Oct 25, 2023
Bowen Tan, Yun Zhu, Lijuan Liu, Hongyi Wang, Yonghao Zhuang, Jindong Chen, Eric Xing, Zhiting Hu

Figure 1 for Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs
Figure 2 for Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs
Figure 3 for Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs
Figure 4 for Redco: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs
Viaarxiv icon

LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset

Add code
Bookmark button
Alert button
Sep 30, 2023
Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Tianle Li, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zhuohan Li, Zi Lin, Eric. P Xing, Joseph E. Gonzalez, Ion Stoica, Hao Zhang

Figure 1 for LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
Figure 2 for LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
Figure 3 for LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
Figure 4 for LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
Viaarxiv icon

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Add code
Bookmark button
Alert button
Jun 09, 2023
Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zi Lin, Zhuohan Li, Dacheng Li, Eric. P Xing, Hao Zhang, Joseph E. Gonzalez, Ion Stoica

Figure 1 for Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
Figure 2 for Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
Figure 3 for Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
Figure 4 for Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
Viaarxiv icon

On Optimizing the Communication of Model Parallelism

Add code
Bookmark button
Alert button
Nov 10, 2022
Yonghao Zhuang, Hexu Zhao, Lianmin Zheng, Zhuohan Li, Eric P. Xing, Qirong Ho, Joseph E. Gonzalez, Ion Stoica, Hao Zhang

Figure 1 for On Optimizing the Communication of Model Parallelism
Figure 2 for On Optimizing the Communication of Model Parallelism
Figure 3 for On Optimizing the Communication of Model Parallelism
Figure 4 for On Optimizing the Communication of Model Parallelism
Viaarxiv icon

Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning

Add code
Bookmark button
Alert button
Jan 28, 2022
Lianmin Zheng, Zhuohan Li, Hao Zhang, Yonghao Zhuang, Zhifeng Chen, Yanping Huang, Yida Wang, Yuanzhong Xu, Danyang Zhuo, Joseph E. Gonzalez, Ion Stoica

Figure 1 for Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning
Figure 2 for Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning
Figure 3 for Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning
Figure 4 for Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning
Viaarxiv icon