Alert button
Picture for Siyuan Zhuang

Siyuan Zhuang

Alert button

LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset

Add code
Bookmark button
Alert button
Sep 30, 2023
Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Tianle Li, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zhuohan Li, Zi Lin, Eric. P Xing, Joseph E. Gonzalez, Ion Stoica, Hao Zhang

Figure 1 for LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
Figure 2 for LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
Figure 3 for LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
Figure 4 for LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset
Viaarxiv icon

Efficient Memory Management for Large Language Model Serving with PagedAttention

Add code
Bookmark button
Alert button
Sep 12, 2023
Woosuk Kwon, Zhuohan Li, Siyuan Zhuang, Ying Sheng, Lianmin Zheng, Cody Hao Yu, Joseph E. Gonzalez, Hao Zhang, Ion Stoica

Figure 1 for Efficient Memory Management for Large Language Model Serving with PagedAttention
Figure 2 for Efficient Memory Management for Large Language Model Serving with PagedAttention
Figure 3 for Efficient Memory Management for Large Language Model Serving with PagedAttention
Figure 4 for Efficient Memory Management for Large Language Model Serving with PagedAttention
Viaarxiv icon

Judging LLM-as-a-judge with MT-Bench and Chatbot Arena

Add code
Bookmark button
Alert button
Jun 09, 2023
Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zi Lin, Zhuohan Li, Dacheng Li, Eric. P Xing, Hao Zhang, Joseph E. Gonzalez, Ion Stoica

Figure 1 for Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
Figure 2 for Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
Figure 3 for Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
Figure 4 for Judging LLM-as-a-judge with MT-Bench and Chatbot Arena
Viaarxiv icon

TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models

Add code
Bookmark button
Alert button
Feb 16, 2021
Zhuohan Li, Siyuan Zhuang, Shiyuan Guo, Danyang Zhuo, Hao Zhang, Dawn Song, Ion Stoica

Figure 1 for TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models
Figure 2 for TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models
Figure 3 for TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models
Figure 4 for TeraPipe: Token-Level Pipeline Parallelism for Training Large-Scale Language Models
Viaarxiv icon

Hoplite: Efficient Collective Communication for Task-Based Distributed Systems

Add code
Bookmark button
Alert button
Feb 13, 2020
Siyuan Zhuang, Zhuohan Li, Danyang Zhuo, Stephanie Wang, Eric Liang, Robert Nishihara, Philipp Moritz, Ion Stoica

Figure 1 for Hoplite: Efficient Collective Communication for Task-Based Distributed Systems
Figure 2 for Hoplite: Efficient Collective Communication for Task-Based Distributed Systems
Figure 3 for Hoplite: Efficient Collective Communication for Task-Based Distributed Systems
Figure 4 for Hoplite: Efficient Collective Communication for Task-Based Distributed Systems
Viaarxiv icon