
Ying Sheng

Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference

Mar 07, 2024
Wei-Lin Chiang, Lianmin Zheng, Ying Sheng, Anastasios Nikolas Angelopoulos, Tianle Li, Dacheng Li, Hao Zhang, Banghua Zhu, Michael Jordan, Joseph E. Gonzalez, Ion Stoica

Fairness in Serving Large Language Models

Dec 31, 2023
Ying Sheng, Shiyi Cao, Dacheng Li, Banghua Zhu, Zhuohan Li, Danyang Zhuo, Joseph E. Gonzalez, Ion Stoica

Efficiently Programming Large Language Models using SGLang

Dec 12, 2023
Lianmin Zheng, Liangsheng Yin, Zhiqiang Xie, Jeff Huang, Chuyue Sun, Cody Hao Yu, Shiyi Cao, Christos Kozyrakis, Ion Stoica, Joseph E. Gonzalez, Clark Barrett, Ying Sheng

S-LoRA: Serving Thousands of Concurrent LoRA Adapters

Nov 07, 2023
Ying Sheng, Shiyi Cao, Dacheng Li, Coleman Hooper, Nicholas Lee, Shuo Yang, Christopher Chou, Banghua Zhu, Lianmin Zheng, Kurt Keutzer, Joseph E. Gonzalez, Ion Stoica

Clover: Closed-Loop Verifiable Code Generation

Oct 26, 2023
Chuyue Sun, Ying Sheng, Oded Padon, Clark Barrett

LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset

Sep 30, 2023
Lianmin Zheng, Wei-Lin Chiang, Ying Sheng, Tianle Li, Siyuan Zhuang, Zhanghao Wu, Yonghao Zhuang, Zhuohan Li, Zi Lin, Eric P. Xing, Joseph E. Gonzalez, Ion Stoica, Hao Zhang

Efficient Memory Management for Large Language Model Serving with PagedAttention

Sep 12, 2023
Woosuk Kwon, Zhuohan Li, Siyuan Zhuang, Ying Sheng, Lianmin Zheng, Cody Hao Yu, Joseph E. Gonzalez, Hao Zhang, Ion Stoica

H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

Jul 19, 2023
Zhenyu Zhang, Ying Sheng, Tianyi Zhou, Tianlong Chen, Lianmin Zheng, Ruisi Cai, Zhao Song, Yuandong Tian, Christopher Ré, Clark Barrett, Zhangyang Wang, Beidi Chen
