Alert button
Picture for Shiyun Wei

Shiyun Wei

Alert button

LLMCad: Fast and Scalable On-device Large Language Model Inference

Add code
Bookmark button
Alert button
Sep 08, 2023
Daliang Xu, Wangsong Yin, Xin Jin, Ying Zhang, Shiyun Wei, Mengwei Xu, Xuanzhe Liu

Figure 1 for LLMCad: Fast and Scalable On-device Large Language Model Inference
Figure 2 for LLMCad: Fast and Scalable On-device Large Language Model Inference
Figure 3 for LLMCad: Fast and Scalable On-device Large Language Model Inference
Figure 4 for LLMCad: Fast and Scalable On-device Large Language Model Inference
Viaarxiv icon

EdgeMoE: Fast On-Device Inference of MoE-based Large Language Models

Add code
Bookmark button
Alert button
Aug 28, 2023
Rongjie Yi, Liwei Guo, Shiyun Wei, Ao Zhou, Shangguang Wang, Mengwei Xu

Viaarxiv icon