
Daliang Xu

A Survey of Resource-efficient LLM and Multimodal Foundation Models

Jan 16, 2024
Mengwei Xu, Wangsong Yin, Dongqi Cai, Rongjie Yi, Daliang Xu, Qipeng Wang, Bingyang Wu, Yihao Zhao, Chen Yang, Shihe Wang, Qiyang Zhang, Zhenyan Lu, Li Zhang, Shangguang Wang, Yuanchun Li, Yunxin Liu, Xin Jin, Xuanzhe Liu

Via arXiv

LLMCad: Fast and Scalable On-device Large Language Model Inference

Sep 08, 2023
Daliang Xu, Wangsong Yin, Xin Jin, Ying Zhang, Shiyun Wei, Mengwei Xu, Xuanzhe Liu

Via arXiv