Alert button
Picture for Sheng Qian

Sheng Qian

Alert button

Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs

Add code
Bookmark button
Alert button
Mar 29, 2024
Luchang Li, Sheng Qian, Jie Lu, Lunxi Yuan, Rui Wang, Qin Xie

Figure 1 for Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs
Figure 2 for Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs
Figure 3 for Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs
Figure 4 for Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs
Viaarxiv icon