Alert button

Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs

Mar 29, 2024
Luchang Li, Sheng Qian, Jie Lu, Lunxi Yuan, Rui Wang, Qin Xie

Figure 1 for Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs
Figure 2 for Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs
Figure 3 for Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs
Figure 4 for Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: