Picture for Jiawen Qi

Jiawen Qi

When NPUs Are Not Always Faster: A Stage-Level Analysis of Mobile LLM Inference

Add code
May 22, 2026
Viaarxiv icon