Picture for Afsara Benazir

Afsara Benazir

University of Virginia

Efficient Mixture-of-Experts LLM Inference with Apple Silicon NPUs

Add code
Apr 20, 2026
Viaarxiv icon

Leveraging cache to enable SLU on tiny devices

Add code
Nov 30, 2023
Figure 1 for Leveraging cache to enable SLU on tiny devices
Figure 2 for Leveraging cache to enable SLU on tiny devices
Figure 3 for Leveraging cache to enable SLU on tiny devices
Figure 4 for Leveraging cache to enable SLU on tiny devices
Viaarxiv icon