Xiaojian Liao

CaMDN: Enhancing Cache Efficiency for Multi-tenant DNNs on Integrated NPUs

May 10, 2025
Via arXiv

CoServe: Efficient Collaboration-of-Experts (CoE) Model Inference with Limited Memory

Mar 04, 2025
Via arXiv