Alert button

ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models

Jan 25, 2024
Yao Fu, Leyang Xue, Yeqi Huang, Andrei-Octavian Brabete, Dmitrii Ustiugov, Yuvraj Patel, Luo Mai

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: