Picture for Yifan Sui

Yifan Sui

ServerlessLoRA: Minimizing Latency and Cost in Serverless Inference for LoRA-Based LLMs

Add code
May 20, 2025
Viaarxiv icon