Picture for Yitao Hu

Yitao Hu

ServerlessLoRA: Minimizing Latency and Cost in Serverless Inference for LoRA-Based LLMs

Add code
May 20, 2025
Viaarxiv icon