Picture for Joan Oliveras

Joan Oliveras

Maximizing GPU Efficiency via Optimal Adapter Caching: An Analytical Approach for Multi-Tenant LLM Serving

Add code
Aug 11, 2025
Viaarxiv icon