Picture for Bozhao Gong

Bozhao Gong

SLO-Aware Compute Resource Allocation for Prefill-Decode Disaggregated LLM Inference

Add code
Mar 05, 2026
Viaarxiv icon