Picture for Michael Peng

Michael Peng

An Interpretable Latency Model for Speculative Decoding in LLM Serving

Add code
May 14, 2026
Viaarxiv icon