Xiangchen Li

WISP: Waste- and Interference-Suppressed Distributed Speculative LLM Serving at the Edge via Dynamic Drafting and SLO-Aware Batching

Jan 15, 2026
Via arXiv

SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving

Jun 11, 2025
Via arXiv