Picture for Zelei Shao

Zelei Shao

Scaling Speculative Decoding with Lookahead Reasoning

Add code
Jun 24, 2025
Viaarxiv icon

MiLo: Efficient Quantized MoE Inference with Mixture of Low-Rank Compensators

Add code
Apr 03, 2025
Viaarxiv icon