Zelei Shao

Beat the long tail: Distribution-Aware Speculative Decoding for RL Training

Nov 17, 2025, via arXiv

Scaling Speculative Decoding with Lookahead Reasoning

Jun 24, 2025, via arXiv

MiLo: Efficient Quantized MoE Inference with Mixture of Low-Rank Compensators

Apr 03, 2025, via arXiv