Picture for Xinfeng Xia

Xinfeng Xia

MoE-SpeQ: Speculative Quantized Decoding with Proactive Expert Prefetching and Offloading for Mixture-of-Experts

Add code
Nov 18, 2025
Viaarxiv icon