Picture for Yesheng Liang

Yesheng Liang

DFlash: Block Diffusion for Flash Speculative Decoding

Add code
Feb 05, 2026
Viaarxiv icon

ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference

Add code
Nov 13, 2025
Viaarxiv icon