Picture for Xinrui Zhong

Xinrui Zhong

Vortex: Efficient and Programmable Sparse Attention Serving for AI Agents

Add code
Jun 04, 2026
Viaarxiv icon

$R^2$-dLLM: Accelerating Diffusion Large Language Models via Spatio-Temporal Redundancy Reduction

Add code
Apr 21, 2026
Viaarxiv icon