Picture for Steven Li

Steven Li

Quantized Reasoning Models Think They Need to Think Longer, but They Do Not

Add code
May 29, 2026
Viaarxiv icon

JacQuant: STE-Free Quantization-Aware Training via Learned Jacobian Surrogates

Add code
May 25, 2026
Viaarxiv icon

Rethinking Model Efficiency: Multi-Agent Inference with Large Models

Add code
Apr 06, 2026
Viaarxiv icon

CLAA: Cross-Layer Attention Aggregation for Accelerating LLM Prefill

Add code
Feb 17, 2026
Viaarxiv icon

MoE-Spec: Expert Budgeting for Efficient Speculative Decoding

Add code
Feb 17, 2026
Viaarxiv icon

Audio MultiChallenge: A Multi-Turn Evaluation of Spoken Dialogue Systems on Natural Human Interaction

Add code
Dec 16, 2025
Figure 1 for Audio MultiChallenge: A Multi-Turn Evaluation of Spoken Dialogue Systems on Natural Human Interaction
Figure 2 for Audio MultiChallenge: A Multi-Turn Evaluation of Spoken Dialogue Systems on Natural Human Interaction
Figure 3 for Audio MultiChallenge: A Multi-Turn Evaluation of Spoken Dialogue Systems on Natural Human Interaction
Figure 4 for Audio MultiChallenge: A Multi-Turn Evaluation of Spoken Dialogue Systems on Natural Human Interaction
Viaarxiv icon

R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference

Add code
Apr 28, 2025
Viaarxiv icon