Picture for Quantong Qiu

Quantong Qiu

Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers

Add code
Jan 24, 2026
Viaarxiv icon

LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling

Add code
Oct 08, 2025
Viaarxiv icon

Accurate KV Cache Quantization with Outlier Tokens Tracing

Add code
May 16, 2025
Viaarxiv icon