Quantong Qiu

Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers

Jan 24, 2026
Via arXiv

LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling

Oct 08, 2025
Via arXiv

Accurate KV Cache Quantization with Outlier Tokens Tracing

May 16, 2025
Via arXiv