Picture for Xiaodong Ji

Xiaodong Ji

SALE : Low-bit Estimation for Efficient Sparse Attention in Long-context LLM Prefilling

Add code
May 30, 2025
Viaarxiv icon