Haiyuan Wan

Spotlight Attention: Towards Efficient LLM Generation via Non-linear Hashing-based KV Cache Retrieval

Aug 27, 2025
Via arXiv