Picture for Narasimha Reddy

Narasimha Reddy

Polar Sparsity: High Throughput Batched LLM Inferencing with Scalable Contextual Sparsity

Add code
May 20, 2025
Viaarxiv icon

ESPN: Memory-Efficient Multi-Vector Information Retrieval

Add code
Dec 09, 2023
Viaarxiv icon