Picture for Susav Shrestha

Susav Shrestha

Polar Sparsity: High Throughput Batched LLM Inferencing with Scalable Contextual Sparsity

Add code
May 20, 2025
Viaarxiv icon

ESPN: Memory-Efficient Multi-Vector Information Retrieval

Add code
Dec 09, 2023
Figure 1 for ESPN: Memory-Efficient Multi-Vector Information Retrieval
Figure 2 for ESPN: Memory-Efficient Multi-Vector Information Retrieval
Figure 3 for ESPN: Memory-Efficient Multi-Vector Information Retrieval
Figure 4 for ESPN: Memory-Efficient Multi-Vector Information Retrieval
Viaarxiv icon