Alert button
Picture for Aditya Desai

Aditya Desai

Alert button

Heterogeneous federated collaborative filtering using FAIR: Federated Averaging in Random Subspaces

Add code
Bookmark button
Alert button
Nov 03, 2023
Aditya Desai, Benjamin Meisburger, Zichang Liu, Anshumali Shrivastava

Viaarxiv icon

In defense of parameter sharing for model-compression

Add code
Bookmark button
Alert button
Oct 17, 2023
Aditya Desai, Anshumali Shrivastava

Viaarxiv icon

Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time

Add code
Bookmark button
Alert button
May 26, 2023
Zichang Liu, Aditya Desai, Fangshuo Liao, Weitao Wang, Victor Xie, Zhaozhuo Xu, Anastasios Kyrillidis, Anshumali Shrivastava

Figure 1 for Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time
Figure 2 for Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time
Figure 3 for Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time
Figure 4 for Scissorhands: Exploiting the Persistence of Importance Hypothesis for LLM KV Cache Compression at Test Time
Viaarxiv icon

The trade-offs of model size in large recommendation models : A 10000 $\times$ compressed criteo-tb DLRM model (100 GB parameters to mere 10MB)

Add code
Bookmark button
Alert button
Jul 21, 2022
Aditya Desai, Anshumali Shrivastava

Figure 1 for The trade-offs of model size in large recommendation models : A 10000 $\times$ compressed criteo-tb DLRM model (100 GB parameters to mere 10MB)
Figure 2 for The trade-offs of model size in large recommendation models : A 10000 $\times$ compressed criteo-tb DLRM model (100 GB parameters to mere 10MB)
Figure 3 for The trade-offs of model size in large recommendation models : A 10000 $\times$ compressed criteo-tb DLRM model (100 GB parameters to mere 10MB)
Figure 4 for The trade-offs of model size in large recommendation models : A 10000 $\times$ compressed criteo-tb DLRM model (100 GB parameters to mere 10MB)
Viaarxiv icon

Efficient model compression with Random Operation Access Specific Tile (ROAST) hashing

Add code
Bookmark button
Alert button
Jul 21, 2022
Aditya Desai, Keren Zhou, Anshumali Shrivastava

Figure 1 for Efficient model compression with Random Operation Access Specific Tile (ROAST) hashing
Figure 2 for Efficient model compression with Random Operation Access Specific Tile (ROAST) hashing
Figure 3 for Efficient model compression with Random Operation Access Specific Tile (ROAST) hashing
Figure 4 for Efficient model compression with Random Operation Access Specific Tile (ROAST) hashing
Viaarxiv icon

Random Offset Block Embedding Array (ROBE) for CriteoTB Benchmark MLPerf DLRM Model : 1000$\times$ Compression and 2.7$\times$ Faster Inference

Add code
Bookmark button
Alert button
Aug 04, 2021
Aditya Desai, Li Chou, Anshumali Shrivastava

Figure 1 for Random Offset Block Embedding Array (ROBE) for CriteoTB Benchmark MLPerf DLRM Model : 1000$\times$ Compression and 2.7$\times$ Faster Inference
Figure 2 for Random Offset Block Embedding Array (ROBE) for CriteoTB Benchmark MLPerf DLRM Model : 1000$\times$ Compression and 2.7$\times$ Faster Inference
Figure 3 for Random Offset Block Embedding Array (ROBE) for CriteoTB Benchmark MLPerf DLRM Model : 1000$\times$ Compression and 2.7$\times$ Faster Inference
Figure 4 for Random Offset Block Embedding Array (ROBE) for CriteoTB Benchmark MLPerf DLRM Model : 1000$\times$ Compression and 2.7$\times$ Faster Inference
Viaarxiv icon

Beyond Convolutions: A Novel Deep Learning Approach for Raw Seismic Data Ingestion

Add code
Bookmark button
Alert button
Feb 26, 2021
Zhaozhuo Xu, Aditya Desai, Menal Gupta, Anu Chandran, Antoine Vial-Aussavy, Anshumali Shrivastava

Figure 1 for Beyond Convolutions: A Novel Deep Learning Approach for Raw Seismic Data Ingestion
Figure 2 for Beyond Convolutions: A Novel Deep Learning Approach for Raw Seismic Data Ingestion
Figure 3 for Beyond Convolutions: A Novel Deep Learning Approach for Raw Seismic Data Ingestion
Figure 4 for Beyond Convolutions: A Novel Deep Learning Approach for Raw Seismic Data Ingestion
Viaarxiv icon

Semantically Constrained Memory Allocation (SCMA) for Embedding in Efficient Recommendation Systems

Add code
Bookmark button
Alert button
Feb 24, 2021
Aditya Desai, Yanzhou Pan, Kuangyuan Sun, Li Chou, Anshumali Shrivastava

Figure 1 for Semantically Constrained Memory Allocation (SCMA) for Embedding in Efficient Recommendation Systems
Figure 2 for Semantically Constrained Memory Allocation (SCMA) for Embedding in Efficient Recommendation Systems
Figure 3 for Semantically Constrained Memory Allocation (SCMA) for Embedding in Efficient Recommendation Systems
Figure 4 for Semantically Constrained Memory Allocation (SCMA) for Embedding in Efficient Recommendation Systems
Viaarxiv icon

Density Sketches for Sampling and Estimation

Add code
Bookmark button
Alert button
Feb 24, 2021
Aditya Desai, Benjamin Coleman, Anshumali Shrivastava

Figure 1 for Density Sketches for Sampling and Estimation
Figure 2 for Density Sketches for Sampling and Estimation
Figure 3 for Density Sketches for Sampling and Estimation
Figure 4 for Density Sketches for Sampling and Estimation
Viaarxiv icon