Picture for Arun Kejariwal

Arun Kejariwal

Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models

Add code
May 03, 2023
Figure 1 for Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models
Figure 2 for Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models
Figure 3 for Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models
Figure 4 for Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models
Viaarxiv icon

DreamShard: Generalizable Embedding Table Placement for Recommender Systems

Add code
Oct 05, 2022
Figure 1 for DreamShard: Generalizable Embedding Table Placement for Recommender Systems
Figure 2 for DreamShard: Generalizable Embedding Table Placement for Recommender Systems
Figure 3 for DreamShard: Generalizable Embedding Table Placement for Recommender Systems
Figure 4 for DreamShard: Generalizable Embedding Table Placement for Recommender Systems
Viaarxiv icon

Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems

Add code
Sep 02, 2022
Figure 1 for Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems
Figure 2 for Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems
Figure 3 for Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems
Figure 4 for Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems
Viaarxiv icon

AutoShard: Automated Embedding Table Sharding for Recommender Systems

Add code
Aug 12, 2022
Figure 1 for AutoShard: Automated Embedding Table Sharding for Recommender Systems
Figure 2 for AutoShard: Automated Embedding Table Sharding for Recommender Systems
Figure 3 for AutoShard: Automated Embedding Table Sharding for Recommender Systems
Figure 4 for AutoShard: Automated Embedding Table Sharding for Recommender Systems
Viaarxiv icon

Building a Performance Model for Deep Learning Recommendation Model Training on GPUs

Add code
Jan 19, 2022
Figure 1 for Building a Performance Model for Deep Learning Recommendation Model Training on GPUs
Figure 2 for Building a Performance Model for Deep Learning Recommendation Model Training on GPUs
Figure 3 for Building a Performance Model for Deep Learning Recommendation Model Training on GPUs
Figure 4 for Building a Performance Model for Deep Learning Recommendation Model Training on GPUs
Viaarxiv icon

Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems

Add code
May 04, 2021
Figure 1 for Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems
Figure 2 for Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems
Figure 3 for Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems
Figure 4 for Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems
Viaarxiv icon

Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data

Add code
Oct 21, 2020
Figure 1 for Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data
Figure 2 for Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data
Figure 3 for Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data
Figure 4 for Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data
Viaarxiv icon

Fast Distributed Training of Deep Neural Networks: Dynamic Communication Thresholding for Model and Data Parallelism

Add code
Oct 18, 2020
Figure 1 for Fast Distributed Training of Deep Neural Networks: Dynamic Communication Thresholding for Model and Data Parallelism
Figure 2 for Fast Distributed Training of Deep Neural Networks: Dynamic Communication Thresholding for Model and Data Parallelism
Figure 3 for Fast Distributed Training of Deep Neural Networks: Dynamic Communication Thresholding for Model and Data Parallelism
Figure 4 for Fast Distributed Training of Deep Neural Networks: Dynamic Communication Thresholding for Model and Data Parallelism
Viaarxiv icon

On the Runtime-Efficacy Trade-off of Anomaly Detection Techniques for Real-Time Streaming Data

Add code
Oct 12, 2017
Figure 1 for On the Runtime-Efficacy Trade-off of Anomaly Detection Techniques for Real-Time Streaming Data
Figure 2 for On the Runtime-Efficacy Trade-off of Anomaly Detection Techniques for Real-Time Streaming Data
Figure 3 for On the Runtime-Efficacy Trade-off of Anomaly Detection Techniques for Real-Time Streaming Data
Figure 4 for On the Runtime-Efficacy Trade-off of Anomaly Detection Techniques for Real-Time Streaming Data
Viaarxiv icon

Real Time Analytics: Algorithms and Systems

Add code
Aug 07, 2017
Figure 1 for Real Time Analytics: Algorithms and Systems
Viaarxiv icon