Alert button
Picture for Arun Kejariwal

Arun Kejariwal

Alert button

Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models

May 03, 2023
Daochen Zha, Louis Feng, Liang Luo, Bhargav Bhushanam, Zirui Liu, Yusuo Hu, Jade Nie, Yuzhen Huang, Yuandong Tian, Arun Kejariwal, Xia Hu

Figure 1 for Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models
Figure 2 for Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models
Figure 3 for Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models
Figure 4 for Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models
Viaarxiv icon

DreamShard: Generalizable Embedding Table Placement for Recommender Systems

Oct 05, 2022
Daochen Zha, Louis Feng, Qiaoyu Tan, Zirui Liu, Kwei-Herng Lai, Bhargav Bhushanam, Yuandong Tian, Arun Kejariwal, Xia Hu

Figure 1 for DreamShard: Generalizable Embedding Table Placement for Recommender Systems
Figure 2 for DreamShard: Generalizable Embedding Table Placement for Recommender Systems
Figure 3 for DreamShard: Generalizable Embedding Table Placement for Recommender Systems
Figure 4 for DreamShard: Generalizable Embedding Table Placement for Recommender Systems
Viaarxiv icon

Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems

Sep 02, 2022
Mao Ye, Ruichen Jiang, Haoxiang Wang, Dhruv Choudhary, Xiaocong Du, Bhargav Bhushanam, Aryan Mokhtari, Arun Kejariwal, Qiang Liu

Figure 1 for Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems
Figure 2 for Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems
Figure 3 for Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems
Figure 4 for Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems
Viaarxiv icon

AutoShard: Automated Embedding Table Sharding for Recommender Systems

Aug 12, 2022
Daochen Zha, Louis Feng, Bhargav Bhushanam, Dhruv Choudhary, Jade Nie, Yuandong Tian, Jay Chae, Yinbin Ma, Arun Kejariwal, Xia Hu

Figure 1 for AutoShard: Automated Embedding Table Sharding for Recommender Systems
Figure 2 for AutoShard: Automated Embedding Table Sharding for Recommender Systems
Figure 3 for AutoShard: Automated Embedding Table Sharding for Recommender Systems
Figure 4 for AutoShard: Automated Embedding Table Sharding for Recommender Systems
Viaarxiv icon

Building a Performance Model for Deep Learning Recommendation Model Training on GPUs

Jan 19, 2022
Zhongyi Lin, Louis Feng, Ehsan K. Ardestani, Jaewon Lee, John Lundell, Changkyu Kim, Arun Kejariwal, John D. Owens

Figure 1 for Building a Performance Model for Deep Learning Recommendation Model Training on GPUs
Figure 2 for Building a Performance Model for Deep Learning Recommendation Model Training on GPUs
Figure 3 for Building a Performance Model for Deep Learning Recommendation Model Training on GPUs
Figure 4 for Building a Performance Model for Deep Learning Recommendation Model Training on GPUs
Viaarxiv icon

Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems

May 04, 2021
Xiaocong Du, Bhargav Bhushanam, Jiecao Yu, Dhruv Choudhary, Tianxiang Gao, Sherman Wong, Louis Feng, Jongsoo Park, Yu Cao, Arun Kejariwal

Figure 1 for Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems
Figure 2 for Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems
Figure 3 for Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems
Figure 4 for Alternate Model Growth and Pruning for Efficient Training of Recommendation Systems
Viaarxiv icon

Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data

Oct 21, 2020
Mao Ye, Dhruv Choudhary, Jiecao Yu, Ellie Wen, Zeliang Chen, Jiyan Yang, Jongsoo Park, Qiang Liu, Arun Kejariwal

Figure 1 for Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data
Figure 2 for Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data
Figure 3 for Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data
Figure 4 for Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data
Viaarxiv icon

Fast Distributed Training of Deep Neural Networks: Dynamic Communication Thresholding for Model and Data Parallelism

Oct 18, 2020
Vipul Gupta, Dhruv Choudhary, Ping Tak Peter Tang, Xiaohan Wei, Xing Wang, Yuzhen Huang, Arun Kejariwal, Kannan Ramchandran, Michael W. Mahoney

Figure 1 for Fast Distributed Training of Deep Neural Networks: Dynamic Communication Thresholding for Model and Data Parallelism
Figure 2 for Fast Distributed Training of Deep Neural Networks: Dynamic Communication Thresholding for Model and Data Parallelism
Figure 3 for Fast Distributed Training of Deep Neural Networks: Dynamic Communication Thresholding for Model and Data Parallelism
Figure 4 for Fast Distributed Training of Deep Neural Networks: Dynamic Communication Thresholding for Model and Data Parallelism
Viaarxiv icon

On the Runtime-Efficacy Trade-off of Anomaly Detection Techniques for Real-Time Streaming Data

Oct 12, 2017
Dhruv Choudhary, Arun Kejariwal, Francois Orsini

Figure 1 for On the Runtime-Efficacy Trade-off of Anomaly Detection Techniques for Real-Time Streaming Data
Figure 2 for On the Runtime-Efficacy Trade-off of Anomaly Detection Techniques for Real-Time Streaming Data
Figure 3 for On the Runtime-Efficacy Trade-off of Anomaly Detection Techniques for Real-Time Streaming Data
Figure 4 for On the Runtime-Efficacy Trade-off of Anomaly Detection Techniques for Real-Time Streaming Data
Viaarxiv icon