Alert button
Picture for Dhruv Choudhary

Dhruv Choudhary

Alert button

Microscaling Data Formats for Deep Learning

Add code
Bookmark button
Alert button
Oct 19, 2023
Bita Darvish Rouhani, Ritchie Zhao, Ankit More, Mathew Hall, Alireza Khodamoradi, Summer Deng, Dhruv Choudhary, Marius Cornea, Eric Dellinger, Kristof Denolf, Stosic Dusan, Venmugil Elango, Maximilian Golub, Alexander Heinecke, Phil James-Roxby, Dharmesh Jani, Gaurav Kolhe, Martin Langhammer, Ada Li, Levi Melnick, Maral Mesmakhosroshahi, Andres Rodriguez, Michael Schulte, Rasoul Shafipour, Lei Shao, Michael Siu, Pradeep Dubey, Paulius Micikevicius, Maxim Naumov, Colin Verrilli, Ralph Wittig, Doug Burger, Eric Chung

Figure 1 for Microscaling Data Formats for Deep Learning
Figure 2 for Microscaling Data Formats for Deep Learning
Figure 3 for Microscaling Data Formats for Deep Learning
Figure 4 for Microscaling Data Formats for Deep Learning
Viaarxiv icon

FlexShard: Flexible Sharding for Industry-Scale Sequence Recommendation Models

Add code
Bookmark button
Alert button
Jan 08, 2023
Geet Sethi, Pallab Bhattacharya, Dhruv Choudhary, Carole-Jean Wu, Christos Kozyrakis

Figure 1 for FlexShard: Flexible Sharding for Industry-Scale Sequence Recommendation Models
Figure 2 for FlexShard: Flexible Sharding for Industry-Scale Sequence Recommendation Models
Figure 3 for FlexShard: Flexible Sharding for Industry-Scale Sequence Recommendation Models
Figure 4 for FlexShard: Flexible Sharding for Industry-Scale Sequence Recommendation Models
Viaarxiv icon

RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure

Add code
Bookmark button
Alert button
Nov 14, 2022
Mark Zhao, Dhruv Choudhary, Devashish Tyagi, Ajay Somani, Max Kaplan, Sung-Han Lin, Sarunya Pumma, Jongsoo Park, Aarti Basant, Niket Agarwal, Carole-Jean Wu, Christos Kozyrakis

Figure 1 for RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure
Figure 2 for RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure
Figure 3 for RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure
Figure 4 for RecD: Deduplication for End-to-End Deep Learning Recommendation Model Training Infrastructure
Viaarxiv icon

Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems

Add code
Bookmark button
Alert button
Sep 02, 2022
Mao Ye, Ruichen Jiang, Haoxiang Wang, Dhruv Choudhary, Xiaocong Du, Bhargav Bhushanam, Aryan Mokhtari, Arun Kejariwal, Qiang Liu

Figure 1 for Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems
Figure 2 for Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems
Figure 3 for Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems
Figure 4 for Future Gradient Descent for Adapting the Temporal Shifting Data Distribution in Online Recommendation Systems
Viaarxiv icon

AutoShard: Automated Embedding Table Sharding for Recommender Systems

Add code
Bookmark button
Alert button
Aug 12, 2022
Daochen Zha, Louis Feng, Bhargav Bhushanam, Dhruv Choudhary, Jade Nie, Yuandong Tian, Jay Chae, Yinbin Ma, Arun Kejariwal, Xia Hu

Figure 1 for AutoShard: Automated Embedding Table Sharding for Recommender Systems
Figure 2 for AutoShard: Automated Embedding Table Sharding for Recommender Systems
Figure 3 for AutoShard: Automated Embedding Table Sharding for Recommender Systems
Figure 4 for AutoShard: Automated Embedding Table Sharding for Recommender Systems
Viaarxiv icon

Positive Unlabeled Contrastive Learning

Add code
Bookmark button
Alert button
Jun 01, 2022
Anish Acharya, Sujay Sanghavi, Li Jing, Bhargav Bhushanam, Dhruv Choudhary, Michael Rabbat, Inderjit Dhillon

Figure 1 for Positive Unlabeled Contrastive Learning
Figure 2 for Positive Unlabeled Contrastive Learning
Figure 3 for Positive Unlabeled Contrastive Learning
Figure 4 for Positive Unlabeled Contrastive Learning
Viaarxiv icon

Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits

Add code
Bookmark button
Alert button
Oct 24, 2021
Yan Li, Dhruv Choudhary, Xiaohan Wei, Baichuan Yuan, Bhargav Bhushanam, Tuo Zhao, Guanghui Lan

Figure 1 for Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits
Figure 2 for Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits
Figure 3 for Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits
Figure 4 for Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits
Viaarxiv icon

Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale

Add code
Bookmark button
Alert button
May 26, 2021
Zhaoxia, Deng, Jongsoo Park, Ping Tak Peter Tang, Haixin Liu, Jie, Yang, Hector Yuen, Jianyu Huang, Daya Khudia, Xiaohan Wei, Ellie Wen, Dhruv Choudhary, Raghuraman Krishnamoorthi, Carole-Jean Wu, Satish Nadathur, Changkyu Kim, Maxim Naumov, Sam Naghshineh, Mikhail Smelyanskiy

Figure 1 for Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale
Figure 2 for Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale
Figure 3 for Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale
Figure 4 for Low-Precision Hardware Architectures Meet Recommendation Model Inference at Scale
Viaarxiv icon