Assaf Eisenman

High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models

Apr 15, 2021
Dheevatsa Mudigere, Yuchen Hao, Jianyu Huang, Andrew Tulloch, Srinivas Sridharan, Xing Liu, Mustafa Ozdal, Jade Nie, Jongsoo Park, Liang Luo, Jie Amy Yang, Leon Gao, Dmytro Ivchenko, Aarti Basant, Yuxi Hu, Jiyan Yang, Ehsan K. Ardestani, Xiaodong Wang, Rakesh Komuravelli, Ching-Hsiang Chu, Serhat Yilmaz, Huayu Li, Jiyuan Qian, Zhuobo Feng, Yinbin Ma, Junjie Yang, Ellie Wen, Hong Li, Lin Yang, Chonglin Sun, Whitney Zhao, Dimitry Melts, Krishna Dhulipala, KR Kishore, Tyler Graf, Assaf Eisenman, Kiran Kumar Matam, Adi Gangidi, Guoqiang Jerry Chen, Manoj Krishnan, Avinash Nayak, Krishnakumar Nair, Bharath Muthiah, Mahmoud Khorashadi, Pallab Bhattacharya, Petr Lapukhov, Maxim Naumov, Lin Qiao, Mikhail Smelyanskiy, Bill Jia, Vijay Rao

Via arXiv

Check-N-Run: A Checkpointing System for Training Recommendation Models

Oct 17, 2020
Assaf Eisenman, Kiran Kumar Matam, Steven Ingram, Dheevatsa Mudigere, Raghuraman Krishnamoorthi, Murali Annavaram, Krishnakumar Nair, Misha Smelyanskiy

Via arXiv

Bandana: Using Non-volatile Memory for Storing Deep Learning Models

Nov 15, 2018
Assaf Eisenman, Maxim Naumov, Darryl Gardner, Misha Smelyanskiy, Sergey Pupyrev, Kim Hazelwood, Asaf Cidon, Sachin Katti

Via arXiv