Alert button
Picture for Jongsoo Park

Jongsoo Park

Alert button

Amy

High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models

Add code
Bookmark button
Alert button
Apr 13, 2021
Dheevatsa Mudigere, Yuchen Hao, Jianyu Huang, Andrew Tulloch, Srinivas Sridharan, Xing Liu, Mustafa Ozdal, Jade Nie, Jongsoo Park, Liang Luo, Jie Amy Yang, Leon Gao, Dmytro Ivchenko, Aarti Basant, Yuxi Hu, Jiyan Yang, Ehsan K. Ardestani, Xiaodong Wang, Rakesh Komuravelli, Ching-Hsiang Chu, Serhat Yilmaz, Huayu Li, Jiyuan Qian, Zhuobo Feng, Yinbin Ma, Junjie Yang, Ellie Wen, Hong Li, Lin Yang, Chonglin Sun, Whitney Zhao, Krishna Dhulipala, KR Kishore, Tyler Graf, Assaf Eisenman, Kiran Kumar Matam, Adi Gangidi, Pallab Bhattacharya, Guoqiang Jerry Chen, Manoj Krishnan, Krishnakumar Nair, Petr Lapukhov, Maxim Naumov, Lin Qiao, Mikhail Smelyanskiy, Bill Jia, Vijay Rao

Figure 1 for High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models
Figure 2 for High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models
Figure 3 for High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models
Figure 4 for High-performance, Distributed Training of Large-scale Deep Learning Recommendation Models
Viaarxiv icon

FBGEMM: Enabling High-Performance Low-Precision Deep Learning Inference

Add code
Bookmark button
Alert button
Jan 13, 2021
Daya Khudia, Jianyu Huang, Protonu Basu, Summer Deng, Haixin Liu, Jongsoo Park, Mikhail Smelyanskiy

Figure 1 for FBGEMM: Enabling High-Performance Low-Precision Deep Learning Inference
Figure 2 for FBGEMM: Enabling High-Performance Low-Precision Deep Learning Inference
Figure 3 for FBGEMM: Enabling High-Performance Low-Precision Deep Learning Inference
Figure 4 for FBGEMM: Enabling High-Performance Low-Precision Deep Learning Inference
Viaarxiv icon

Mixed-Precision Embedding Using a Cache

Add code
Bookmark button
Alert button
Oct 23, 2020
Jie Amy Yang, Jianyu Huang, Jongsoo Park, Ping Tak Peter Tang, Andrew Tulloch

Figure 1 for Mixed-Precision Embedding Using a Cache
Figure 2 for Mixed-Precision Embedding Using a Cache
Figure 3 for Mixed-Precision Embedding Using a Cache
Figure 4 for Mixed-Precision Embedding Using a Cache
Viaarxiv icon

Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data

Add code
Bookmark button
Alert button
Oct 21, 2020
Mao Ye, Dhruv Choudhary, Jiecao Yu, Ellie Wen, Zeliang Chen, Jiyan Yang, Jongsoo Park, Qiang Liu, Arun Kejariwal

Figure 1 for Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data
Figure 2 for Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data
Figure 3 for Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data
Figure 4 for Adaptive Dense-to-Sparse Paradigm for Pruning Online Recommendation System with Non-Stationary Data
Viaarxiv icon

Post-Training 4-bit Quantization on Embedding Tables

Add code
Bookmark button
Alert button
Nov 05, 2019
Hui Guan, Andrey Malevich, Jiyan Yang, Jongsoo Park, Hector Yuen

Figure 1 for Post-Training 4-bit Quantization on Embedding Tables
Figure 2 for Post-Training 4-bit Quantization on Embedding Tables
Figure 3 for Post-Training 4-bit Quantization on Embedding Tables
Figure 4 for Post-Training 4-bit Quantization on Embedding Tables
Viaarxiv icon

A Study of BFLOAT16 for Deep Learning Training

Add code
Bookmark button
Alert button
Jun 13, 2019
Dhiraj Kalamkar, Dheevatsa Mudigere, Naveen Mellempudi, Dipankar Das, Kunal Banerjee, Sasikanth Avancha, Dharma Teja Vooturi, Nataraj Jammalamadaka, Jianyu Huang, Hector Yuen, Jiyan Yang, Jongsoo Park, Alexander Heinecke, Evangelos Georganas, Sudarshan Srinivasan, Abhisek Kundu, Misha Smelyanskiy, Bharat Kaul, Pradeep Dubey

Figure 1 for A Study of BFLOAT16 for Deep Learning Training
Figure 2 for A Study of BFLOAT16 for Deep Learning Training
Figure 3 for A Study of BFLOAT16 for Deep Learning Training
Figure 4 for A Study of BFLOAT16 for Deep Learning Training
Viaarxiv icon