Post-Training 4-bit Quantization on Embedding Tables

Nov 05, 2019
Hui Guan, Andrey Malevich, Jiyan Yang, Jongsoo Park, Hector Yuen

* Accepted in MLSys@NeurIPS'19 (http://learningsys.org/neurips19/) 

  Access Model/Code and Paper
A Study of BFLOAT16 for Deep Learning Training

Jun 13, 2019
Dhiraj Kalamkar, Dheevatsa Mudigere, Naveen Mellempudi, Dipankar Das, Kunal Banerjee, Sasikanth Avancha, Dharma Teja Vooturi, Nataraj Jammalamadaka, Jianyu Huang, Hector Yuen, Jiyan Yang, Jongsoo Park, Alexander Heinecke, Evangelos Georganas, Sudarshan Srinivasan, Abhisek Kundu, Misha Smelyanskiy, Bharat Kaul, Pradeep Dubey


  Access Model/Code and Paper
Deep Learning Recommendation Model for Personalization and Recommendation Systems

May 31, 2019
Maxim Naumov, Dheevatsa Mudigere, Hao-Jun Michael Shi, Jianyu Huang, Narayanan Sundaraman, Jongsoo Park, Xiaodong Wang, Udit Gupta, Carole-Jean Wu, Alisson G. Azzolini, Dmytro Dzhulgakov, Andrey Mallevich, Ilia Cherniavskii, Yinghai Lu, Raghuraman Krishnamoorthi, Ansha Yu, Volodymyr Kondratenko, Stephanie Pereira, Xianjie Chen, Wenlin Chen, Vijay Rao, Bill Jia, Liang Xiong, Misha Smelyanskiy

* 10 pages, 6 figures 

  Access Model/Code and Paper
Spatial-Winograd Pruning Enabling Sparse Winograd Convolution

Jan 08, 2019
Jiecao Yu, Jongsoo Park, Maxim Naumov


  Access Model/Code and Paper
Deep Learning Inference in Facebook Data Centers: Characterization, Performance Optimizations and Hardware Implications

Nov 29, 2018
Jongsoo Park, Maxim Naumov, Protonu Basu, Summer Deng, Aravind Kalaiah, Daya Khudia, James Law, Parth Malani, Andrey Malevich, Satish Nadathur, Juan Pino, Martin Schatz, Alexander Sidorov, Viswanath Sivakumar, Andrew Tulloch, Xiaodong Wang, Yiming Wu, Hector Yuen, Utku Diril, Dmytro Dzhulgakov, Kim Hazelwood, Bill Jia, Yangqing Jia, Lin Qiao, Vijay Rao, Nadav Rotem, Sungjoo Yoo, Mikhail Smelyanskiy


  Access Model/Code and Paper
On Periodic Functions as Regularizers for Quantization of Neural Networks

Nov 24, 2018
Maxim Naumov, Utku Diril, Jongsoo Park, Benjamin Ray, Jedrzej Jablonski, Andrew Tulloch

* 11 pages, 7 figures 

  Access Model/Code and Paper
Enabling Sparse Winograd Convolution by Native Pruning

Oct 13, 2017
Sheng Li, Jongsoo Park, Ping Tak Peter Tang

* 10 pages, 2 figures 

  Access Model/Code and Paper
Faster CNNs with Direct Sparse Convolutions and Guided Pruning

Jul 28, 2017
Jongsoo Park, Sheng Li, Wei Wen, Ping Tak Peter Tang, Hai Li, Yiran Chen, Pradeep Dubey

* 12 pages, 5 figures 

  Access Model/Code and Paper