Alert button
Picture for Pradeep Dubey

Pradeep Dubey

Alert button

Microscaling Data Formats for Deep Learning

Oct 19, 2023
Bita Darvish Rouhani, Ritchie Zhao, Ankit More, Mathew Hall, Alireza Khodamoradi, Summer Deng, Dhruv Choudhary, Marius Cornea, Eric Dellinger, Kristof Denolf, Stosic Dusan, Venmugil Elango, Maximilian Golub, Alexander Heinecke, Phil James-Roxby, Dharmesh Jani, Gaurav Kolhe, Martin Langhammer, Ada Li, Levi Melnick, Maral Mesmakhosroshahi, Andres Rodriguez, Michael Schulte, Rasoul Shafipour, Lei Shao, Michael Siu, Pradeep Dubey, Paulius Micikevicius, Maxim Naumov, Colin Verrilli, Ralph Wittig, Doug Burger, Eric Chung

Figure 1 for Microscaling Data Formats for Deep Learning
Figure 2 for Microscaling Data Formats for Deep Learning
Figure 3 for Microscaling Data Formats for Deep Learning
Figure 4 for Microscaling Data Formats for Deep Learning
Viaarxiv icon

AUTOSPARSE: Towards Automated Sparse Training of Deep Neural Networks

Apr 14, 2023
Abhisek Kundu, Naveen K. Mellempudi, Dharma Teja Vooturi, Bharat Kaul, Pradeep Dubey

Figure 1 for AUTOSPARSE: Towards Automated Sparse Training of Deep Neural Networks
Figure 2 for AUTOSPARSE: Towards Automated Sparse Training of Deep Neural Networks
Figure 3 for AUTOSPARSE: Towards Automated Sparse Training of Deep Neural Networks
Figure 4 for AUTOSPARSE: Towards Automated Sparse Training of Deep Neural Networks
Viaarxiv icon

FP8 Formats for Deep Learning

Sep 12, 2022
Paulius Micikevicius, Dusan Stosic, Neil Burgess, Marius Cornea, Pradeep Dubey, Richard Grisenthwaite, Sangwon Ha, Alexander Heinecke, Patrick Judd, John Kamalu, Naveen Mellempudi, Stuart Oberman, Mohammad Shoeybi, Michael Siu, Hao Wu

Figure 1 for FP8 Formats for Deep Learning
Figure 2 for FP8 Formats for Deep Learning
Figure 3 for FP8 Formats for Deep Learning
Figure 4 for FP8 Formats for Deep Learning
Viaarxiv icon

Systolic Computing on GPUs for Productive Performance

Oct 29, 2020
Hongbo Rong, Xiaochen Hao, Yun Liang, Lidong Xu, Hong H Jiang, Pradeep Dubey

Figure 1 for Systolic Computing on GPUs for Productive Performance
Figure 2 for Systolic Computing on GPUs for Productive Performance
Figure 3 for Systolic Computing on GPUs for Productive Performance
Figure 4 for Systolic Computing on GPUs for Productive Performance
Viaarxiv icon

MISIM: An End-to-End Neural Code Similarity System

Jun 15, 2020
Fangke Ye, Shengtian Zhou, Anand Venkat, Ryan Marcus, Nesime Tatbul, Jesmin Jahan Tithi, Paul Petersen, Timothy Mattson, Tim Kraska, Pradeep Dubey, Vivek Sarkar, Justin Gottschlich

Figure 1 for MISIM: An End-to-End Neural Code Similarity System
Figure 2 for MISIM: An End-to-End Neural Code Similarity System
Figure 3 for MISIM: An End-to-End Neural Code Similarity System
Figure 4 for MISIM: An End-to-End Neural Code Similarity System
Viaarxiv icon

Context-Aware Parse Trees

Mar 24, 2020
Fangke Ye, Shengtian Zhou, Anand Venkat, Ryan Marcus, Paul Petersen, Jesmin Jahan Tithi, Tim Mattson, Tim Kraska, Pradeep Dubey, Vivek Sarkar, Justin Gottschlich

Figure 1 for Context-Aware Parse Trees
Figure 2 for Context-Aware Parse Trees
Figure 3 for Context-Aware Parse Trees
Figure 4 for Context-Aware Parse Trees
Viaarxiv icon

K-TanH: Hardware Efficient Activations For Deep Learning

Oct 21, 2019
Abhisek Kundu, Sudarshan Srinivasan, Eric C. Qin, Dhiraj Kalamkar, Naveen K. Mellempudi, Dipankar Das, Kunal Banerjee, Bharat Kaul, Pradeep Dubey

Figure 1 for K-TanH: Hardware Efficient Activations For Deep Learning
Figure 2 for K-TanH: Hardware Efficient Activations For Deep Learning
Figure 3 for K-TanH: Hardware Efficient Activations For Deep Learning
Figure 4 for K-TanH: Hardware Efficient Activations For Deep Learning
Viaarxiv icon

A Study of BFLOAT16 for Deep Learning Training

Jun 13, 2019
Dhiraj Kalamkar, Dheevatsa Mudigere, Naveen Mellempudi, Dipankar Das, Kunal Banerjee, Sasikanth Avancha, Dharma Teja Vooturi, Nataraj Jammalamadaka, Jianyu Huang, Hector Yuen, Jiyan Yang, Jongsoo Park, Alexander Heinecke, Evangelos Georganas, Sudarshan Srinivasan, Abhisek Kundu, Misha Smelyanskiy, Bharat Kaul, Pradeep Dubey

Figure 1 for A Study of BFLOAT16 for Deep Learning Training
Figure 2 for A Study of BFLOAT16 for Deep Learning Training
Figure 3 for A Study of BFLOAT16 for Deep Learning Training
Figure 4 for A Study of BFLOAT16 for Deep Learning Training
Viaarxiv icon