Efficient and Generic 1D Dilated Convolution Layer for Deep Learning



Narendra Chaudhary, Sanchit Misra, Dhiraj Kalamkar, Alexander Heinecke, Evangelos Georganas, Barukh Ziv, Menachem Adelman, Bharat Kaul



MADRaS : Multi Agent Driving Simulator



Anirban Santara, Sohan Rudra, Sree Aditya Buridi, Meha Kaushik, Abhishek Naik, Bharat Kaul, Balaraman Ravindran



PolyDL: Polyhedral Optimizations for Creation of High Performance DL primitives



Sanket Tavarageri, Alexander Heinecke, Sasikanth Avancha, Gagandeep Goyal, Ramakrishna Upadrasta, Bharat Kaul

* arXiv admin note: substantial text overlap with arXiv:2002.02145 


PolyScientist: Automatic Loop Transformations Combined with Microkernels for Optimization of Deep Learning Primitives



Sanket Tavarageri, Alexander Heinecke, Sasikanth Avancha, Gagandeep Goyal, Ramakrishna Upadrasta, Bharat Kaul



SEERL: Sample Efficient Ensemble Reinforcement Learning



Rohan Saphal, Balaraman Ravindran, Dheevatsa Mudigere, Sasikanth Avancha, Bharat Kaul



K-TanH: Hardware Efficient Activations For Deep Learning



Abhisek Kundu, Sudarshan Srinivasan, Eric C. Qin, Dhiraj Kalamkar, Naveen K. Mellempudi, Dipankar Das, Kunal Banerjee, Bharat Kaul, Pradeep Dubey

* 14 pages, 14 figures 


High Performance Scalable FPGA Accelerator for Deep Neural Networks



Sudarshan Srinivasan, Pradeep Janedula, Saurabh Dhoble, Sasikanth Avancha, Dipankar Das, Naveen Mellempudi, Bharat Daga, Martin Langhammer, Gregg Baeckler, Bharat Kaul



A Study of BFLOAT16 for Deep Learning Training



Dhiraj Kalamkar, Dheevatsa Mudigere, Naveen Mellempudi, Dipankar Das, Kunal Banerjee, Sasikanth Avancha, Dharma Teja Vooturi, Nataraj Jammalamadaka, Jianyu Huang, Hector Yuen, Jiyan Yang, Jongsoo Park, Alexander Heinecke, Evangelos Georganas, Sudarshan Srinivasan, Abhisek Kundu, Misha Smelyanskiy, Bharat Kaul, Pradeep Dubey



Automatic Model Parallelism for Deep Neural Networks with Compiler and Hardware Support



Sanket Tavarageri, Srinivas Sridharan, Bharat Kaul


