On Task-Level Dialogue Composition of Generative Transformer Model

Oct 09, 2020
Prasanna Parthasarathi, Arvind Neelakantan, Sharan Narang

* 8 pages; Accepted at Workshop on Insights from Negative Results in NLP 

WT5?! Training Text-to-Text Models to Explain their Predictions

Apr 30, 2020
Sharan Narang, Colin Raffel, Katherine Lee, Adam Roberts, Noah Fiedel, Karishma Malkan


Neural Assistant: Joint Action Prediction, Response Generation, and Latent Knowledge Reasoning

Oct 31, 2019
Arvind Neelakantan, Semih Yavuz, Sharan Narang, Vishaal Prasad, Ben Goodrich, Daniel Duckworth, Chinnadhurai Sankar, Xifeng Yan


Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

Oct 24, 2019
Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, Peter J. Liu


Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning

Feb 22, 2018
Wei Ping, Kainan Peng, Andrew Gibiansky, Sercan O. Arik, Ajay Kannan, Sharan Narang, Jonathan Raiman, John Miller

* Published as a conference paper at ICLR 2018. (v3 changed paper title) 

Mixed Precision Training

Feb 15, 2018
Paulius Micikevicius, Sharan Narang, Jonah Alben, Gregory Diamos, Erich Elsen, David Garcia, Boris Ginsburg, Michael Houston, Oleksii Kuchaiev, Ganesh Venkatesh, Hao Wu

* Published as a conference paper at ICLR 2018 

Deep Learning Scaling is Predictable, Empirically

Dec 01, 2017
Joel Hestness, Sharan Narang, Newsha Ardalani, Gregory Diamos, Heewoo Jun, Hassan Kianinejad, Md. Mostofa Ali Patwary, Yang Yang, Yanqi Zhou

* 19 pages, 11 figures 

Block-Sparse Recurrent Neural Networks

Nov 08, 2017
Sharan Narang, Eric Undersander, Gregory Diamos


Exploring Sparsity in Recurrent Neural Networks

Nov 06, 2017
Sharan Narang, Erich Elsen, Gregory Diamos, Shubho Sengupta

* Published as a conference paper at ICLR 2017 

DSD: Dense-Sparse-Dense Training for Deep Neural Networks

Feb 21, 2017
Song Han, Jeff Pool, Sharan Narang, Huizi Mao, Enhao Gong, Shijian Tang, Erich Elsen, Peter Vajda, Manohar Paluri, John Tran, Bryan Catanzaro, William J. Dally

* Published as a conference paper at ICLR 2017 

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin

Dec 08, 2015
Dario Amodei, Rishita Anubhai, Eric Battenberg, Carl Case, Jared Casper, Bryan Catanzaro, Jingdong Chen, Mike Chrzanowski, Adam Coates, Greg Diamos, Erich Elsen, Jesse Engel, Linxi Fan, Christopher Fougner, Tony Han, Awni Hannun, Billy Jun, Patrick LeGresley, Libby Lin, Sharan Narang, Andrew Ng, Sherjil Ozair, Ryan Prenger, Jonathan Raiman, Sanjeev Satheesh, David Seetapun, Shubho Sengupta, Yi Wang, Zhiqian Wang, Chong Wang, Bo Xiao, Dani Yogatama, Jun Zhan, Zhenyao Zhu

