Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Raul Puri

Evaluating Large Language Models Trained on Code


Jul 14, 2021
Mark Chen, Jerry Tworek, Heewoo Jun, Qiming Yuan, Henrique Ponde de Oliveira Pinto, Jared Kaplan, Harri Edwards, Yuri Burda, Nicholas Joseph, Greg Brockman, Alex Ray, Raul Puri, Gretchen Krueger, Michael Petrov, Heidy Khlaaf, Girish Sastry, Pamela Mishkin, Brooke Chan, Scott Gray, Nick Ryder, Mikhail Pavlov, Alethea Power, Lukasz Kaiser, Mohammad Bavarian, Clemens Winter, Philippe Tillet, Felipe Petroski Such, Dave Cummings, Matthias Plappert, Fotios Chantzis, Elizabeth Barnes, Ariel Herbert-Voss, William Hebgen Guss, Alex Nichol, Alex Paino, Nikolas Tezak, Jie Tang, Igor Babuschkin, Suchir Balaji, Shantanu Jain, William Saunders, Christopher Hesse, Andrew N. Carr, Jan Leike, Josh Achiam, Vedant Misra, Evan Morikawa, Alec Radford, Matthew Knight, Miles Brundage, Mira Murati, Katie Mayer, Peter Welinder, Bob McGrew, Dario Amodei, Sam McCandlish, Ilya Sutskever, Wojciech Zaremba

* corrected typos, added references, added authors, added acknowledgements 

  Access Paper or Ask Questions

Local Knowledge Powered Conversational Agents


Oct 20, 2020
Sashank Santhanam, Wei Ping, Raul Puri, Mohammad Shoeybi, Mostofa Patwary, Bryan Catanzaro


  Access Paper or Ask Questions

BioMegatron: Larger Biomedical Domain Language Model


Oct 14, 2020
Hoo-Chang Shin, Yang Zhang, Evelina Bakhturina, Raul Puri, Mostofa Patwary, Mohammad Shoeybi, Raghav Mani

* Accepted for publication at EMNLP 2020 

  Access Paper or Ask Questions

MEGATRON-CNTRL: Controllable Story Generation with External Knowledge Using Large-Scale Language Models


Oct 02, 2020
Peng Xu, Mostofa Patwary, Mohammad Shoeybi, Raul Puri, Pascale Fung, Anima Anandkumar, Bryan Catanzaro

* Accepted in EMNLP 2020 main conference 

  Access Paper or Ask Questions

Large Scale Multi-Actor Generative Dialog Modeling


May 13, 2020
Alex Boyd, Raul Puri, Mohammad Shoeybi, Mostofa Patwary, Bryan Catanzaro


  Access Paper or Ask Questions

Training Question Answering Models From Synthetic Data


Feb 22, 2020
Raul Puri, Ryan Spring, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro


  Access Paper or Ask Questions

Zero-shot Text Classification With Generative Language Models


Dec 10, 2019
Raul Puri, Bryan Catanzaro


  Access Paper or Ask Questions

Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism


Oct 05, 2019
Mohammad Shoeybi, Mostofa Patwary, Raul Puri, Patrick LeGresley, Jared Casper, Bryan Catanzaro


  Access Paper or Ask Questions

Practical Text Classification With Large Pre-Trained Language Models


Dec 04, 2018
Neel Kant, Raul Puri, Nikolai Yakovenko, Bryan Catanzaro

* 8 pages, submitted to AAAI 2019 

  Access Paper or Ask Questions

Large Scale Language Modeling: Converging on 40GB of Text in Four Hours


Aug 11, 2018
Raul Puri, Robert Kirby, Nikolai Yakovenko, Bryan Catanzaro

* 8 pages; To appear in High Performance Machine Learning Workshop (HPML) 2018 

  Access Paper or Ask Questions