Unsupervised Paraphrase Generation via Dynamic Blocking

Oct 24, 2020
Tong Niu, Semih Yavuz, Yingbo Zhou, Huan Wang, Nitish Shirish Keskar, Caiming Xiong

* 10 pages 

GeDi: Generative Discriminator Guided Sequence Generation

Sep 14, 2020
Ben Krause, Akhilesh Deepak Gotmare, Bryan McCann, Nitish Shirish Keskar, Shafiq Joty, Richard Socher, Nazneen Fatema Rajani


Mirostat: A Perplexity-Controlled Neural Text Decoding Algorithm

Jul 29, 2020
Sourya Basu, Govardana Sachitanandam Ramachandran, Nitish Shirish Keskar, Lav R. Varshney

* 18 pages, 8 figures 

ProGen: Language Modeling for Protein Generation

Mar 08, 2020
Ali Madani, Bryan McCann, Nikhil Naik, Nitish Shirish Keskar, Namrata Anand, Raphael R. Eguchi, Po-Ssu Huang, Richard Socher


Limits of Detecting Text Generated by Large-Scale Language Models

Feb 09, 2020
Lav R. Varshney, Nitish Shirish Keskar, Richard Socher

* ITA 2020 

Global Capacity Measures for Deep ReLU Networks via Path Sampling

Oct 22, 2019
Ryan Theisen, Jason M. Klusowski, Huan Wang, Nitish Shirish Keskar, Caiming Xiong, Richard Socher


CTRL: A Conditional Transformer Language Model for Controllable Generation

Sep 20, 2019
Nitish Shirish Keskar, Bryan McCann, Lav R. Varshney, Caiming Xiong, Richard Socher


Pretrained AI Models: Performativity, Mobility, and Change

Sep 07, 2019
Lav R. Varshney, Nitish Shirish Keskar, Richard Socher


Neural Text Summarization: A Critical Evaluation

Aug 23, 2019
Wojciech Kryściński, Nitish Shirish Keskar, Bryan McCann, Caiming Xiong, Richard Socher

* To appear in EMNLP 2019, 13 pages, 2 figures, 6 tables 

XLDA: Cross-Lingual Data Augmentation for Natural Language Inference and Question Answering

May 27, 2019
Jasdeep Singh, Bryan McCann, Nitish Shirish Keskar, Caiming Xiong, Richard Socher


Unifying Question Answering and Text Classification via Span Extraction

Apr 19, 2019
Nitish Shirish Keskar, Bryan McCann, Caiming Xiong, Richard Socher


Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering

Jan 03, 2019
Victor Zhong, Caiming Xiong, Nitish Shirish Keskar, Richard Socher

* ICLR 2019; 9 pages, 7 figures 

A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation

Oct 29, 2018
Akhilesh Gotmare, Nitish Shirish Keskar, Caiming Xiong, Richard Socher

* We use empirical tools of mode connectivity and SVCCA to investigate neural network training heuristics of learning rate restarts, warmup and knowledge distillation. arXiv admin note: text overlap with arXiv:1806.06977 

Identifying Generalization Properties in Neural Networks

Sep 19, 2018
Huan Wang, Nitish Shirish Keskar, Caiming Xiong, Richard Socher

* 23 pages 

The Natural Language Decathlon: Multitask Learning as Question Answering

Jun 20, 2018
Bryan McCann, Nitish Shirish Keskar, Caiming Xiong, Richard Socher


Using Mode Connectivity for Loss Landscape Analysis

Jun 18, 2018
Akhilesh Gotmare, Nitish Shirish Keskar, Caiming Xiong, Richard Socher

* Accepted as a workshop paper at ICML's Workshop on Modern Trends in Nonconvex Optimization for Machine Learning, 2018 

An Analysis of Neural Language Modeling at Multiple Scales

Mar 22, 2018
Stephen Merity, Nitish Shirish Keskar, Richard Socher


Improving Generalization Performance by Switching from Adam to SGD

Dec 20, 2017
Nitish Shirish Keskar, Richard Socher


Weighted Transformer Network for Machine Translation

Nov 06, 2017
Karim Ahmed, Nitish Shirish Keskar, Richard Socher


Regularizing and Optimizing LSTM Language Models

Aug 07, 2017
Stephen Merity, Nitish Shirish Keskar, Richard Socher


On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima

Feb 09, 2017
Nitish Shirish Keskar, Dheevatsa Mudigere, Jorge Nocedal, Mikhail Smelyanskiy, Ping Tak Peter Tang

* Accepted as a conference paper at ICLR 2017 

adaQN: An Adaptive Quasi-Newton Algorithm for Training RNNs

Feb 23, 2016
Nitish Shirish Keskar, Albert S. Berahas

