Alert button
Picture for Nitish Shirish Keskar

Nitish Shirish Keskar

Alert button

Global Capacity Measures for Deep ReLU Networks via Path Sampling

Add code
Bookmark button
Alert button
Oct 22, 2019
Ryan Theisen, Jason M. Klusowski, Huan Wang, Nitish Shirish Keskar, Caiming Xiong, Richard Socher

Figure 1 for Global Capacity Measures for Deep ReLU Networks via Path Sampling
Viaarxiv icon

CTRL: A Conditional Transformer Language Model for Controllable Generation

Add code
Bookmark button
Alert button
Sep 20, 2019
Nitish Shirish Keskar, Bryan McCann, Lav R. Varshney, Caiming Xiong, Richard Socher

Figure 1 for CTRL: A Conditional Transformer Language Model for Controllable Generation
Figure 2 for CTRL: A Conditional Transformer Language Model for Controllable Generation
Figure 3 for CTRL: A Conditional Transformer Language Model for Controllable Generation
Figure 4 for CTRL: A Conditional Transformer Language Model for Controllable Generation
Viaarxiv icon

Pretrained AI Models: Performativity, Mobility, and Change

Add code
Bookmark button
Alert button
Sep 07, 2019
Lav R. Varshney, Nitish Shirish Keskar, Richard Socher

Figure 1 for Pretrained AI Models: Performativity, Mobility, and Change
Figure 2 for Pretrained AI Models: Performativity, Mobility, and Change
Figure 3 for Pretrained AI Models: Performativity, Mobility, and Change
Viaarxiv icon

Neural Text Summarization: A Critical Evaluation

Add code
Bookmark button
Alert button
Aug 23, 2019
Wojciech Kryściński, Nitish Shirish Keskar, Bryan McCann, Caiming Xiong, Richard Socher

Figure 1 for Neural Text Summarization: A Critical Evaluation
Figure 2 for Neural Text Summarization: A Critical Evaluation
Figure 3 for Neural Text Summarization: A Critical Evaluation
Figure 4 for Neural Text Summarization: A Critical Evaluation
Viaarxiv icon

XLDA: Cross-Lingual Data Augmentation for Natural Language Inference and Question Answering

Add code
Bookmark button
Alert button
May 27, 2019
Jasdeep Singh, Bryan McCann, Nitish Shirish Keskar, Caiming Xiong, Richard Socher

Figure 1 for XLDA: Cross-Lingual Data Augmentation for Natural Language Inference and Question Answering
Figure 2 for XLDA: Cross-Lingual Data Augmentation for Natural Language Inference and Question Answering
Figure 3 for XLDA: Cross-Lingual Data Augmentation for Natural Language Inference and Question Answering
Figure 4 for XLDA: Cross-Lingual Data Augmentation for Natural Language Inference and Question Answering
Viaarxiv icon

Unifying Question Answering and Text Classification via Span Extraction

Add code
Bookmark button
Alert button
Apr 19, 2019
Nitish Shirish Keskar, Bryan McCann, Caiming Xiong, Richard Socher

Figure 1 for Unifying Question Answering and Text Classification via Span Extraction
Figure 2 for Unifying Question Answering and Text Classification via Span Extraction
Figure 3 for Unifying Question Answering and Text Classification via Span Extraction
Figure 4 for Unifying Question Answering and Text Classification via Span Extraction
Viaarxiv icon

Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering

Add code
Bookmark button
Alert button
Jan 03, 2019
Victor Zhong, Caiming Xiong, Nitish Shirish Keskar, Richard Socher

Figure 1 for Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering
Figure 2 for Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering
Figure 3 for Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering
Figure 4 for Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering
Viaarxiv icon

A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation

Add code
Bookmark button
Alert button
Oct 29, 2018
Akhilesh Gotmare, Nitish Shirish Keskar, Caiming Xiong, Richard Socher

Figure 1 for A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation
Figure 2 for A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation
Figure 3 for A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation
Figure 4 for A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation
Viaarxiv icon

Identifying Generalization Properties in Neural Networks

Add code
Bookmark button
Alert button
Sep 19, 2018
Huan Wang, Nitish Shirish Keskar, Caiming Xiong, Richard Socher

Figure 1 for Identifying Generalization Properties in Neural Networks
Figure 2 for Identifying Generalization Properties in Neural Networks
Figure 3 for Identifying Generalization Properties in Neural Networks
Figure 4 for Identifying Generalization Properties in Neural Networks
Viaarxiv icon