Srinivasan Iyer

Instruction-tuned Language Models are Better Knowledge Learners

Feb 20, 2024
Zhengbao Jiang, Zhiqing Sun, Weijia Shi, Pedro Rodriguez, Chunting Zhou, Graham Neubig, Xi Victoria Lin, Wen-tau Yih, Srinivasan Iyer

Via arXiv

OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization

Dec 28, 2022
Srinivasan Iyer, Xi Victoria Lin, Ramakanth Pasunuru, Todor Mihaylov, Daniel Simig, Ping Yu, Kurt Shuster, Tianlu Wang, Qing Liu, Punit Singh Koura, Xian Li, Brian O'Horo, Gabriel Pereyra, Jeff Wang, Christopher Dewan, Asli Celikyilmaz, Luke Zettlemoyer, Ves Stoyanov

Via arXiv

Complementary Explanations for Effective In-Context Learning

Nov 25, 2022
Xi Ye, Srinivasan Iyer, Asli Celikyilmaz, Ves Stoyanov, Greg Durrett, Ramakanth Pasunuru

Via arXiv

Efficient Large Scale Language Modeling with Mixtures of Experts

Dec 20, 2021
Mikel Artetxe, Shruti Bhosale, Naman Goyal, Todor Mihaylov, Myle Ott, Sam Shleifer, Xi Victoria Lin, Jingfei Du, Srinivasan Iyer, Ramakanth Pasunuru, Giri Anantharaman, Xian Li, Shuohui Chen, Halil Akin, Mandeep Baines, Louis Martin, Xing Zhou, Punit Singh Koura, Brian O'Horo, Jeff Wang, Luke Zettlemoyer, Mona Diab, Zornitsa Kozareva, Ves Stoyanov

Via arXiv

Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs

Nov 26, 2021
Peter Hase, Mona Diab, Asli Celikyilmaz, Xian Li, Zornitsa Kozareva, Veselin Stoyanov, Mohit Bansal, Srinivasan Iyer

Via arXiv

EASE: Extractive-Abstractive Summarization with Explanations

May 14, 2021
Haoran Li, Arash Einolghozati, Srinivasan Iyer, Bhargavi Paranjape, Yashar Mehdad, Sonal Gupta, Marjan Ghazvininejad

Via arXiv

FiD-Ex: Improving Sequence-to-Sequence Models for Extractive Rationale Generation

Dec 31, 2020
Kushal Lakhotia, Bhargavi Paranjape, Asish Ghoshal, Wen-tau Yih, Yashar Mehdad, Srinivasan Iyer

Via arXiv

Human Evaluation of Spoken vs. Visual Explanations for Open-Domain QA

Dec 30, 2020
Ana Valeria Gonzalez, Gagan Bansal, Angela Fan, Robin Jia, Yashar Mehdad, Srinivasan Iyer

Via arXiv

RECONSIDER: Re-Ranking using Span-Focused Cross-Attention for Open Domain Question Answering

Oct 21, 2020
Srinivasan Iyer, Sewon Min, Yashar Mehdad, Wen-tau Yih

Via arXiv

Efficient One-Pass End-to-End Entity Linking for Questions

Oct 06, 2020
Belinda Z. Li, Sewon Min, Srinivasan Iyer, Yashar Mehdad, Wen-tau Yih

Via arXiv