Yashar Mehdad

Attention or Convolution: Transformer Encoders in Audio Language Models for Inference Efficiency

Nov 05, 2023
Sungho Jeon, Ching-Feng Yeh, Hakan Inan, Wei-Ning Hsu, Rashi Rungta, Yashar Mehdad, Daniel Bikel

Effective Long-Context Scaling of Foundation Models

Sep 27, 2023
Wenhan Xiong, Jingyu Liu, Igor Molybog, Hejia Zhang, Prajjwal Bhargava, Rui Hou, Louis Martin, Rashi Rungta, Karthik Abinav Sankararaman, Barlas Oguz, Madian Khabsa, Han Fang, Yashar Mehdad, Sharan Narang, Kshitiz Malik, Angela Fan, Shruti Bhosale, Sergey Edunov, Mike Lewis, Sinong Wang, Hao Ma

LLM-QAT: Data-Free Quantization Aware Training for Large Language Models

May 29, 2023
Zechun Liu, Barlas Oguz, Changsheng Zhao, Ernie Chang, Pierre Stock, Yashar Mehdad, Yangyang Shi, Raghuraman Krishnamoorthi, Vikas Chandra

VideoOFA: Two-Stage Pre-Training for Video-to-Text Generation

May 04, 2023
Xilun Chen, Lili Yu, Wenhan Xiong, Barlas Oğuz, Yashar Mehdad, Wen-tau Yih

How to Train Your DRAGON: Diverse Augmentation Towards Generalizable Dense Retrieval

Feb 15, 2023
Sheng-Chieh Lin, Akari Asai, Minghan Li, Barlas Oguz, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen

STRUDEL: Structured Dialogue Summarization for Dialogue Comprehension

Dec 24, 2022
Borui Wang, Chengcheng Feng, Arjun Nair, Madelyn Mao, Jai Desai, Asli Celikyilmaz, Haoran Li, Yashar Mehdad, Dragomir Radev

Improving Faithfulness of Abstractive Summarization by Controlling Confounding Effect of Irrelevant Sentences

Dec 19, 2022
Asish Ghoshal, Arash Einolghozati, Ankit Arun, Haoran Li, Lili Yu, Yashar Mehdad, Scott Wen-tau Yih, Asli Celikyilmaz

CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval

Nov 18, 2022
Minghan Li, Sheng-Chieh Lin, Barlas Oguz, Asish Ghoshal, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen

Bridging the Training-Inference Gap for Dense Phrase Retrieval

Oct 25, 2022
Gyuwan Kim, Jinhyuk Lee, Barlas Oguz, Wenhan Xiong, Yizhe Zhang, Yashar Mehdad, William Yang Wang

Structured Summarization: Unified Text Segmentation and Segment Labeling as a Generation Task

Sep 28, 2022
Hakan Inan, Rashi Rungta, Yashar Mehdad
