Alert button

"Text": models, code, and papers
Alert button

Efficient Machine Translation Domain Adaptation

Apr 26, 2022
Pedro Henrique Martins, Zita Marinho, André F. T. Martins

Figure 1 for Efficient Machine Translation Domain Adaptation
Figure 2 for Efficient Machine Translation Domain Adaptation
Figure 3 for Efficient Machine Translation Domain Adaptation
Figure 4 for Efficient Machine Translation Domain Adaptation
Viaarxiv icon

Topic-Aware Abstractive Text Summarization

Oct 20, 2020
Chujie Zheng, Kunpeng Zhang, Harry Jiannan Wang, Ling Fan

Figure 1 for Topic-Aware Abstractive Text Summarization
Figure 2 for Topic-Aware Abstractive Text Summarization
Figure 3 for Topic-Aware Abstractive Text Summarization
Figure 4 for Topic-Aware Abstractive Text Summarization
Viaarxiv icon

A Rich Recipe Representation as Plan to Support Expressive Multi Modal Queries on Recipe Content and Preparation Process

Mar 31, 2022
Vishal Pallagani, Priyadharsini Ramamurthy, Vedant Khandelwal, Revathy Venkataramanan, Kausik Lakkaraju, Sathyanarayanan N. Aakur, Biplav Srivastava

Figure 1 for A Rich Recipe Representation as Plan to Support Expressive Multi Modal Queries on Recipe Content and Preparation Process
Figure 2 for A Rich Recipe Representation as Plan to Support Expressive Multi Modal Queries on Recipe Content and Preparation Process
Figure 3 for A Rich Recipe Representation as Plan to Support Expressive Multi Modal Queries on Recipe Content and Preparation Process
Figure 4 for A Rich Recipe Representation as Plan to Support Expressive Multi Modal Queries on Recipe Content and Preparation Process
Viaarxiv icon

MMER: Multimodal Multi-task learning for Emotion Recognition in Spoken Utterances

Apr 01, 2022
Harshvardhan Srivastava, Sreyan Ghosh, S. Umesh

Figure 1 for MMER: Multimodal Multi-task learning for Emotion Recognition in Spoken Utterances
Figure 2 for MMER: Multimodal Multi-task learning for Emotion Recognition in Spoken Utterances
Viaarxiv icon

Adaptive Text Recognition through Visual Matching

Sep 14, 2020
Chuhan Zhang, Ankush Gupta, Andrew Zisserman

Figure 1 for Adaptive Text Recognition through Visual Matching
Figure 2 for Adaptive Text Recognition through Visual Matching
Figure 3 for Adaptive Text Recognition through Visual Matching
Figure 4 for Adaptive Text Recognition through Visual Matching
Viaarxiv icon

QURIOUS: Question Generation Pretraining for Text Generation

Apr 23, 2020
Shashi Narayan, Gonçalo Simoes, Ji Ma, Hannah Craighead, Ryan Mcdonald

Figure 1 for QURIOUS: Question Generation Pretraining for Text Generation
Figure 2 for QURIOUS: Question Generation Pretraining for Text Generation
Figure 3 for QURIOUS: Question Generation Pretraining for Text Generation
Figure 4 for QURIOUS: Question Generation Pretraining for Text Generation
Viaarxiv icon

SE-DAE: Style-Enhanced Denoising Auto-Encoder for Unsupervised Text Style Transfer

Apr 27, 2021
Jicheng Li, Yang Feng, Jiao Ou

Figure 1 for SE-DAE: Style-Enhanced Denoising Auto-Encoder for Unsupervised Text Style Transfer
Figure 2 for SE-DAE: Style-Enhanced Denoising Auto-Encoder for Unsupervised Text Style Transfer
Figure 3 for SE-DAE: Style-Enhanced Denoising Auto-Encoder for Unsupervised Text Style Transfer
Figure 4 for SE-DAE: Style-Enhanced Denoising Auto-Encoder for Unsupervised Text Style Transfer
Viaarxiv icon

CM3: A Causal Masked Multimodal Model of the Internet

Jan 19, 2022
Armen Aghajanyan, Bernie Huang, Candace Ross, Vladimir Karpukhin, Hu Xu, Naman Goyal, Dmytro Okhonko, Mandar Joshi, Gargi Ghosh, Mike Lewis, Luke Zettlemoyer

Figure 1 for CM3: A Causal Masked Multimodal Model of the Internet
Figure 2 for CM3: A Causal Masked Multimodal Model of the Internet
Figure 3 for CM3: A Causal Masked Multimodal Model of the Internet
Figure 4 for CM3: A Causal Masked Multimodal Model of the Internet
Viaarxiv icon

VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text

Apr 22, 2021
Hassan Akbari, Linagzhe Yuan, Rui Qian, Wei-Hong Chuang, Shih-Fu Chang, Yin Cui, Boqing Gong

Figure 1 for VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Figure 2 for VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Figure 3 for VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Figure 4 for VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
Viaarxiv icon

Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems

Oct 08, 2020
Yinghui Huang, Hong-Kwang Kuo, Samuel Thomas, Zvi Kons, Kartik Audhkhasi, Brian Kingsbury, Ron Hoory, Michael Picheny

Figure 1 for Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Figure 2 for Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Figure 3 for Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Figure 4 for Leveraging Unpaired Text Data for Training End-to-End Speech-to-Intent Systems
Viaarxiv icon