Alert button

"Text": models, code, and papers
Alert button

Bootstrapping Vision-Language Learning with Decoupled Language Pre-training

Jul 13, 2023
Yiren Jian, Chongyang Gao, Soroush Vosoughi

Figure 1 for Bootstrapping Vision-Language Learning with Decoupled Language Pre-training
Figure 2 for Bootstrapping Vision-Language Learning with Decoupled Language Pre-training
Figure 3 for Bootstrapping Vision-Language Learning with Decoupled Language Pre-training
Figure 4 for Bootstrapping Vision-Language Learning with Decoupled Language Pre-training
Viaarxiv icon

ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation

May 24, 2023
Chenyang Le, Yao Qian, Long Zhou, Shujie Liu, Michael Zeng, Xuedong Huang

Figure 1 for ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation
Figure 2 for ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation
Figure 3 for ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation
Figure 4 for ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation
Viaarxiv icon

Masked Audio Text Encoders are Effective Multi-Modal Rescorers

May 11, 2023
Jinglun Cai, Monica Sunkara, Xilai Li, Anshu Bhatia, Xiao Pan, Sravan Bodapati

Figure 1 for Masked Audio Text Encoders are Effective Multi-Modal Rescorers
Figure 2 for Masked Audio Text Encoders are Effective Multi-Modal Rescorers
Figure 3 for Masked Audio Text Encoders are Effective Multi-Modal Rescorers
Figure 4 for Masked Audio Text Encoders are Effective Multi-Modal Rescorers
Viaarxiv icon

A Call for Standardization and Validation of Text Style Transfer Evaluation

Jun 01, 2023
Phil Ostheimer, Mayank Nagda, Marius Kloft, Sophie Fellenz

Figure 1 for A Call for Standardization and Validation of Text Style Transfer Evaluation
Figure 2 for A Call for Standardization and Validation of Text Style Transfer Evaluation
Figure 3 for A Call for Standardization and Validation of Text Style Transfer Evaluation
Figure 4 for A Call for Standardization and Validation of Text Style Transfer Evaluation
Viaarxiv icon

ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval

May 28, 2023
Jiapeng Wang, Chengyu Wang, Xiaodan Wang, Jun Huang, Lianwen Jin

Figure 1 for ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval
Figure 2 for ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval
Figure 3 for ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval
Figure 4 for ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval
Viaarxiv icon

Controlling keywords and their positions in text generation

Apr 19, 2023
Yuichi Sasazawa, Terufumi Morishita, Hiroaki Ozaki, Osamu Imaichi, Yasuhiro Sogawa

Figure 1 for Controlling keywords and their positions in text generation
Figure 2 for Controlling keywords and their positions in text generation
Figure 3 for Controlling keywords and their positions in text generation
Figure 4 for Controlling keywords and their positions in text generation
Viaarxiv icon

AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation

May 22, 2023
Guy Yariv, Itai Gat, Lior Wolf, Yossi Adi, Idan Schwartz

Figure 1 for AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation
Figure 2 for AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation
Figure 3 for AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation
Figure 4 for AudioToken: Adaptation of Text-Conditioned Diffusion Models for Audio-to-Image Generation
Viaarxiv icon

ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text Games

May 24, 2023
Ruoyao Wang, Graham Todd, Eric Yuan, Ziang Xiao, Marc-Alexandre Côté, Peter Jansen

Figure 1 for ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text Games
Figure 2 for ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text Games
Figure 3 for ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text Games
Figure 4 for ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text Games
Viaarxiv icon

UP-DP: Unsupervised Prompt Learning for Data Pre-Selection with Vision-Language Models

Jul 20, 2023
Xin Li, Sima Behpour, Thang Doan, Wenbin He, Liang Gou, Liu Ren

Figure 1 for UP-DP: Unsupervised Prompt Learning for Data Pre-Selection with Vision-Language Models
Figure 2 for UP-DP: Unsupervised Prompt Learning for Data Pre-Selection with Vision-Language Models
Figure 3 for UP-DP: Unsupervised Prompt Learning for Data Pre-Selection with Vision-Language Models
Figure 4 for UP-DP: Unsupervised Prompt Learning for Data Pre-Selection with Vision-Language Models
Viaarxiv icon

Text-Blueprint: An Interactive Platform for Plan-based Conditional Generation

Apr 28, 2023
Fantine Huot, Joshua Maynez, Shashi Narayan, Reinald Kim Amplayo, Kuzman Ganchev, Annie Louis, Anders Sandholm, Dipanjan Das, Mirella Lapata

Figure 1 for Text-Blueprint: An Interactive Platform for Plan-based Conditional Generation
Figure 2 for Text-Blueprint: An Interactive Platform for Plan-based Conditional Generation
Figure 3 for Text-Blueprint: An Interactive Platform for Plan-based Conditional Generation
Figure 4 for Text-Blueprint: An Interactive Platform for Plan-based Conditional Generation
Viaarxiv icon