"Text": models, code, and papers

EXIF as Language: Learning Cross-Modal Associations Between Images and Camera Metadata

Jan 11, 2023
Chenhao Zheng, Ayush Shrivastava, Andrew Owens

A novel multimodal dynamic fusion network for disfluency detection in spoken utterances

Nov 27, 2022
Sreyan Ghosh, Utkarsh Tyagi, Sonal Kumar, Manan Suri, Rajiv Ratn Shah

Exploring Vision Transformers as Diffusion Learners

Dec 28, 2022
He Cao, Jianan Wang, Tianhe Ren, Xianbiao Qi, Yihao Chen, Yuan Yao, Lei Zhang

Audio-text Retrieval in Context

Mar 29, 2022
Siyu Lou, Xuenan Xu, Mengyue Wu, Kai Yu

Quantum Recurrent Neural Networks for Sequential Learning

Feb 07, 2023
Yanan Li, Zhimin Wang, Rongbing Han, Shangshang Shi, Jiaxin Li, Ruimin Shang, Haiyong Zheng, Guoqiang Zhong, Yongjian Gu

MTTM: Metamorphic Testing for Textual Content Moderation Software

Feb 11, 2023
Wenxuan Wang, Jen-tse Huang, Weibin Wu, Jianping Zhang, Yizhan Huang, Shuqing Li, Pinjia He, Michael Lyu

Languages You Know Influence Those You Learn: Impact of Language Characteristics on Multi-Lingual Text-to-Text Transfer

Dec 04, 2022
Benjamin Muller, Deepanshu Gupta, Siddharth Patwardhan, Jean-Philippe Fauconnier, David Vandyke, Sachin Agarwal

Uniform Complexity for Text Generation

Apr 11, 2022
Joseph Marvin Imperial

CoCa: Contrastive Captioners are Image-Text Foundation Models

May 04, 2022
Jiahui Yu, Zirui Wang, Vijay Vasudevan, Legg Yeung, Mojtaba Seyedhosseini, Yonghui Wu

E^2VTS: Energy-Efficient Video Text Spotting from Unmanned Aerial Vehicles

Jun 05, 2022
Zhenyu Hu, Zhenyu Wu, Pengcheng Pi, Yunhe Xue, Jiayi Shen, Jianchao Tan, Xiangru Lian, Zhangyang Wang, Ji Liu
