"Text": models, code, and papers

U-DiT TTS: U-Diffusion Vision Transformer for Text-to-Speech

May 22, 2023
Xin Jing, Yi Chang, Zijiang Yang, Jiangjian Xie, Andreas Triantafyllopoulos, Bjoern W. Schuller

SViTT: Temporal Learning of Sparse Video-Text Transformers

Apr 18, 2023
Yi Li, Kyle Min, Subarna Tripathi, Nuno Vasconcelos

SimpLex: a lexical text simplification architecture

Apr 14, 2023
Ciprian-Octavian Truică, Andrei-Ionut Stan, Elena-Simona Apostol

DisCo: Distilled Student Models Co-training for Semi-supervised Text Mining

May 20, 2023
Weifeng Jiang, Qianren Mao, Jianxin Li, Chenghua Lin, Weiyi Yang, Ting Deng, Zheng Wang

Taxi1500: A Multilingual Dataset for Text Classification in 1500 Languages

May 15, 2023
Chunlan Ma, Ayyoob ImaniGooghari, Haotian Ye, Ehsaneddin Asgari, Hinrich Schütze

Interpretable Style Transfer for Text-to-Speech with ControlVAE and Diffusion Bridge

Jun 07, 2023
Wenhao Guan, Tao Li, Yishuang Li, Hukai Huang, Qingyang Hong, Lin Li

Alzheimer's Disease Detection from Spontaneous Speech and Text: A review

Jul 19, 2023
Vrindha M. K., Geethu V., Anurenjan P. R., Deepak S., Sreeni K. G.

Is ChatGPT a Good Personality Recognizer? A Preliminary Study

Jul 08, 2023
Yu Ji, Wen Wu, Hong Zheng, Yi Hu, Xi Chen, Liang He

Actor-agnostic Multi-label Action Recognition with Multi-modal Query

Aug 08, 2023
Anindya Mondal, Sauradip Nag, Joaquin M Prada, Xiatian Zhu, Anjan Dutta

OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation

Aug 08, 2023
Dongyang Yu, Shihao Wang, Yuan Fang, Wangpeng An
