Alert button

"Text": models, code, and papers
Alert button

P$^3$OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection

Nov 02, 2022
Yanxin Long, Jianhua Han, Runhui Huang, Xu Hang, Yi Zhu, Chunjing Xu, Xiaodan Liang

Figure 1 for P$^3$OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection
Figure 2 for P$^3$OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection
Figure 3 for P$^3$OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection
Figure 4 for P$^3$OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection
Viaarxiv icon

Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?

Nov 25, 2022
Xuan Shi, Erica Cooper, Xin Wang, Junichi Yamagishi, Shrikanth Narayanan

Figure 1 for Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Figure 2 for Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Figure 3 for Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Figure 4 for Can Knowledge of End-to-End Text-to-Speech Models Improve Neural MIDI-to-Audio Synthesis Systems?
Viaarxiv icon

Using Auxiliary Tasks In Multimodal Fusion Of Wav2vec 2.0 And BERT For Multimodal Emotion Recognition

Feb 27, 2023
Dekai Sun, Yancheng He, Jiqing Han

Figure 1 for Using Auxiliary Tasks In Multimodal Fusion Of Wav2vec 2.0 And BERT For Multimodal Emotion Recognition
Figure 2 for Using Auxiliary Tasks In Multimodal Fusion Of Wav2vec 2.0 And BERT For Multimodal Emotion Recognition
Figure 3 for Using Auxiliary Tasks In Multimodal Fusion Of Wav2vec 2.0 And BERT For Multimodal Emotion Recognition
Figure 4 for Using Auxiliary Tasks In Multimodal Fusion Of Wav2vec 2.0 And BERT For Multimodal Emotion Recognition
Viaarxiv icon

Blind deblurring of hyperspectral document images

Mar 09, 2023
M. Ljubenovic, P. Guzzonato, G. Franceschin, A. Traviglia

Viaarxiv icon

MAC: A unified framework boosting low resource automatic speech recognition

Feb 15, 2023
Zeping Min, Qian Ge, Zhong Li, Weinan E

Figure 1 for MAC: A unified framework boosting low resource automatic speech recognition
Figure 2 for MAC: A unified framework boosting low resource automatic speech recognition
Figure 3 for MAC: A unified framework boosting low resource automatic speech recognition
Figure 4 for MAC: A unified framework boosting low resource automatic speech recognition
Viaarxiv icon

Class-Continuous Conditional Generative Neural Radiance Field

Jan 09, 2023
Jiwook Kim, Minhyeok Lee

Figure 1 for Class-Continuous Conditional Generative Neural Radiance Field
Figure 2 for Class-Continuous Conditional Generative Neural Radiance Field
Figure 3 for Class-Continuous Conditional Generative Neural Radiance Field
Figure 4 for Class-Continuous Conditional Generative Neural Radiance Field
Viaarxiv icon

Zero3D: Semantic-Driven Multi-Category 3D Shape Generation

Jan 31, 2023
Bo Han, Yitong Liu, Yixuan Shen

Figure 1 for Zero3D: Semantic-Driven Multi-Category 3D Shape Generation
Figure 2 for Zero3D: Semantic-Driven Multi-Category 3D Shape Generation
Figure 3 for Zero3D: Semantic-Driven Multi-Category 3D Shape Generation
Figure 4 for Zero3D: Semantic-Driven Multi-Category 3D Shape Generation
Viaarxiv icon

Pairwise Instance Relation Augmentation for Long-tailed Multi-label Text Classification

Nov 19, 2022
Lin Xiao, Pengyu Xu, Liping Jing, Xiangliang Zhang

Figure 1 for Pairwise Instance Relation Augmentation for Long-tailed Multi-label Text Classification
Figure 2 for Pairwise Instance Relation Augmentation for Long-tailed Multi-label Text Classification
Figure 3 for Pairwise Instance Relation Augmentation for Long-tailed Multi-label Text Classification
Figure 4 for Pairwise Instance Relation Augmentation for Long-tailed Multi-label Text Classification
Viaarxiv icon

DIALOG-22 RuATD Generated Text Detection

Jun 16, 2022
Narek Maloyan, Bulat Nutfullin, Eugene Ilyushin

Figure 1 for DIALOG-22 RuATD Generated Text Detection
Figure 2 for DIALOG-22 RuATD Generated Text Detection
Figure 3 for DIALOG-22 RuATD Generated Text Detection
Figure 4 for DIALOG-22 RuATD Generated Text Detection
Viaarxiv icon

Neural networks for learning personality traits from natural language

Feb 23, 2023
Giorgia Adorni

Viaarxiv icon