Alert button

"speech recognition": models, code, and papers
Alert button

Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion Recognition

Add code
Bookmark button
Alert button
Jun 26, 2023
Samuel Cahyawijaya, Holy Lovenia, Willy Chung, Rita Frieske, Zihan Liu, Pascale Fung

Figure 1 for Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion Recognition
Figure 2 for Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion Recognition
Figure 3 for Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion Recognition
Figure 4 for Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion Recognition
Viaarxiv icon

Active Learning for Classifying 2D Grid-Based Level Completability

Add code
Bookmark button
Alert button
Sep 08, 2023
Mahsa Bazzaz, Seth Cooper

Figure 1 for Active Learning for Classifying 2D Grid-Based Level Completability
Figure 2 for Active Learning for Classifying 2D Grid-Based Level Completability
Figure 3 for Active Learning for Classifying 2D Grid-Based Level Completability
Figure 4 for Active Learning for Classifying 2D Grid-Based Level Completability
Viaarxiv icon

Prompting Audios Using Acoustic Properties For Emotion Representation

Oct 05, 2023
Hira Dhamyal, Benjamin Elizalde, Soham Deshmukh, Huaming Wang, Bhiksha Raj, Rita Singh

Figure 1 for Prompting Audios Using Acoustic Properties For Emotion Representation
Figure 2 for Prompting Audios Using Acoustic Properties For Emotion Representation
Figure 3 for Prompting Audios Using Acoustic Properties For Emotion Representation
Figure 4 for Prompting Audios Using Acoustic Properties For Emotion Representation
Viaarxiv icon

ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing

Add code
Bookmark button
Alert button
Oct 28, 2023
Quoc-Nam Nguyen, Thang Chau Phan, Duc-Vu Nguyen, Kiet Van Nguyen

Viaarxiv icon

Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization

Apr 27, 2023
Hamza Kheddar, Yassine Himeur, Somaya Al-Maadeed, Abbes Amira, Faycal Bensaali

Figure 1 for Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization
Figure 2 for Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization
Figure 3 for Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization
Figure 4 for Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization
Viaarxiv icon

AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning

Sep 04, 2023
Yi-Cheng Wang, Tzu-Ting Yang, Hsin-Wei Wang, Bi-Cheng Yan, Berlin Chen

Figure 1 for AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning
Figure 2 for AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning
Figure 3 for AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning
Figure 4 for AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning
Viaarxiv icon

Pre-Finetuning for Few-Shot Emotional Speech Recognition

Add code
Bookmark button
Alert button
Feb 28, 2023
Maximillian Chen, Zhou Yu

Figure 1 for Pre-Finetuning for Few-Shot Emotional Speech Recognition
Figure 2 for Pre-Finetuning for Few-Shot Emotional Speech Recognition
Figure 3 for Pre-Finetuning for Few-Shot Emotional Speech Recognition
Figure 4 for Pre-Finetuning for Few-Shot Emotional Speech Recognition
Viaarxiv icon

Effect of Attention and Self-Supervised Speech Embeddings on Non-Semantic Speech Tasks

Add code
Bookmark button
Alert button
Aug 30, 2023
Payal Mohapatra, Akash Pandey, Yueyuan Sui, Qi Zhu

Viaarxiv icon

End-to-End Automatic Speech Recognition model for the Sudanese Dialect

Dec 21, 2022
Ayman Mansour, Wafaa F. Mukhtar

Figure 1 for End-to-End Automatic Speech Recognition model for the Sudanese Dialect
Figure 2 for End-to-End Automatic Speech Recognition model for the Sudanese Dialect
Figure 3 for End-to-End Automatic Speech Recognition model for the Sudanese Dialect
Figure 4 for End-to-End Automatic Speech Recognition model for the Sudanese Dialect
Viaarxiv icon

Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition

Add code
Bookmark button
Alert button
Nov 22, 2022
Injy Hamed, Amir Hussein, Oumnia Chellah, Shammur Chowdhury, Hamdy Mubarak, Sunayana Sitaram, Nizar Habash, Ahmed Ali

Figure 1 for Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition
Figure 2 for Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition
Figure 3 for Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition
Figure 4 for Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition
Viaarxiv icon