Alert button

"speech recognition": models, code, and papers
Alert button

Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition

Add code
Bookmark button
Alert button
Feb 02, 2023
HoLam Chung, Junan Li, Pengfei Liu1, Wai-Kim Leung, Xixin Wu, Helen Meng

Figure 1 for Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition
Figure 2 for Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition
Figure 3 for Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition
Figure 4 for Improving Rare Words Recognition through Homophone Extension and Unified Writing for Low-resource Cantonese Speech Recognition
Viaarxiv icon

A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset

Add code
Bookmark button
Alert button
Jan 21, 2023
Javad Peymanfard, Samin Heydarian, Ali Lashini, Hossein Zeinali, Mohammad Reza Mohammadi, Nasser Mozayani

Figure 1 for A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset
Figure 2 for A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset
Figure 3 for A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset
Figure 4 for A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset
Viaarxiv icon

LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge

Oct 17, 2022
Yan Jia, Mi Hong, Jingyu Hou, Kailong Ren, Sifan Ma, Jin Wang, Fangzhen Peng, Yinglin Ji, Lin Yang, Junjie Wang

Figure 1 for LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge
Figure 2 for LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge
Figure 3 for LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge
Figure 4 for LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge
Viaarxiv icon

Clinical BERTScore: An Improved Measure of Automatic Speech Recognition Performance in Clinical Settings

Mar 13, 2023
Joel Shor, Ruyue Agnes Bi, Subhashini Venugopalan, Steven Ibara, Roman Goldenberg, Ehud Rivlin

Figure 1 for Clinical BERTScore: An Improved Measure of Automatic Speech Recognition Performance in Clinical Settings
Figure 2 for Clinical BERTScore: An Improved Measure of Automatic Speech Recognition Performance in Clinical Settings
Figure 3 for Clinical BERTScore: An Improved Measure of Automatic Speech Recognition Performance in Clinical Settings
Viaarxiv icon

PAMP: A unified framework boosting low resource automatic speech recognition

Add code
Bookmark button
Alert button
Feb 05, 2023
Zeping Min, Qian Ge, Zhong Li, Weinan E

Figure 1 for PAMP: A unified framework boosting low resource automatic speech recognition
Figure 2 for PAMP: A unified framework boosting low resource automatic speech recognition
Figure 3 for PAMP: A unified framework boosting low resource automatic speech recognition
Figure 4 for PAMP: A unified framework boosting low resource automatic speech recognition
Viaarxiv icon

Sparks of Large Audio Models: A Survey and Outlook

Add code
Bookmark button
Alert button
Aug 24, 2023
Siddique Latif, Moazzam Shoukat, Fahad Shamshad, Muhammad Usama, Heriberto Cuayáhuitl, Björn W. Schuller

Figure 1 for Sparks of Large Audio Models: A Survey and Outlook
Figure 2 for Sparks of Large Audio Models: A Survey and Outlook
Figure 3 for Sparks of Large Audio Models: A Survey and Outlook
Figure 4 for Sparks of Large Audio Models: A Survey and Outlook
Viaarxiv icon

End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations

Aug 15, 2023
Bolaji Yusuf, Jan Cernocky, Murat Saraclar

Viaarxiv icon

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision

Apr 03, 2023
Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jáchym Kolář, Stavros Petridis, Maja Pantic, Christian Fuegen

Figure 1 for SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Figure 2 for SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Figure 3 for SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Figure 4 for SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Viaarxiv icon

Cross-Corpus Multilingual Speech Emotion Recognition: Amharic vs. Other Languages

Add code
Bookmark button
Alert button
Jul 20, 2023
Ephrem Afele Retta, Richard Sutcliffe, Jabar Mahmood, Michael Abebe Berwo, Eiad Almekhlafi, Sajjad Ahmed Khan, Shehzad Ashraf Chaudhry, Mustafa Mhamed, Jun Feng

Figure 1 for Cross-Corpus Multilingual Speech Emotion Recognition: Amharic vs. Other Languages
Figure 2 for Cross-Corpus Multilingual Speech Emotion Recognition: Amharic vs. Other Languages
Figure 3 for Cross-Corpus Multilingual Speech Emotion Recognition: Amharic vs. Other Languages
Figure 4 for Cross-Corpus Multilingual Speech Emotion Recognition: Amharic vs. Other Languages
Viaarxiv icon

Decoupled Structure for Improved Adaptability of End-to-End Models

Aug 25, 2023
Keqi Deng, Philip C. Woodland

Viaarxiv icon