Alert button

"speech recognition": models, code, and papers
Alert button

Record Deduplication for Entity Distribution Modeling in ASR Transcripts

Jun 09, 2023
Tianyu Huang, Chung Hoon Hong, Carl Wivagg, Kanna Shimizu

Figure 1 for Record Deduplication for Entity Distribution Modeling in ASR Transcripts
Figure 2 for Record Deduplication for Entity Distribution Modeling in ASR Transcripts
Figure 3 for Record Deduplication for Entity Distribution Modeling in ASR Transcripts
Figure 4 for Record Deduplication for Entity Distribution Modeling in ASR Transcripts
Viaarxiv icon

Alzheimer Disease Classification through ASR-based Transcriptions: Exploring the Impact of Punctuation and Pauses

Jun 06, 2023
Lucía Gómez-Zaragozá, Simone Wills, Cristian Tejedor-Garcia, Javier Marín-Morales, Mariano Alcañiz, Helmer Strik

Figure 1 for Alzheimer Disease Classification through ASR-based Transcriptions: Exploring the Impact of Punctuation and Pauses
Figure 2 for Alzheimer Disease Classification through ASR-based Transcriptions: Exploring the Impact of Punctuation and Pauses
Viaarxiv icon

Automatic Speech Recognition of Low-Resource Languages Based on Chukchi

Add code
Bookmark button
Alert button
Oct 11, 2022
Anastasia Safonova, Tatiana Yudina, Emil Nadimanov, Cydnie Davenport

Figure 1 for Automatic Speech Recognition of Low-Resource Languages Based on Chukchi
Figure 2 for Automatic Speech Recognition of Low-Resource Languages Based on Chukchi
Figure 3 for Automatic Speech Recognition of Low-Resource Languages Based on Chukchi
Figure 4 for Automatic Speech Recognition of Low-Resource Languages Based on Chukchi
Viaarxiv icon

RASR2: The RWTH ASR Toolkit for Generic Sequence-to-sequence Speech Recognition

Add code
Bookmark button
Alert button
May 28, 2023
Wei Zhou, Eugen Beck, Simon Berger, Ralf Schlüter, Hermann Ney

Figure 1 for RASR2: The RWTH ASR Toolkit for Generic Sequence-to-sequence Speech Recognition
Figure 2 for RASR2: The RWTH ASR Toolkit for Generic Sequence-to-sequence Speech Recognition
Figure 3 for RASR2: The RWTH ASR Toolkit for Generic Sequence-to-sequence Speech Recognition
Viaarxiv icon

A Highly Adaptive Acoustic Model for Accurate Multi-Dialect Speech Recognition

May 06, 2022
Sanghyun Yoo, Inchul Song, Yoshua Bengio

Figure 1 for A Highly Adaptive Acoustic Model for Accurate Multi-Dialect Speech Recognition
Figure 2 for A Highly Adaptive Acoustic Model for Accurate Multi-Dialect Speech Recognition
Figure 3 for A Highly Adaptive Acoustic Model for Accurate Multi-Dialect Speech Recognition
Figure 4 for A Highly Adaptive Acoustic Model for Accurate Multi-Dialect Speech Recognition
Viaarxiv icon

AfriNames: Most ASR models "butcher" African Names

Jun 02, 2023
Tobi Olatunji, Tejumade Afonja, Bonaventure F. P. Dossou, Atnafu Lambebo Tonja, Chris Chinenye Emezue, Amina Mardiyyah Rufai, Sahib Singh

Figure 1 for AfriNames: Most ASR models "butcher" African Names
Figure 2 for AfriNames: Most ASR models "butcher" African Names
Figure 3 for AfriNames: Most ASR models "butcher" African Names
Figure 4 for AfriNames: Most ASR models "butcher" African Names
Viaarxiv icon

Adaptive Activation Network For Low Resource Multilingual Speech Recognition

May 28, 2022
Jian Luo, Jianzong Wang, Ning Cheng, Zhenpeng Zheng, Jing Xiao

Figure 1 for Adaptive Activation Network For Low Resource Multilingual Speech Recognition
Figure 2 for Adaptive Activation Network For Low Resource Multilingual Speech Recognition
Figure 3 for Adaptive Activation Network For Low Resource Multilingual Speech Recognition
Figure 4 for Adaptive Activation Network For Low Resource Multilingual Speech Recognition
Viaarxiv icon

Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition

Jul 12, 2022
Rodolfo Zevallos, Luis Camacho, Nelsi Melgarejo

Figure 1 for Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition
Figure 2 for Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition
Figure 3 for Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition
Figure 4 for Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition
Viaarxiv icon

Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data

Add code
Bookmark button
Alert button
Mar 29, 2022
Chen Chen, Nana Hou, Yuchen Hu, Shashank Shirol, Eng Siong Chng

Figure 1 for Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data
Figure 2 for Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data
Figure 3 for Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data
Figure 4 for Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data
Viaarxiv icon

Adversarial Training For Low-Resource Disfluency Correction

Add code
Bookmark button
Alert button
Jun 10, 2023
Vineet Bhat, Preethi Jyothi, Pushpak Bhattacharyya

Figure 1 for Adversarial Training For Low-Resource Disfluency Correction
Figure 2 for Adversarial Training For Low-Resource Disfluency Correction
Figure 3 for Adversarial Training For Low-Resource Disfluency Correction
Figure 4 for Adversarial Training For Low-Resource Disfluency Correction
Viaarxiv icon