"speech recognition": models, code, and papers

Language-Routing Mixture of Experts for Multilingual and Code-Switching Speech Recognition

Jul 14, 2023
Wenxuan Wang, Guodong Ma, Yuke Li, Binbin Du

Federated Representation Learning for Automatic Speech Recognition

Aug 03, 2023
Guruprasad V Ramesh, Gopinath Chennupati, Milind Rao, Anit Kumar Sahu, Ariya Rastrow, Jasha Droppo

Soft Random Sampling: A Theoretical and Empirical Analysis

Nov 21, 2023
Xiaodong Cui, Ashish Mittal, Songtao Lu, Wei Zhang, George Saon, Brian Kingsbury

Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation

Sep 04, 2023
Jiaxu Zhu, Weinan Tong, Yaoxun Xu, Changhe Song, Zhiyong Wu, Zhao You, Dan Su, Dong Yu, Helen Meng

Boosting Norwegian Automatic Speech Recognition

Jul 04, 2023
Javier de la Rosa, Rolv-Arild Braaten, Per Egil Kummervold, Freddy Wetjen, Svein Arne Brygfjeld

Adaptation of Whisper models to child speech recognition

Jul 24, 2023
Rishabh Jain, Andrei Barcovschi, Mariam Yiwere, Peter Corcoran, Horia Cucu

Harnessing the Zero-Shot Power of Instruction-Tuned Large Language Model in End-to-End Speech Recognition

Sep 19, 2023
Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi

Optimized Tokenization for Transcribed Error Correction

Oct 16, 2023
Tomer Wullach, Shlomo E. Chazan

Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer

Nov 15, 2023
Jin Qiu, Lu Huang, Boyu Li, Jun Zhang, Lu Lu, Zejun Ma
