"speech recognition": models, code, and papers

Multilingual self-supervised speech representations improve the speech recognition of low-resource African languages with codeswitching

Nov 25, 2023
Tolúlopé Ògúnrèmí, Christopher D. Manning, Dan Jurafsky

Via arXiv

Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech Recognition

Aug 03, 2023
Shaoshi Ling, Yuxuan Hu, Shuangbei Qian, Guoli Ye, Yao Qian, Yifan Gong, Ed Lin, Michael Zeng

Via arXiv

Bring the Noise: Introducing Noise Robustness to Pretrained Automatic Speech Recognition

Sep 05, 2023
Patrick Eickhoff, Matthias Möller, Theresa Pekarek Rosin, Johannes Twiefel, Stefan Wermter

Via arXiv

Whisper-MCE: Whisper Model Finetuned for Better Performance with Mixed Languages

Oct 27, 2023
Peng Xie, XingYuan Liu, ZiWei Chen, Kani Chen, Yang Wang

Via arXiv

Soft Random Sampling: A Theoretical and Empirical Analysis

Nov 21, 2023
Xiaodong Cui, Ashish Mittal, Songtao Lu, Wei Zhang, George Saon, Brian Kingsbury

Via arXiv

Federated Representation Learning for Automatic Speech Recognition

Aug 03, 2023
Guruprasad V Ramesh, Gopinath Chennupati, Milind Rao, Anit Kumar Sahu, Ariya Rastrow, Jasha Droppo

Via arXiv

Language-Routing Mixture of Experts for Multilingual and Code-Switching Speech Recognition

Jul 14, 2023
Wenxuan Wang, Guodong Ma, Yuke Li, Binbin Du

Via arXiv

Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation

Sep 04, 2023
Jiaxu Zhu, Weinan Tong, Yaoxun Xu, Changhe Song, Zhiyong Wu, Zhao You, Dan Su, Dong Yu, Helen Meng

Via arXiv

Boosting Norwegian Automatic Speech Recognition

Jul 04, 2023
Javier de la Rosa, Rolv-Arild Braaten, Per Egil Kummervold, Freddy Wetjen, Svein Arne Brygfjeld

Via arXiv

Harnessing the Zero-Shot Power of Instruction-Tuned Large Language Model in End-to-End Speech Recognition

Sep 19, 2023
Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi

Via arXiv