
"speech recognition": models, code, and papers

RescoreBERT: Discriminative Speech Recognition Rescoring with BERT

Feb 07, 2022
Liyan Xu, Yile Gu, Jari Kolehmainen, Haidar Khan, Ankur Gandhe, Ariya Rastrow, Andreas Stolcke, Ivan Bulyko


QuickVC: Many-to-any Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion

Feb 20, 2023
Houjian Guo, Chaoran Liu, Carlos Toshinori Ishi, Hiroshi Ishiguro


EmoSpeech: Guiding FastSpeech2 Towards Emotional Text to Speech

Jun 28, 2023
Daria Diatlova, Vitaly Shutov


Revisiting joint decoding based multi-talker speech recognition with DNN acoustic model

Oct 31, 2021
Martin Kocour, Kateřina Žmolíková, Lucas Ondel, Ján Švec, Marc Delcroix, Tsubasa Ochiai, Lukáš Burget, Jan Černocký


Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge

Oct 13, 2022
Shuhao Deng, Chengfei Li, Jinfeng Bai, Qingqing Zhang, Wei-Qiang Zhang, Runyan Yang, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan


Mel Frequency Spectral Domain Defenses against Adversarial Attacks on Speech Recognition Systems

Mar 29, 2022
Nicholas Mehlman, Anirudh Sreeram, Raghuveer Peri, Shrikanth Narayanan


End-to-End Speech Recognition With Joint Dereverberation Of Sub-Band Autoregressive Envelopes

Aug 09, 2021
Rohit Kumar, Anurenjan Purushothaman, Anirudh Sreeram, Sriram Ganapathy


Word Order Does Not Matter For Speech Recognition

Oct 18, 2021
Vineel Pratap, Qiantong Xu, Tatiana Likhomanenko, Gabriel Synnaeve, Ronan Collobert


OkwuGbé: End-to-End Speech Recognition for Fon and Igbo

Mar 16, 2021
Bonaventure F. P. Dossou, Chris C. Emezue
