Alert button

"speech recognition": models, code, and papers
Alert button

Multilingual Speech Recognition using Knowledge Transfer across Learning Processes

Oct 15, 2021
Rimita Lahiri, Kenichi Kumatani, Eric Sun, Yao Qian

Figure 1 for Multilingual Speech Recognition using Knowledge Transfer across Learning Processes
Figure 2 for Multilingual Speech Recognition using Knowledge Transfer across Learning Processes
Figure 3 for Multilingual Speech Recognition using Knowledge Transfer across Learning Processes
Figure 4 for Multilingual Speech Recognition using Knowledge Transfer across Learning Processes
Viaarxiv icon

Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition

Feb 17, 2022
Chao-Han Huck Yang, Zeeshan Ahmed, Yile Gu, Joseph Szurley, Roger Ren, Linda Liu, Andreas Stolcke, Ivan Bulyko

Figure 1 for Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition
Figure 2 for Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition
Figure 3 for Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition
Figure 4 for Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition
Viaarxiv icon

Effect of different splitting criteria on the performance of speech emotion recognition

Oct 26, 2022
Bagus Tris Atmaja, Akira Sasou

Figure 1 for Effect of different splitting criteria on the performance of speech emotion recognition
Figure 2 for Effect of different splitting criteria on the performance of speech emotion recognition
Figure 3 for Effect of different splitting criteria on the performance of speech emotion recognition
Figure 4 for Effect of different splitting criteria on the performance of speech emotion recognition
Viaarxiv icon

Reducing language context confusion for end-to-end code-switching automatic speech recognition

Jan 28, 2022
Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Jianhua Tao, Yu Ting Yeung, Liqun Deng

Figure 1 for Reducing language context confusion for end-to-end code-switching automatic speech recognition
Figure 2 for Reducing language context confusion for end-to-end code-switching automatic speech recognition
Figure 3 for Reducing language context confusion for end-to-end code-switching automatic speech recognition
Figure 4 for Reducing language context confusion for end-to-end code-switching automatic speech recognition
Viaarxiv icon

The Use of Voice Source Features for Sung Speech Recognition

Add code
Bookmark button
Alert button
Feb 23, 2021
Gerardo Roa Dabike, Jon Barker

Figure 1 for The Use of Voice Source Features for Sung Speech Recognition
Figure 2 for The Use of Voice Source Features for Sung Speech Recognition
Figure 3 for The Use of Voice Source Features for Sung Speech Recognition
Figure 4 for The Use of Voice Source Features for Sung Speech Recognition
Viaarxiv icon

Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech

Nov 04, 2022
Xin Zhang, Iván Vallés-Pérez, Andreas Stolcke, Chengzhu Yu, Jasha Droppo, Olabanji Shonibare, Roberto Barra-Chicote, Venkatesh Ravichandran

Figure 1 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Figure 2 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Figure 3 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Figure 4 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Viaarxiv icon

A study on native American English speech recognition by Indian listeners with varying word familiarity level

Dec 08, 2021
Abhayjeet Singh, Achuth Rao MV, Rakesh Vaideeswaran, Chiranjeevi Yarra, Prasanta Kumar Ghosh

Figure 1 for A study on native American English speech recognition by Indian listeners with varying word familiarity level
Figure 2 for A study on native American English speech recognition by Indian listeners with varying word familiarity level
Figure 3 for A study on native American English speech recognition by Indian listeners with varying word familiarity level
Figure 4 for A study on native American English speech recognition by Indian listeners with varying word familiarity level
Viaarxiv icon

Efficient Domain Adaptation for Speech Foundation Models

Feb 03, 2023
Bo Li, Dongseong Hwang, Zhouyuan Huo, Junwen Bai, Guru Prakash, Tara N. Sainath, Khe Chai Sim, Yu Zhang, Wei Han, Trevor Strohman, Francoise Beaufays

Figure 1 for Efficient Domain Adaptation for Speech Foundation Models
Figure 2 for Efficient Domain Adaptation for Speech Foundation Models
Figure 3 for Efficient Domain Adaptation for Speech Foundation Models
Figure 4 for Efficient Domain Adaptation for Speech Foundation Models
Viaarxiv icon

Fusing information streams in end-to-end audio-visual speech recognition

Apr 19, 2021
Wentao Yu, Steffen Zeiler, Dorothea Kolossa

Figure 1 for Fusing information streams in end-to-end audio-visual speech recognition
Figure 2 for Fusing information streams in end-to-end audio-visual speech recognition
Figure 3 for Fusing information streams in end-to-end audio-visual speech recognition
Figure 4 for Fusing information streams in end-to-end audio-visual speech recognition
Viaarxiv icon