Alert button

"speech recognition": models, code, and papers
Alert button

Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition Systems

Add code
Bookmark button
Alert button
Mar 02, 2022
Xiaoqiang Wang, Yanqing Liu, Jinyu Li, Veljko Miljanic, Sheng Zhao, Hosam Khalil

Figure 1 for Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition Systems
Figure 2 for Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition Systems
Figure 3 for Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition Systems
Figure 4 for Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition Systems
Viaarxiv icon

SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition

Add code
Bookmark button
Alert button
Oct 11, 2021
Jing Pan, Tao Lei, Kwangyoun Kim, Kyu Han, Shinji Watanabe

Figure 1 for SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition
Figure 2 for SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition
Figure 3 for SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition
Figure 4 for SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition
Viaarxiv icon

Transformer-based end-to-end speech recognition with residual Gaussian-based self-attention

Apr 02, 2021
Chengdong Liang, Menglong Xu, Xiao-Lei Zhang

Viaarxiv icon

Effects of Number of Filters of Convolutional Layers on Speech Recognition Model Accuracy

Feb 03, 2021
James Mou, Jun Li

Figure 1 for Effects of Number of Filters of Convolutional Layers on Speech Recognition Model Accuracy
Figure 2 for Effects of Number of Filters of Convolutional Layers on Speech Recognition Model Accuracy
Figure 3 for Effects of Number of Filters of Convolutional Layers on Speech Recognition Model Accuracy
Figure 4 for Effects of Number of Filters of Convolutional Layers on Speech Recognition Model Accuracy
Viaarxiv icon

Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition

Add code
Bookmark button
Alert button
Jul 02, 2021
Timo Lohrenz, Patrick Schwarz, Zhengyang Li, Tim Fingscheidt

Figure 1 for Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition
Figure 2 for Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition
Figure 3 for Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition
Figure 4 for Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition
Viaarxiv icon

Effectiveness of text to speech pseudo labels for forced alignment and cross lingual pretrained models for low resource speech recognition

Add code
Bookmark button
Alert button
Mar 31, 2022
Anirudh Gupta, Rishabh Gaur, Ankur Dhuriya, Harveen Singh Chadha, Neeraj Chhimwal, Priyanshi Shah, Vivek Raghavan

Figure 1 for Effectiveness of text to speech pseudo labels for forced alignment and cross lingual pretrained models for low resource speech recognition
Figure 2 for Effectiveness of text to speech pseudo labels for forced alignment and cross lingual pretrained models for low resource speech recognition
Figure 3 for Effectiveness of text to speech pseudo labels for forced alignment and cross lingual pretrained models for low resource speech recognition
Figure 4 for Effectiveness of text to speech pseudo labels for forced alignment and cross lingual pretrained models for low resource speech recognition
Viaarxiv icon

Arabic Code-Switching Speech Recognition using Monolingual Data

Jul 04, 2021
Ahmed Ali, Shammur Chowdhury, Amir Hussein, Yasser Hifny

Figure 1 for Arabic Code-Switching Speech Recognition using Monolingual Data
Figure 2 for Arabic Code-Switching Speech Recognition using Monolingual Data
Figure 3 for Arabic Code-Switching Speech Recognition using Monolingual Data
Figure 4 for Arabic Code-Switching Speech Recognition using Monolingual Data
Viaarxiv icon

Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence

Apr 18, 2023
Yicheng Hsu, Mingsian R. Bai

Figure 1 for Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence
Figure 2 for Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence
Figure 3 for Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence
Figure 4 for Array Configuration-Agnostic Personal Voice Activity Detection Based on Spatial Coherence
Viaarxiv icon

Regeneration Learning: A Learning Paradigm for Data Generation

Jan 21, 2023
Xu Tan, Tao Qin, Jiang Bian, Tie-Yan Liu, Yoshua Bengio

Figure 1 for Regeneration Learning: A Learning Paradigm for Data Generation
Figure 2 for Regeneration Learning: A Learning Paradigm for Data Generation
Figure 3 for Regeneration Learning: A Learning Paradigm for Data Generation
Figure 4 for Regeneration Learning: A Learning Paradigm for Data Generation
Viaarxiv icon