"speech recognition": models, code, and papers

Automatic Speech Recognition Datasets in Cantonese Language: A Survey and a New Dataset

Jan 07, 2022
Tiezheng Yu, Rita Frieske, Peng Xu, Samuel Cahyawijaya, Cheuk Tung Shadow Yiu, Holy Lovenia, Wenliang Dai, Elham J. Barezi, Qifeng Chen, Xiaojuan Ma, Bertram E. Shi, Pascale Fung

Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser

Apr 08, 2022
Sonal Joshi, Saurabh Kataria, Yiwen Shao, Piotr Zelasko, Jesus Villalba, Sanjeev Khudanpur, Najim Dehak

Effects of Number of Filters of Convolutional Layers on Speech Recognition Model Accuracy

Feb 03, 2021
James Mou, Jun Li

Transformer-based end-to-end speech recognition with residual Gaussian-based self-attention

Apr 02, 2021
Chengdong Liang, Menglong Xu, Xiao-Lei Zhang

SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition

Oct 11, 2021
Jing Pan, Tao Lei, Kwangyoun Kim, Kyu Han, Shinji Watanabe

Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition

Jul 02, 2021
Timo Lohrenz, Patrick Schwarz, Zhengyang Li, Tim Fingscheidt

Simple and Effective Unsupervised Speech Translation

Oct 18, 2022
Changhan Wang, Hirofumi Inaguma, Peng-Jen Chen, Ilia Kulikov, Yun Tang, Wei-Ning Hsu, Michael Auli, Juan Pino

Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition Systems

Mar 02, 2022
Xiaoqiang Wang, Yanqing Liu, Jinyu Li, Veljko Miljanic, Sheng Zhao, Hosam Khalil

Arabic Code-Switching Speech Recognition using Monolingual Data

Jul 04, 2021
Ahmed Ali, Shammur Chowdhury, Amir Hussein, Yasser Hifny
