Alert button

"speech recognition": models, code, and papers
Alert button

Data and knowledge-driven approaches for multilingual training to improve the performance of speech recognition systems of Indian languages

Jan 24, 2022
A. Madhavaraj, Ramakrishnan Angarai Ganesan

Figure 1 for Data and knowledge-driven approaches for multilingual training to improve the performance of speech recognition systems of Indian languages
Figure 2 for Data and knowledge-driven approaches for multilingual training to improve the performance of speech recognition systems of Indian languages
Figure 3 for Data and knowledge-driven approaches for multilingual training to improve the performance of speech recognition systems of Indian languages
Figure 4 for Data and knowledge-driven approaches for multilingual training to improve the performance of speech recognition systems of Indian languages
Viaarxiv icon

Training Speech Recognition Models with Federated Learning: A Quality/Cost Framework

Oct 29, 2020
Dhruv Guliani, Francoise Beaufays, Giovanni Motta

Figure 1 for Training Speech Recognition Models with Federated Learning: A Quality/Cost Framework
Figure 2 for Training Speech Recognition Models with Federated Learning: A Quality/Cost Framework
Figure 3 for Training Speech Recognition Models with Federated Learning: A Quality/Cost Framework
Figure 4 for Training Speech Recognition Models with Federated Learning: A Quality/Cost Framework
Viaarxiv icon

Towards A Unified Conformer Structure: from ASR to ASV Task

Add code
Bookmark button
Alert button
Nov 14, 2022
Dexin Liao, Tao Jiang, Feng Wang, Lin Li, Qingyang Hong

Figure 1 for Towards A Unified Conformer Structure: from ASR to ASV Task
Figure 2 for Towards A Unified Conformer Structure: from ASR to ASV Task
Figure 3 for Towards A Unified Conformer Structure: from ASR to ASV Task
Figure 4 for Towards A Unified Conformer Structure: from ASR to ASV Task
Viaarxiv icon

Multiresolution and Multimodal Speech Recognition with Transformers

Apr 29, 2020
Georgios Paraskevopoulos, Srinivas Parthasarathy, Aparna Khare, Shiva Sundaram

Figure 1 for Multiresolution and Multimodal Speech Recognition with Transformers
Figure 2 for Multiresolution and Multimodal Speech Recognition with Transformers
Figure 3 for Multiresolution and Multimodal Speech Recognition with Transformers
Figure 4 for Multiresolution and Multimodal Speech Recognition with Transformers
Viaarxiv icon

KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos

Add code
Bookmark button
Alert button
Mar 01, 2019
Egor Lakomkin, Sven Magg, Cornelius Weber, Stefan Wermter

Figure 1 for KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Figure 2 for KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Figure 3 for KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Figure 4 for KT-Speech-Crawler: Automatic Dataset Construction for Speech Recognition from YouTube Videos
Viaarxiv icon

End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend

Add code
Bookmark button
Alert button
Feb 23, 2021
Wangyou Zhang, Christoph Boeddeker, Shinji Watanabe, Tomohiro Nakatani, Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Naoyuki Kamo, Reinhold Haeb-Umbach, Yanmin Qian

Figure 1 for End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend
Figure 2 for End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend
Figure 3 for End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend
Viaarxiv icon

Speech recognition with quaternion neural networks

Add code
Bookmark button
Alert button
Nov 21, 2018
Titouan Parcollet, Mirco Ravanelli, Mohamed Morchid, Georges Linarès, Renato De Mori

Figure 1 for Speech recognition with quaternion neural networks
Figure 2 for Speech recognition with quaternion neural networks
Figure 3 for Speech recognition with quaternion neural networks
Viaarxiv icon

Transformer with Bidirectional Decoder for Speech Recognition

Aug 11, 2020
Xi Chen, Songyang Zhang, Dandan Song, Peng Ouyang, Shouyi Yin

Figure 1 for Transformer with Bidirectional Decoder for Speech Recognition
Figure 2 for Transformer with Bidirectional Decoder for Speech Recognition
Figure 3 for Transformer with Bidirectional Decoder for Speech Recognition
Figure 4 for Transformer with Bidirectional Decoder for Speech Recognition
Viaarxiv icon

Fundamental Frequency Feature Normalization and Data Augmentation for Child Speech Recognition

Feb 18, 2021
Gary Yeung, Ruchao Fan, Abeer Alwan

Figure 1 for Fundamental Frequency Feature Normalization and Data Augmentation for Child Speech Recognition
Figure 2 for Fundamental Frequency Feature Normalization and Data Augmentation for Child Speech Recognition
Viaarxiv icon

Improved Robustness to Disfluencies in RNN-Transducer Based Speech Recognition

Dec 11, 2020
Valentin Mendelev, Tina Raissi, Guglielmo Camporese, Manuel Giollo

Figure 1 for Improved Robustness to Disfluencies in RNN-Transducer Based Speech Recognition
Figure 2 for Improved Robustness to Disfluencies in RNN-Transducer Based Speech Recognition
Figure 3 for Improved Robustness to Disfluencies in RNN-Transducer Based Speech Recognition
Figure 4 for Improved Robustness to Disfluencies in RNN-Transducer Based Speech Recognition
Viaarxiv icon