Alert button

"speech": models, code, and papers
Alert button

End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend

Feb 23, 2021
Wangyou Zhang, Christoph Boeddeker, Shinji Watanabe, Tomohiro Nakatani, Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Naoyuki Kamo, Reinhold Haeb-Umbach, Yanmin Qian

Figure 1 for End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend
Figure 2 for End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend
Figure 3 for End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend
Viaarxiv icon

Part of speech and gramset tagging algorithms for unknown words based on morphological dictionaries of the Veps and Karelian languages

Mar 22, 2021
Andrew Krizhanovsky, Natalia Krizhanovsky, Irina Novak

Figure 1 for Part of speech and gramset tagging algorithms for unknown words based on morphological dictionaries of the Veps and Karelian languages
Figure 2 for Part of speech and gramset tagging algorithms for unknown words based on morphological dictionaries of the Veps and Karelian languages
Figure 3 for Part of speech and gramset tagging algorithms for unknown words based on morphological dictionaries of the Veps and Karelian languages
Figure 4 for Part of speech and gramset tagging algorithms for unknown words based on morphological dictionaries of the Veps and Karelian languages
Viaarxiv icon

KoSpeech: Open-Source Toolkit for End-to-End Korean Speech Recognition

Sep 26, 2020
Soohwan Kim, Seyoung Bae, Cheolhwang Won

Figure 1 for KoSpeech: Open-Source Toolkit for End-to-End Korean Speech Recognition
Figure 2 for KoSpeech: Open-Source Toolkit for End-to-End Korean Speech Recognition
Figure 3 for KoSpeech: Open-Source Toolkit for End-to-End Korean Speech Recognition
Figure 4 for KoSpeech: Open-Source Toolkit for End-to-End Korean Speech Recognition
Viaarxiv icon

Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT

Feb 15, 2021
Ye Bai, Jiangyan Yi, Jianhua Tao, Zhengkun Tian, Zhengqi Wen, Shuai Zhang

Figure 1 for Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
Figure 2 for Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
Figure 3 for Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
Figure 4 for Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT
Viaarxiv icon

Transformer-based end-to-end speech recognition with residual Gaussian-based self-attention

Mar 29, 2021
Chengdong Liang, Menglong Xu, Xiao-Lei Zhang

Viaarxiv icon

Exploring Phoneme-Level Speech Representations for End-to-End Speech Translation

Jun 04, 2019
Elizabeth Salesky, Matthias Sperber, Alan W Black

Figure 1 for Exploring Phoneme-Level Speech Representations for End-to-End Speech Translation
Figure 2 for Exploring Phoneme-Level Speech Representations for End-to-End Speech Translation
Figure 3 for Exploring Phoneme-Level Speech Representations for End-to-End Speech Translation
Figure 4 for Exploring Phoneme-Level Speech Representations for End-to-End Speech Translation
Viaarxiv icon

Refining Automatic Speech Recognition System for older adults

Nov 17, 2020
Liu Chen, Meysam Asgari

Figure 1 for Refining Automatic Speech Recognition System for older adults
Figure 2 for Refining Automatic Speech Recognition System for older adults
Figure 3 for Refining Automatic Speech Recognition System for older adults
Figure 4 for Refining Automatic Speech Recognition System for older adults
Viaarxiv icon

The MSXF TTS System for ICASSP 2022 ADD Challenge

Jan 27, 2022
Chunyong Yang, Pengfei Liu, Yanli Chen, Hongbin Wang, Min Liu

Figure 1 for The MSXF TTS System for ICASSP 2022 ADD Challenge
Viaarxiv icon

Unidirectional Memory-Self-Attention Transducer for Online Speech Recognition

Feb 23, 2021
Jian Luo, Jianzong Wang, Ning Cheng, Jing Xiao

Figure 1 for Unidirectional Memory-Self-Attention Transducer for Online Speech Recognition
Figure 2 for Unidirectional Memory-Self-Attention Transducer for Online Speech Recognition
Figure 3 for Unidirectional Memory-Self-Attention Transducer for Online Speech Recognition
Figure 4 for Unidirectional Memory-Self-Attention Transducer for Online Speech Recognition
Viaarxiv icon

A Recurrent Variational Autoencoder for Speech Enhancement

Oct 24, 2019
Simon Leglaive, Xavier Alameda-Pineda, Laurent Girin, Radu Horaud

Figure 1 for A Recurrent Variational Autoencoder for Speech Enhancement
Figure 2 for A Recurrent Variational Autoencoder for Speech Enhancement
Figure 3 for A Recurrent Variational Autoencoder for Speech Enhancement
Figure 4 for A Recurrent Variational Autoencoder for Speech Enhancement
Viaarxiv icon