Alert button

"speech recognition": models, code, and papers
Alert button

Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition

Jul 24, 2015
Haşim Sak, Andrew Senior, Kanishka Rao, Françoise Beaufays

Figure 1 for Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition
Figure 2 for Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition
Figure 3 for Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition
Figure 4 for Fast and Accurate Recurrent Neural Network Acoustic Models for Speech Recognition
Viaarxiv icon

Contaminated speech training methods for robust DNN-HMM distant speech recognition

Oct 10, 2017
Mirco Ravanelli, Maurizio Omologo

Figure 1 for Contaminated speech training methods for robust DNN-HMM distant speech recognition
Figure 2 for Contaminated speech training methods for robust DNN-HMM distant speech recognition
Figure 3 for Contaminated speech training methods for robust DNN-HMM distant speech recognition
Figure 4 for Contaminated speech training methods for robust DNN-HMM distant speech recognition
Viaarxiv icon

Twin Regularization for online speech recognition

Jun 12, 2018
Mirco Ravanelli, Dmitriy Serdyuk, Yoshua Bengio

Figure 1 for Twin Regularization for online speech recognition
Figure 2 for Twin Regularization for online speech recognition
Figure 3 for Twin Regularization for online speech recognition
Figure 4 for Twin Regularization for online speech recognition
Viaarxiv icon

SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition

Oct 04, 2019
Zhen Huang, Tim Ng, Leo Liu, Henry Mason, Xiaodan Zhuang, Daben Liu

Figure 1 for SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition
Figure 2 for SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition
Figure 3 for SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition
Figure 4 for SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition
Viaarxiv icon

Scaling ASR Improves Zero and Few Shot Learning

Nov 29, 2021
Alex Xiao, Weiyi Zheng, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed

Figure 1 for Scaling ASR Improves Zero and Few Shot Learning
Figure 2 for Scaling ASR Improves Zero and Few Shot Learning
Figure 3 for Scaling ASR Improves Zero and Few Shot Learning
Figure 4 for Scaling ASR Improves Zero and Few Shot Learning
Viaarxiv icon

3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition

Apr 07, 2022
Zhao You, Shulin Feng, Dan Su, Dong Yu

Figure 1 for 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
Figure 2 for 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
Figure 3 for 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
Figure 4 for 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
Viaarxiv icon

End-to-end Audiovisual Speech Recognition

Feb 22, 2018
Stavros Petridis, Themos Stafylakis, Pingchuan Ma, Feipeng Cai, Georgios Tzimiropoulos, Maja Pantic

Figure 1 for End-to-end Audiovisual Speech Recognition
Figure 2 for End-to-end Audiovisual Speech Recognition
Figure 3 for End-to-end Audiovisual Speech Recognition
Figure 4 for End-to-end Audiovisual Speech Recognition
Viaarxiv icon

Advances and Challenges in Deep Lip Reading

Oct 15, 2021
Marzieh Oghbaie, Arian Sabaghi, Kooshan Hashemifard, Mohammad Akbari

Figure 1 for Advances and Challenges in Deep Lip Reading
Figure 2 for Advances and Challenges in Deep Lip Reading
Figure 3 for Advances and Challenges in Deep Lip Reading
Figure 4 for Advances and Challenges in Deep Lip Reading
Viaarxiv icon

Multi-sequence Intermediate Conditioning for CTC-based ASR

Apr 01, 2022
Yusuke Fujita, Tatsuya Komatsu, Yusuke Kida

Figure 1 for Multi-sequence Intermediate Conditioning for CTC-based ASR
Figure 2 for Multi-sequence Intermediate Conditioning for CTC-based ASR
Figure 3 for Multi-sequence Intermediate Conditioning for CTC-based ASR
Figure 4 for Multi-sequence Intermediate Conditioning for CTC-based ASR
Viaarxiv icon

Does Speech enhancement of publicly available data help build robust Speech Recognition Systems?

Oct 29, 2019
Bhavya Ghai, Buvana Ramanan, Klaus Mueller

Figure 1 for Does Speech enhancement of publicly available data help build robust Speech Recognition Systems?
Figure 2 for Does Speech enhancement of publicly available data help build robust Speech Recognition Systems?
Viaarxiv icon