Alert button

"speech recognition": models, code, and papers
Alert button

Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning

Add code
Bookmark button
Alert button
Jan 31, 2017
Suyoun Kim, Takaaki Hori, Shinji Watanabe

Figure 1 for Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning
Figure 2 for Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning
Figure 3 for Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning
Figure 4 for Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning
Viaarxiv icon

Short-Term Word-Learning in a Dynamically Changing Environment

Add code
Bookmark button
Alert button
Mar 29, 2022
Christian Huber, Rishu Kumar, Ondřej Bojar, Alexander Waibel

Figure 1 for Short-Term Word-Learning in a Dynamically Changing Environment
Figure 2 for Short-Term Word-Learning in a Dynamically Changing Environment
Figure 3 for Short-Term Word-Learning in a Dynamically Changing Environment
Figure 4 for Short-Term Word-Learning in a Dynamically Changing Environment
Viaarxiv icon

Improved training for online end-to-end speech recognition systems

Add code
Bookmark button
Alert button
Aug 30, 2018
Suyoun Kim, Michael L. Seltzer, Jinyu Li, Rui Zhao

Figure 1 for Improved training for online end-to-end speech recognition systems
Figure 2 for Improved training for online end-to-end speech recognition systems
Figure 3 for Improved training for online end-to-end speech recognition systems
Viaarxiv icon

Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing

Mar 29, 2022
Xiaodong Cui, George Saon, Tohru Nagano, Masayuki Suzuki, Takashi Fukuda, Brian Kingsbury, Gakuto Kurata

Figure 1 for Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
Figure 2 for Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
Figure 3 for Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
Figure 4 for Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
Viaarxiv icon

Pseudo Label Is Better Than Human Label

Mar 28, 2022
Dongseong Hwang, Khe Chai Sim, Zhouyuan Huo, Trevor Strohman

Figure 1 for Pseudo Label Is Better Than Human Label
Figure 2 for Pseudo Label Is Better Than Human Label
Figure 3 for Pseudo Label Is Better Than Human Label
Figure 4 for Pseudo Label Is Better Than Human Label
Viaarxiv icon

An Adaptive Psychoacoustic Model for Automatic Speech Recognition

Sep 14, 2016
Peng Dai, Xue Teng, Frank Rudzicz, Ing Yann Soon

Figure 1 for An Adaptive Psychoacoustic Model for Automatic Speech Recognition
Figure 2 for An Adaptive Psychoacoustic Model for Automatic Speech Recognition
Figure 3 for An Adaptive Psychoacoustic Model for Automatic Speech Recognition
Figure 4 for An Adaptive Psychoacoustic Model for Automatic Speech Recognition
Viaarxiv icon

Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing

May 14, 2022
Heli Qi, Sashi Novitasari, Sakriani Sakti, Satoshi Nakamura

Figure 1 for Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing
Figure 2 for Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing
Figure 3 for Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing
Viaarxiv icon

Cascaded CNN-resBiLSTM-CTC: An End-to-End Acoustic Model For Speech Recognition

Oct 30, 2018
Xinpei Zhou, Jiwei Li, Xi Zhou

Figure 1 for Cascaded CNN-resBiLSTM-CTC: An End-to-End Acoustic Model For Speech Recognition
Figure 2 for Cascaded CNN-resBiLSTM-CTC: An End-to-End Acoustic Model For Speech Recognition
Figure 3 for Cascaded CNN-resBiLSTM-CTC: An End-to-End Acoustic Model For Speech Recognition
Figure 4 for Cascaded CNN-resBiLSTM-CTC: An End-to-End Acoustic Model For Speech Recognition
Viaarxiv icon

Graph based manifold regularized deep neural networks for automatic speech recognition

Jun 19, 2016
Vikrant Singh Tomar, Richard C. Rose

Figure 1 for Graph based manifold regularized deep neural networks for automatic speech recognition
Figure 2 for Graph based manifold regularized deep neural networks for automatic speech recognition
Figure 3 for Graph based manifold regularized deep neural networks for automatic speech recognition
Figure 4 for Graph based manifold regularized deep neural networks for automatic speech recognition
Viaarxiv icon

A Bayesian Network View on Acoustic Model-Based Techniques for Robust Speech Recognition

Sep 22, 2014
Roland Maas, Christian Huemmer, Armin Sehr, Walter Kellermann

Figure 1 for A Bayesian Network View on Acoustic Model-Based Techniques for Robust Speech Recognition
Figure 2 for A Bayesian Network View on Acoustic Model-Based Techniques for Robust Speech Recognition
Figure 3 for A Bayesian Network View on Acoustic Model-Based Techniques for Robust Speech Recognition
Figure 4 for A Bayesian Network View on Acoustic Model-Based Techniques for Robust Speech Recognition
Viaarxiv icon