Alert button

"speech recognition": models, code, and papers
Alert button

Explaining the Attention Mechanism of End-to-End Speech Recognition Using Decision Trees

Oct 08, 2021
Yuanchao Wang, Wenji Du, Chenghao Cai, Yanyan Xu

Figure 1 for Explaining the Attention Mechanism of End-to-End Speech Recognition Using Decision Trees
Figure 2 for Explaining the Attention Mechanism of End-to-End Speech Recognition Using Decision Trees
Figure 3 for Explaining the Attention Mechanism of End-to-End Speech Recognition Using Decision Trees
Figure 4 for Explaining the Attention Mechanism of End-to-End Speech Recognition Using Decision Trees
Viaarxiv icon

Joint unsupervised and supervised learning for context-aware language identification

Mar 29, 2023
Jinseok Park, Hyung Yong Kim, Jihwan Park, Byeong-Yeol Kim, Shukjae Choi, Yunkyu Lim

Figure 1 for Joint unsupervised and supervised learning for context-aware language identification
Figure 2 for Joint unsupervised and supervised learning for context-aware language identification
Figure 3 for Joint unsupervised and supervised learning for context-aware language identification
Figure 4 for Joint unsupervised and supervised learning for context-aware language identification
Viaarxiv icon

Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition

Add code
Bookmark button
Alert button
Oct 11, 2021
Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng

Figure 1 for Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition
Figure 2 for Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition
Figure 3 for Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition
Figure 4 for Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition
Viaarxiv icon

Deformable TDNN with adaptive receptive fields for speech recognition

Apr 30, 2021
Keyu An, Yi Zhang, Zhijian Ou

Figure 1 for Deformable TDNN with adaptive receptive fields for speech recognition
Figure 2 for Deformable TDNN with adaptive receptive fields for speech recognition
Figure 3 for Deformable TDNN with adaptive receptive fields for speech recognition
Figure 4 for Deformable TDNN with adaptive receptive fields for speech recognition
Viaarxiv icon

Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription

Jul 08, 2022
Xianrui Zheng, Chao Zhang, Philip C. Woodland

Figure 1 for Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription
Figure 2 for Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription
Figure 3 for Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription
Figure 4 for Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription
Viaarxiv icon

Efficient Sequence Transduction by Jointly Predicting Tokens and Durations

Add code
Bookmark button
Alert button
Apr 13, 2023
Hainan Xu, Fei Jia, Somshubra Majumdar, He Huang, Shinji Watanabe, Boris Ginsburg

Figure 1 for Efficient Sequence Transduction by Jointly Predicting Tokens and Durations
Figure 2 for Efficient Sequence Transduction by Jointly Predicting Tokens and Durations
Figure 3 for Efficient Sequence Transduction by Jointly Predicting Tokens and Durations
Figure 4 for Efficient Sequence Transduction by Jointly Predicting Tokens and Durations
Viaarxiv icon

Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR

Add code
Bookmark button
Alert button
Apr 28, 2023
Ruchao Fan, Yunzheng Zhu, Jinhan Wang, Abeer Alwan

Figure 1 for Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR
Figure 2 for Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR
Figure 3 for Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR
Figure 4 for Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR
Viaarxiv icon

Reproducibility is Nothing without Correctness: The Importance of Testing Code in NLP

Add code
Bookmark button
Alert button
Mar 31, 2023
Sara Papi, Marco Gaido, Andrea Pilzer, Matteo Negri

Figure 1 for Reproducibility is Nothing without Correctness: The Importance of Testing Code in NLP
Figure 2 for Reproducibility is Nothing without Correctness: The Importance of Testing Code in NLP
Figure 3 for Reproducibility is Nothing without Correctness: The Importance of Testing Code in NLP
Figure 4 for Reproducibility is Nothing without Correctness: The Importance of Testing Code in NLP
Viaarxiv icon

Mask scalar prediction for improving robust automatic speech recognition

Apr 26, 2022
Arun Narayanan, James Walker, Sankaran Panchapagesan, Nathan Howard, Yuma Koizumi

Figure 1 for Mask scalar prediction for improving robust automatic speech recognition
Figure 2 for Mask scalar prediction for improving robust automatic speech recognition
Figure 3 for Mask scalar prediction for improving robust automatic speech recognition
Figure 4 for Mask scalar prediction for improving robust automatic speech recognition
Viaarxiv icon

Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses

Add code
Bookmark button
Alert button
Nov 16, 2021
Viet Anh Trinh, Sebastian Braun

Figure 1 for Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses
Figure 2 for Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses
Figure 3 for Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses
Figure 4 for Unsupervised Speech Enhancement with speech recognition embedding and disentanglement losses
Viaarxiv icon