
"speech recognition": models, code, and papers

Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech Recognition

Mar 17, 2022
Mengzhe Geng, Xurong Xie, Zi Ye, Tianzi Wang, Guinan Li, Shujie Hu, Xunying Liu, Helen Meng


Knowledge Transfer and Distillation from Autoregressive to Non-Autoregressive Speech Recognition

Jul 15, 2022
Xun Gong, Zhikai Zhou, Yanmin Qian


Self-Supervised Learning-Based Source Separation for Meeting Data

Apr 03, 2023
Yuang Li, Xianrui Zheng, Philip C. Woodland


Improving Speech Recognition for Indic Languages using Language Model

Mar 30, 2022
Ankur Dhuriya, Harveen Singh Chadha, Anirudh Gupta, Priyanshi Shah, Neeraj Chhimwal, Rishabh Gaur, Vivek Raghavan


Successes and critical failures of neural networks in capturing human-like speech recognition

Apr 06, 2022
Federico Adolfi, Jeffrey S. Bowers, David Poeppel


Speech Reconstruction from Silent Tongue and Lip Articulation By Pseudo Target Generation and Domain Adversarial Training

Apr 12, 2023
Rui-Chen Zheng, Yang Ai, Zhen-Hua Ling


Conversational Speech Recognition By Learning Conversation-level Characteristics

Feb 17, 2022
Kun Wei, Yike Zhang, Sining Sun, Lei Xie, Long Ma


Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition

Jun 30, 2022
Yongjun Jiang, Jian Yu, Wenwen Yang, Bihong Zhang, Yanfeng Wang


Boosting Cross-Domain Speech Recognition with Self-Supervision

Jun 20, 2022
Han Zhu, Gaofeng Cheng, Jindong Wang, Wenxin Hou, Pengyuan Zhang, Yonghong Yan


A CTC Triggered Siamese Network with Spatial-Temporal Dropout for Speech Recognition

Jun 22, 2022
Yingying Gao, Junlan Feng, Tianrui Wang, Chao Deng, Shilei Zhang
