Alert button

"speech recognition": models, code, and papers
Alert button

Context-Aware Selective Label Smoothing for Calibrating Sequence Recognition Model

Mar 13, 2023
Shuangping Huang, Yu Luo, Zhenzhou Zhuang, Jin-Gang Yu, Mengchao He, Yongpan Wang

Figure 1 for Context-Aware Selective Label Smoothing for Calibrating Sequence Recognition Model
Figure 2 for Context-Aware Selective Label Smoothing for Calibrating Sequence Recognition Model
Figure 3 for Context-Aware Selective Label Smoothing for Calibrating Sequence Recognition Model
Figure 4 for Context-Aware Selective Label Smoothing for Calibrating Sequence Recognition Model
Viaarxiv icon

AV-data2vec: Self-supervised Learning of Audio-Visual Speech Representations with Contextualized Target Representations

Feb 10, 2023
Jiachen Lian, Alexei Baevski, Wei-Ning Hsu, Michael Auli

Figure 1 for AV-data2vec: Self-supervised Learning of Audio-Visual Speech Representations with Contextualized Target Representations
Figure 2 for AV-data2vec: Self-supervised Learning of Audio-Visual Speech Representations with Contextualized Target Representations
Figure 3 for AV-data2vec: Self-supervised Learning of Audio-Visual Speech Representations with Contextualized Target Representations
Figure 4 for AV-data2vec: Self-supervised Learning of Audio-Visual Speech Representations with Contextualized Target Representations
Viaarxiv icon

Content-Context Factorized Representations for Automated Speech Recognition

May 19, 2022
David M. Chan, Shalini Ghosh

Figure 1 for Content-Context Factorized Representations for Automated Speech Recognition
Figure 2 for Content-Context Factorized Representations for Automated Speech Recognition
Figure 3 for Content-Context Factorized Representations for Automated Speech Recognition
Figure 4 for Content-Context Factorized Representations for Automated Speech Recognition
Viaarxiv icon

Learning a Dual-Mode Speech Recognition Model via Self-Pruning

Jul 25, 2022
Chunxi Liu, Yuan Shangguan, Haichuan Yang, Yangyang Shi, Raghuraman Krishnamoorthi, Ozlem Kalinli

Figure 1 for Learning a Dual-Mode Speech Recognition Model via Self-Pruning
Figure 2 for Learning a Dual-Mode Speech Recognition Model via Self-Pruning
Figure 3 for Learning a Dual-Mode Speech Recognition Model via Self-Pruning
Figure 4 for Learning a Dual-Mode Speech Recognition Model via Self-Pruning
Viaarxiv icon

Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion

Add code
Bookmark button
Alert button
May 16, 2023
Xintao Zhao, Shuai Wang, Yang Chao, Zhiyong Wu, Helen Meng

Figure 1 for Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion
Figure 2 for Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion
Figure 3 for Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion
Figure 4 for Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion
Viaarxiv icon

Unsupervised Model-based speaker adaptation of end-to-end lattice-free MMI model for speech recognition

Nov 17, 2022
Xurong Xie, Xunying Liu, Hui Chen, Hongan Wang

Figure 1 for Unsupervised Model-based speaker adaptation of end-to-end lattice-free MMI model for speech recognition
Figure 2 for Unsupervised Model-based speaker adaptation of end-to-end lattice-free MMI model for speech recognition
Figure 3 for Unsupervised Model-based speaker adaptation of end-to-end lattice-free MMI model for speech recognition
Figure 4 for Unsupervised Model-based speaker adaptation of end-to-end lattice-free MMI model for speech recognition
Viaarxiv icon

Investigation of Data Augmentation Techniques for Disordered Speech Recognition

Jan 14, 2022
Mengzhe Geng, Xurong Xie, Shansong Liu, Jianwei Yu, Shoukang Hu, Xunying Liu, Helen Meng

Figure 1 for Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Figure 2 for Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Figure 3 for Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Figure 4 for Investigation of Data Augmentation Techniques for Disordered Speech Recognition
Viaarxiv icon

Unsupervised Domain Adaptation in Speech Recognition using Phonetic Features

Aug 04, 2021
Rupam Ojha, C Chandra Sekhar

Figure 1 for Unsupervised Domain Adaptation in Speech Recognition using Phonetic Features
Figure 2 for Unsupervised Domain Adaptation in Speech Recognition using Phonetic Features
Figure 3 for Unsupervised Domain Adaptation in Speech Recognition using Phonetic Features
Figure 4 for Unsupervised Domain Adaptation in Speech Recognition using Phonetic Features
Viaarxiv icon

Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition

Sep 17, 2022
Ye Bai, Jie Li, Wenjing Han, Hao Ni, Kaituo Xu, Zhuo Zhang, Cheng Yi, Xiaorui Wang

Figure 1 for Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition
Figure 2 for Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition
Figure 3 for Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition
Figure 4 for Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition
Viaarxiv icon

Challenges and Opportunities of Speech Recognition for Bengali Language

Sep 27, 2021
M. F. Mridha, Abu Quwsar Ohi, Md. Abdul Hamid, Muhammad Mostafa Monowar

Figure 1 for Challenges and Opportunities of Speech Recognition for Bengali Language
Figure 2 for Challenges and Opportunities of Speech Recognition for Bengali Language
Figure 3 for Challenges and Opportunities of Speech Recognition for Bengali Language
Figure 4 for Challenges and Opportunities of Speech Recognition for Bengali Language
Viaarxiv icon