Alert button

"speech recognition": models, code, and papers
Alert button

Enhancing Pre-trained ASR System Fine-tuning for Dysarthric Speech Recognition using Adversarial Data Augmentation

Jan 01, 2024
Huimeng Wang, Zengrui Jin, Mengzhe Geng, Shujie Hu, Guinan Li, Tianzi Wang, Haoning Xu, Xunying Liu

Viaarxiv icon

BS-PLCNet: Band-split Packet Loss Concealment Network with Multi-task Learning Framework and Multi-discriminators

Jan 08, 2024
Zihan Zhang, Jiayao Sun, Xianjun Xia, Chuanzeng Huang, Yijian Xiao, Lei Xie

Viaarxiv icon

Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition

Add code
Bookmark button
Alert button
Nov 06, 2023
Rabindra Nath Nandi, Mehadi Hasan Menon, Tareq Al Muntasir, Sagor Sarker, Quazi Sarwar Muhtaseem, Md. Tariqul Islam, Shammur Absar Chowdhury, Firoj Alam

Viaarxiv icon

Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation

Nov 09, 2023
Zhaofeng Lin, Tanvina Patel, Odette Scharenborg

Viaarxiv icon

Back Transcription as a Method for Evaluating Robustness of Natural Language Understanding Models to Speech Recognition Errors

Oct 25, 2023
Marek Kubis, Paweł Skórzewski, Marcin Sowański, Tomasz Ziętkiewicz

Figure 1 for Back Transcription as a Method for Evaluating Robustness of Natural Language Understanding Models to Speech Recognition Errors
Figure 2 for Back Transcription as a Method for Evaluating Robustness of Natural Language Understanding Models to Speech Recognition Errors
Figure 3 for Back Transcription as a Method for Evaluating Robustness of Natural Language Understanding Models to Speech Recognition Errors
Figure 4 for Back Transcription as a Method for Evaluating Robustness of Natural Language Understanding Models to Speech Recognition Errors
Viaarxiv icon

Punctuation Restoration Improves Structure Understanding without Supervision

Feb 13, 2024
Junghyun Min, Minho Lee, Woochul Lee, Yeonsoo Lee

Viaarxiv icon

Accented Speech Recognition With Accent-specific Codebooks

Add code
Bookmark button
Alert button
Oct 27, 2023
Darshan Prabhu, Preethi Jyothi, Sriram Ganapathy, Vinit Unni

Figure 1 for Accented Speech Recognition With Accent-specific Codebooks
Figure 2 for Accented Speech Recognition With Accent-specific Codebooks
Figure 3 for Accented Speech Recognition With Accent-specific Codebooks
Figure 4 for Accented Speech Recognition With Accent-specific Codebooks
Viaarxiv icon

Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization

Jan 16, 2024
Ming Cheng, Ming Li

Viaarxiv icon

Analysis of Self-Supervised Speech Models on Children's Speech and Infant Vocalizations

Feb 10, 2024
Jialu Li, Mark Hasegawa-Johnson, Nancy L. McElwain

Viaarxiv icon