Alert button

"speech recognition": models, code, and papers
Alert button

An audio-quality-based multi-strategy approach for target speaker extraction in the MISP 2023 Challenge

Jan 08, 2024
Runduo Han, Xiaopeng Yan, Weiming Xu, Pengcheng Guo, Jiayao Sun, He Wang, Quan Lu, Ning Jiang, Lei Xie

Viaarxiv icon

A comparative analysis between Conformer-Transducer, Whisper, and wav2vec2 for improving the child speech recognition

Nov 07, 2023
Andrei Barcovschi, Rishabh Jain, Peter Corcoran

Viaarxiv icon

Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization

Oct 16, 2023
Zhihong Lei, Ernest Pusateri, Shiyi Han, Leo Liu, Mingbin Xu, Tim Ng, Ruchir Travadi, Youyuan Zhang, Mirko Hannemann, Man-Hung Siu, Zhen Huang

Viaarxiv icon

End to end Hindi to English speech conversion using Bark, mBART and a finetuned XLSR Wav2Vec2

Jan 11, 2024
Aniket Tathe, Anand Kamble, Suyash Kumbharkar, Atharva Bhandare, Anirban C. Mitra

Viaarxiv icon

FAT-HuBERT: Front-end Adaptive Training of Hidden-unit BERT for Distortion-Invariant Robust Speech Recognition

Nov 29, 2023
Dongning Yang, Wei Wang, Yanmin Qian

Viaarxiv icon

Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model

Oct 23, 2023
Joanna Hong, Se Jin Park, Yong Man Ro

Figure 1 for Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model
Figure 2 for Intuitive Multilingual Audio-Visual Speech Recognition with a Single-Trained Model
Viaarxiv icon

Improving ASR Contextual Biasing with Guided Attention

Jan 16, 2024
Jiyang Tang, Kwangyoun Kim, Suwon Shon, Felix Wu, Prashant Sridhar, Shinji Watanabe

Viaarxiv icon

Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective

Jan 16, 2024
Alexander H. Liu, Sung-Lin Yeh, James Glass

Viaarxiv icon

Punctuation Restoration Improves Structure Understanding without Supervision

Feb 13, 2024
Junghyun Min, Minho Lee, Woochul Lee, Yeonsoo Lee

Viaarxiv icon

XLS-R Deep Learning Model for Multilingual ASR on Low- Resource Languages: Indonesian, Javanese, and Sundanese

Jan 12, 2024
Panji Arisaputra, Alif Tri Handoyo, Amalia Zahra

Viaarxiv icon