Alert button

"speech recognition": models, code, and papers
Alert button

VietMed: A Dataset and Benchmark for Automatic Speech Recognition of Vietnamese in the Medical Domain

Add code
Bookmark button
Alert button
Apr 08, 2024
Khai Le-Duc

Viaarxiv icon

Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition

Apr 04, 2024
Hainan Xu, Zhehuai Chen, Fei Jia, Boris Ginsburg

Viaarxiv icon

DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition

Add code
Bookmark button
Alert button
Apr 11, 2024
Yi-Cheng Wang, Hsin-Wei Wang, Bi-Cheng Yan, Chi-Han Lin, Berlin Chen

Viaarxiv icon

TRNet: Two-level Refinement Network leveraging Speech Enhancement for Noise Robust Speech Emotion Recognition

Apr 19, 2024
Chengxin Chen, Pengyuan Zhang

Viaarxiv icon

Mai Ho'omāuna i ka 'Ai: Language Models Improve Automatic Speech Recognition in Hawaiian

Apr 03, 2024
Kaavya Chaparala, Guido Zarrella, Bruce Torres Fischer, Larry Kimura, Oiwi Parker Jones

Viaarxiv icon

Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition

Mar 28, 2024
Yash Jain, David Chan, Pranav Dheram, Aparna Khare, Olabanji Shonibare, Venkatesh Ravichandran, Shalini Ghosh

Figure 1 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Figure 2 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Figure 3 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Figure 4 for Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition
Viaarxiv icon

More than words: Advancements and challenges in speech recognition for singing

Mar 14, 2024
Anna Kruspe

Viaarxiv icon

Efficient infusion of self-supervised representations in Automatic Speech Recognition

Apr 19, 2024
Darshan Prabhu, Sai Ganesh Mirishkar, Pankaj Wasnik

Viaarxiv icon

Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition

Add code
Bookmark button
Alert button
Mar 28, 2024
Siyuan Shen, Yu Gao, Feng Liu, Hanyang Wang, Aimin Zhou

Viaarxiv icon

Learn2Talk: 3D Talking Face Learns from 2D Talking Face

Apr 19, 2024
Yixiang Zhuang, Baoping Cheng, Yao Cheng, Yuntao Jin, Renshuai Liu, Chengyang Li, Xuan Cheng, Jing Liao, Juncong Lin

Viaarxiv icon