Alert button

"speech recognition": models, code, and papers
Alert button

Speech Diarization and ASR with GMM

Jul 11, 2023
Aayush Kumar Sharma, Vineet Bhavikatti, Amogh Nidawani, Dr. Siddappaji, Sanath P, Dr Geetishree Mishra

Figure 1 for Speech Diarization and ASR with GMM
Figure 2 for Speech Diarization and ASR with GMM
Figure 3 for Speech Diarization and ASR with GMM
Viaarxiv icon

Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization

Add code
Bookmark button
Alert button
Sep 28, 2023
Thilo von Neumann, Christoph Boeddeker, Tobias Cord-Landwehr, Marc Delcroix, Reinhold Haeb-Umbach

Figure 1 for Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization
Figure 2 for Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization
Figure 3 for Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization
Figure 4 for Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization
Viaarxiv icon

ASDF: A Differential Testing Framework for Automatic Speech Recognition Systems

Add code
Bookmark button
Alert button
Feb 11, 2023
Daniel Hao Xian Yuen, Andrew Yong Chen Pang, Zhou Yang, Chun Yong Chong, Mei Kuan Lim, David Lo

Figure 1 for ASDF: A Differential Testing Framework for Automatic Speech Recognition Systems
Viaarxiv icon

Crowdsourced and Automatic Speech Prominence Estimation

Add code
Bookmark button
Alert button
Oct 12, 2023
Max Morrison, Pranav Pawar, Nathan Pruyne, Jennifer Cole, Bryan Pardo

Figure 1 for Crowdsourced and Automatic Speech Prominence Estimation
Figure 2 for Crowdsourced and Automatic Speech Prominence Estimation
Figure 3 for Crowdsourced and Automatic Speech Prominence Estimation
Figure 4 for Crowdsourced and Automatic Speech Prominence Estimation
Viaarxiv icon

A Sidecar Separator Can Convert a Single-Speaker Speech Recognition System to a Multi-Speaker One

Add code
Bookmark button
Alert button
Feb 20, 2023
Lingwei Meng, Jiawen Kang, Mingyu Cui, Yuejiao Wang, Xixin Wu, Helen Meng

Figure 1 for A Sidecar Separator Can Convert a Single-Speaker Speech Recognition System to a Multi-Speaker One
Figure 2 for A Sidecar Separator Can Convert a Single-Speaker Speech Recognition System to a Multi-Speaker One
Figure 3 for A Sidecar Separator Can Convert a Single-Speaker Speech Recognition System to a Multi-Speaker One
Figure 4 for A Sidecar Separator Can Convert a Single-Speaker Speech Recognition System to a Multi-Speaker One
Viaarxiv icon

Improving CTC-AED model with integrated-CTC and auxiliary loss regularization

Aug 15, 2023
Daobin Zhu, Xiangdong Su, Hongbin Zhang

Figure 1 for Improving CTC-AED model with integrated-CTC and auxiliary loss regularization
Figure 2 for Improving CTC-AED model with integrated-CTC and auxiliary loss regularization
Figure 3 for Improving CTC-AED model with integrated-CTC and auxiliary loss regularization
Figure 4 for Improving CTC-AED model with integrated-CTC and auxiliary loss regularization
Viaarxiv icon

ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing

Add code
Bookmark button
Alert button
Oct 17, 2023
Quoc-Nam Nguyen, Thang Chau Phan, Duc-Vu Nguyen, Kiet Van Nguyen

Viaarxiv icon

Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation

Add code
Bookmark button
Alert button
Jun 14, 2023
Zheng Liang, Zheshu Song, Ziyang Ma, Chenpeng Du, Kai Yu, Xie Chen

Figure 1 for Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation
Figure 2 for Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation
Figure 3 for Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation
Figure 4 for Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation
Viaarxiv icon

PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers

Mar 30, 2023
Rahul Pandey, Roger Ren, Qi Luo, Jing Liu, Ariya Rastrow, Ankur Gandhe, Denis Filimonov, Grant Strimel, Andreas Stolcke, Ivan Bulyko

Figure 1 for PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers
Figure 2 for PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers
Figure 3 for PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers
Figure 4 for PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers
Viaarxiv icon

AdVerb: Visually Guided Audio Dereverberation

Aug 23, 2023
Sanjoy Chowdhury, Sreyan Ghosh, Subhrajyoti Dasgupta, Anton Ratnarajah, Utkarsh Tyagi, Dinesh Manocha

Figure 1 for AdVerb: Visually Guided Audio Dereverberation
Figure 2 for AdVerb: Visually Guided Audio Dereverberation
Figure 3 for AdVerb: Visually Guided Audio Dereverberation
Figure 4 for AdVerb: Visually Guided Audio Dereverberation
Viaarxiv icon