Alert button

"speech recognition": models, code, and papers
Alert button

Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models

Jan 03, 2024
Rita Frieske, Bertram E. Shi

Figure 1 for Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models
Figure 2 for Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models
Figure 3 for Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models
Figure 4 for Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models
Viaarxiv icon

Large Language Models are Efficient Learners of Noise-Robust Speech Recognition

Add code
Bookmark button
Alert button
Jan 19, 2024
Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Chao Zhang, Pin-Yu Chen, EnSiong Chng

Viaarxiv icon

On Robustness to Missing Video for Audiovisual Speech Recognition

Dec 19, 2023
Oscar Chang, Otavio Braga, Hank Liao, Dmitriy Serdyuk, Olivier Siohan

Figure 1 for On Robustness to Missing Video for Audiovisual Speech Recognition
Figure 2 for On Robustness to Missing Video for Audiovisual Speech Recognition
Figure 3 for On Robustness to Missing Video for Audiovisual Speech Recognition
Figure 4 for On Robustness to Missing Video for Audiovisual Speech Recognition
Viaarxiv icon

Mel-FullSubNet: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR

Feb 22, 2024
Rui Zhou, Xian Li, Ying Fang, Xiaofei Li

Viaarxiv icon

Attention-Guided Adaptation for Code-Switching Speech Recognition

Dec 14, 2023
Bobbi Aditya, Mahdin Rohmatillah, Liang-Hsuan Tai, Jen-Tzung Chien

Figure 1 for Attention-Guided Adaptation for Code-Switching Speech Recognition
Figure 2 for Attention-Guided Adaptation for Code-Switching Speech Recognition
Figure 3 for Attention-Guided Adaptation for Code-Switching Speech Recognition
Figure 4 for Attention-Guided Adaptation for Code-Switching Speech Recognition
Viaarxiv icon

Deep Photonic Reservoir Computer for Speech Recognition

Dec 11, 2023
Enrico Picco, Alessandro Lupo, Serge Massar

Viaarxiv icon

Introduction to speech recognition

Feb 01, 2024
Gabriel Dauphin

Viaarxiv icon

Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition

Jan 19, 2024
Yu Yu, Chao-Han Huck Yang, Tuan Dinh, Sungho Ryu, Jari Kolehmainen, Roger Ren, Denis Filimonov, Prashanth G. Shivakumar, Ankur Gandhe, Ariya Rastow, Jia Xu, Ivan Bulyko, Andreas Stolcke

Viaarxiv icon

Revisiting the Entropy Semiring for Neural Speech Recognition

Dec 19, 2023
Oscar Chang, Dongseong Hwang, Olivier Siohan

Figure 1 for Revisiting the Entropy Semiring for Neural Speech Recognition
Figure 2 for Revisiting the Entropy Semiring for Neural Speech Recognition
Figure 3 for Revisiting the Entropy Semiring for Neural Speech Recognition
Figure 4 for Revisiting the Entropy Semiring for Neural Speech Recognition
Viaarxiv icon

Conformer-Based Speech Recognition On Extreme Edge-Computing Devices

Dec 16, 2023
Mingbin Xu, Alex Jin, Sicheng Wang, Mu Su, Tim Ng, Henry Mason, Shiyi Han, Yaqiao Deng, Zhen Huang, Mahesh Krishnamoorthy

Viaarxiv icon