Alert button

"speech recognition": models, code, and papers
Alert button

Label Smoothing for Enhanced Text Sentiment Classification

Dec 11, 2023
Yijie Gao, Shijing Si

Figure 1 for Label Smoothing for Enhanced Text Sentiment Classification
Figure 2 for Label Smoothing for Enhanced Text Sentiment Classification
Figure 3 for Label Smoothing for Enhanced Text Sentiment Classification
Figure 4 for Label Smoothing for Enhanced Text Sentiment Classification
Viaarxiv icon

Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults

Add code
Bookmark button
Alert button
Sep 12, 2023
Ahmed Adel Attia, Jing Liu, Wei Ai, Dorottya Demszky, Carol Espy-Wilson

Figure 1 for Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults
Figure 2 for Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults
Figure 3 for Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults
Figure 4 for Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults
Viaarxiv icon

Generative error correction for code-switching speech recognition using large language models

Add code
Bookmark button
Alert button
Oct 17, 2023
Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Hexin Liu, Sabato Marco Siniscalchi, Eng Siong Chng

Figure 1 for Generative error correction for code-switching speech recognition using large language models
Figure 2 for Generative error correction for code-switching speech recognition using large language models
Figure 3 for Generative error correction for code-switching speech recognition using large language models
Figure 4 for Generative error correction for code-switching speech recognition using large language models
Viaarxiv icon

Unimodal Aggregation for CTC-based Speech Recognition

Add code
Bookmark button
Alert button
Sep 15, 2023
Ying Fang, Xiaofei Li

Figure 1 for Unimodal Aggregation for CTC-based Speech Recognition
Figure 2 for Unimodal Aggregation for CTC-based Speech Recognition
Figure 3 for Unimodal Aggregation for CTC-based Speech Recognition
Figure 4 for Unimodal Aggregation for CTC-based Speech Recognition
Viaarxiv icon

Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study

Add code
Bookmark button
Alert button
Sep 27, 2023
Xuankai Chang, Brian Yan, Kwanghee Choi, Jeeweon Jung, Yichen Lu, Soumi Maiti, Roshan Sharma, Jiatong Shi, Jinchuan Tian, Shinji Watanabe, Yuya Fujita, Takashi Maekaku, Pengcheng Guo, Yao-Fei Cheng, Pavel Denisov, Kohei Saijo, Hsiu-Hsuan Wang

Figure 1 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Figure 2 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Figure 3 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Figure 4 for Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
Viaarxiv icon

Speech and Text-Based Emotion Recognizer

Dec 10, 2023
Varun Sharma

Viaarxiv icon

Efficiency-oriented approaches for self-supervised speech representation learning

Dec 18, 2023
Luis Lugo, Valentin Vielzeuf

Viaarxiv icon

Phoneme-aware Encoding for Prefix-tree-based Contextual ASR

Dec 15, 2023
Hayato Futami, Emiru Tsunoo, Yosuke Kashiwagi, Hiroaki Ogawa, Siddhant Arora, Shinji Watanabe

Viaarxiv icon

KinSPEAK: Improving speech recognition for Kinyarwanda via semi-supervised learning methods

Aug 23, 2023
Antoine Nzeyimana

Figure 1 for KinSPEAK: Improving speech recognition for Kinyarwanda via semi-supervised learning methods
Figure 2 for KinSPEAK: Improving speech recognition for Kinyarwanda via semi-supervised learning methods
Figure 3 for KinSPEAK: Improving speech recognition for Kinyarwanda via semi-supervised learning methods
Figure 4 for KinSPEAK: Improving speech recognition for Kinyarwanda via semi-supervised learning methods
Viaarxiv icon

Seq2seq for Automatic Paraphasia Detection in Aphasic Speech

Dec 16, 2023
Matthew Perez, Duc Le, Amrit Romana, Elise Jones, Keli Licata, Emily Mower Provost

Viaarxiv icon