Alert button

"speech recognition": models, code, and papers
Alert button

DiarizationLM: Speaker Diarization Post-Processing with Large Language Models

Jan 16, 2024
Quan Wang, Yiling Huang, Guanlong Zhao, Evan Clark, Wei Xia, Hank Liao

Viaarxiv icon

Multi-Modal Emotion Recognition by Text, Speech and Video Using Pretrained Transformers

Feb 11, 2024
Minoo Shayaninasab, Bagher Babaali

Viaarxiv icon

End to end Hindi to English speech conversion using Bark, mBART and a finetuned XLSR Wav2Vec2

Jan 11, 2024
Aniket Tathe, Anand Kamble, Suyash Kumbharkar, Atharva Bhandare, Anirban C. Mitra

Viaarxiv icon

Frame-level emotional state alignment method for speech emotion recognition

Dec 27, 2023
Qifei Li, Yingming Gao, Cong Wang, Yayue Deng, Jinlong Xue, Yichen Han, Ya Li

Viaarxiv icon

Improving ASR Contextual Biasing with Guided Attention

Jan 16, 2024
Jiyang Tang, Kwangyoun Kim, Suwon Shon, Felix Wu, Prashant Sridhar, Shinji Watanabe

Viaarxiv icon

Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective

Jan 16, 2024
Alexander H. Liu, Sung-Lin Yeh, James Glass

Viaarxiv icon

DSNet: Disentangled Siamese Network with Neutral Calibration for Speech Emotion Recognition

Dec 25, 2023
Chengxin Chen, Pengyuan Zhang

Viaarxiv icon

Dementia Assessment Using Mandarin Speech with an Attention-based Speech Recognition Encoder

Add code
Bookmark button
Alert button
Oct 06, 2023
Zih-Jyun Lin, Yi-Ju Chen, Po-Chih Kuo, Likai Huang, Chaur-Jong Hu, Cheng-Yu Chen

Figure 1 for Dementia Assessment Using Mandarin Speech with an Attention-based Speech Recognition Encoder
Figure 2 for Dementia Assessment Using Mandarin Speech with an Attention-based Speech Recognition Encoder
Figure 3 for Dementia Assessment Using Mandarin Speech with an Attention-based Speech Recognition Encoder
Figure 4 for Dementia Assessment Using Mandarin Speech with an Attention-based Speech Recognition Encoder
Viaarxiv icon

XLS-R Deep Learning Model for Multilingual ASR on Low- Resource Languages: Indonesian, Javanese, and Sundanese

Jan 12, 2024
Panji Arisaputra, Alif Tri Handoyo, Amalia Zahra

Viaarxiv icon

Towards Automatic Data Augmentation for Disordered Speech Recognition

Dec 14, 2023
Zengrui Jin, Xurong Xie, Tianzi Wang, Mengzhe Geng, Jiajun Deng, Guinan Li, Shujie Hu, Xunying Liu

Viaarxiv icon