"speech": models, code, and papers

Wav2vec-based Detection and Severity Level Classification of Dysarthria from Speech

Sep 25, 2023
Farhad Javanmardi, Saska Tirronen, Manila Kodali, Sudarsana Reddy Kadiri, Paavo Alku

LanSER: Language-Model Supported Speech Emotion Recognition

Sep 07, 2023
Taesik Gong, Josh Belanich, Krishna Somandepalli, Arsha Nagrani, Brian Eoff, Brendan Jou

Crowdotic: A Privacy-Preserving Hospital Waiting Room Crowd Density Estimation with Non-speech Audio

Sep 20, 2023
Forsad Al Hossain, Tanjid Hasan Tonmoy, Andrew A. Lover, George A. Corey, Mohammad Arif Ul Alam, Tauhidur Rahman

Noise robust speech emotion recognition with signal-to-noise ratio adapting speech enhancement

Sep 03, 2023
Yu-Wen Chen, Julia Hirschberg, Yu Tsao

Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition

Sep 21, 2023
Chen Xu, Xiaoqian Liu, Erfeng He, Yuhao Zhang, Qianqian Dong, Tong Xiao, Jingbo Zhu, Dapeng Man, Wu Yang

Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems

Nov 20, 2023
Guangjing Wang, Ce Zhou, Yuanda Wang, Bocheng Chen, Hanqing Guo, Qiben Yan

PARK: Parkinson's Analysis with Remote Kinetic-tasks

Nov 21, 2023
Md Saiful Islam, Sangwu Lee, Abdelrahman Abdelkader, Sooyong Park, Ehsan Hoque

Continuous Modeling of the Denoising Process for Speech Enhancement Based on Deep Learning

Sep 17, 2023
Zilu Guo, Jun Du, Chin-Hui Lee

Synthetic Speaking Children -- Why We Need Them and How to Make Them

Nov 08, 2023
Muhammad Ali Farooq, Dan Bigioi, Rishabh Jain, Wang Yao, Mariam Yiwere, Peter Corcoran

Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech

Sep 15, 2023
Junjie Li, Ruijie Tao, Zexu Pan, Meng Ge, Shuai Wang, Haizhou Li
