"speech": models, code, and papers

Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model

Aug 22, 2023
Yuezhou Zhang, Amos A Folarin, Judith Dineley, Pauline Conde, Valeria de Angel, Shaoxiong Sun, Yatharth Ranjan, Zulqarnain Rashid, Callum Stewart, Petroula Laiou, Heet Sankesara, Linglong Qian, Faith Matcham, Katie M White, Carolin Oetzmann, Femke Lamers, Sara Siddi, Sara Simblett, Björn W. Schuller, Srinivasan Vairavan, Til Wykes, Josep Maria Haro, Brenda WJH Penninx, Vaibhav A Narayan, Matthew Hotopf, Richard JB Dobson, Nicholas Cummins, RADAR-CNS consortium

How Much Context Does My Attention-Based ASR System Need?

Oct 24, 2023
Robert Flynn, Anton Ragni

Analysis of XLS-R for Speech Quality Assessment

Aug 23, 2023
Bastiaan Tamm, Rik Vandenberghe, Hugo Van hamme

On Feature Importance and Interpretability of Speaker Representations

Oct 19, 2023
Frederik Rautenberg, Michael Kuhlmann, Jana Wiechmann, Fritz Seebauer, Petra Wagner, Reinhold Haeb-Umbach

High-Fidelity Noise Reduction with Differentiable Signal Processing

Oct 17, 2023
Christian J. Steinmetz, Thomas Walther, Joshua D. Reiss

BadSQA: Stealthy Backdoor Attacks Using Presence Events as Triggers in Non-Intrusive Speech Quality Assessment

Sep 04, 2023
Ying Ren, Kailai Shen, Zhe Ye, Diqun Yan

Towards End-to-End Spoken Grammatical Error Correction

Nov 09, 2023
Stefano Bannò, Rao Ma, Mengjie Qian, Kate M. Knill, Mark J. F. Gales

USA: Universal Sentiment Analysis Model & Construction of Japanese Sentiment Text Classification and Part of Speech Dataset

Sep 14, 2023
Chengguang Gan, Qinghao Zhang, Tatsunori Mori

GPT-4V(ision) as A Social Media Analysis Engine

Nov 13, 2023
Hanjia Lyu, Jinfa Huang, Daoan Zhang, Yongsheng Yu, Xinyi Mou, Jinsheng Pan, Zhengyuan Yang, Zhongyu Wei, Jiebo Luo

CPPF: A contextual and post-processing-free model for automatic speech recognition

Sep 21, 2023
Lei Zhang, Zhengkun Tian, Xiang Chen, Jiaming Sun, Hongyu Xiang, Ke Ding, Guanglu Wan
