"speech": models, code, and papers

Multi-class Detection of Pathological Speech with Latent Features: How does it perform on unseen data?

Oct 27, 2022
Dominik Wagner, Ilja Baumann, Franziska Braun, Sebastian P. Bayerl, Elmar Nöth, Korbinian Riedhammer, Tobias Bocklet

Via arXiv

Diffusion-based Generative Speech Source Separation

Nov 02, 2022
Robin Scheibler, Youna Ji, Soo-Whan Chung, Jaeuk Byun, Soyeon Choe, Min-Seok Choi

Via arXiv

SumREN: Summarizing Reported Speech about Events in News

Dec 02, 2022
Revanth Gangi Reddy, Heba Elfardy, Hou Pong Chan, Kevin Small, Heng Ji

Via arXiv

Can ChatGPT Reproduce Human-Generated Labels? A Study of Social Computing Tasks

Apr 20, 2023
Yiming Zhu, Peixian Zhang, Ehsan-Ul Haq, Pan Hui, Gareth Tyson

Via arXiv

Human-Machine Collaboration Approaches to Build a Dialogue Dataset for Hate Speech Countering

Nov 07, 2022
Helena Bonaldi, Sara Dellantonio, Serra Sinem Tekiroglu, Marco Guerini

Via arXiv

Multi-blank Transducers for Speech Recognition

Nov 04, 2022
Hainan Xu, Fei Jia, Somshubra Majumdar, Shinji Watanabe, Boris Ginsburg

Via arXiv

Universal Source Separation with Weakly Labelled Data

May 11, 2023
Qiuqiang Kong, Ke Chen, Haohe Liu, Xingjian Du, Taylor Berg-Kirkpatrick, Shlomo Dubnov, Mark D. Plumbley

Via arXiv

The Secret Source: Incorporating Source Features to Improve Acoustic-to-Articulatory Speech Inversion

Oct 29, 2022
Yashish M. Siriwardena, Carol Espy-Wilson

Via arXiv

SkipConvGAN: Monaural Speech Dereverberation using Generative Adversarial Networks via Complex Time-Frequency Masking

Nov 22, 2022
Vinay Kothapally, J. H. L. Hansen

Via arXiv

QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion

Feb 23, 2023
Houjian Guo, Chaoran Liu, Carlos Toshinori Ishi, Hiroshi Ishiguro

Via arXiv