"speech": models, code, and papers

Noisy Speech Based Temporal Decomposition to Improve Fundamental Frequency Estimation

Dec 18, 2021
A. Queiroz, R. Coelho

Via arXiv

REAL-M: Towards Speech Separation on Real Mixtures

Oct 20, 2021
Cem Subakan, Mirco Ravanelli, Samuele Cornell, François Grondin

Via arXiv

To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets

Jun 16, 2022
Jean-Marc Valin, Ritwik Giri, Shrikant Venkataramani, Umut Isik, Arvindh Krishnaswamy

Via arXiv

Unsupervised Domain Adaptation for Hate Speech Detection Using a Data Augmentation Approach

Jul 31, 2021
Sheikh Muhammad Sarwar, Vanessa Murdock

Via arXiv

Sampling Rate Offset Estimation and Compensation for Distributed Adaptive Node-Specific Signal Estimation in Wireless Acoustic Sensor Networks

Nov 04, 2022
Paul Didier, Toon van Waterschoot, Simon Doclo, Marc Moonen

Via arXiv

Improving Pseudo-label Training For End-to-end Speech Recognition Using Gradient Mask

Oct 08, 2021
Shaoshi Ling, Chen Shen, Meng Cai, Zejun Ma

Via arXiv

Glow-WaveGAN: Learning Speech Representations from GAN-based Variational Auto-Encoder For High Fidelity Flow-based Speech Synthesis

Jun 22, 2021
Jian Cong, Shan Yang, Lei Xie, Dan Su

Via arXiv

Exploiting Pre-Trained ASR Models for Alzheimer's Disease Recognition Through Spontaneous Speech

Oct 04, 2021
Ying Qin, Wei Liu, Zhiyuan Peng, Si-Ioi Ng, Jingyu Li, Haibo Hu, Tan Lee

Via arXiv

AVASpeech-SMAD: A Strongly Labelled Speech and Music Activity Detection Dataset with Label Co-Occurrence

Nov 02, 2021
Yun-Ning Hung, Karn N. Watcharasupat, Chih-Wei Wu, Iroro Orife, Kelian Li, Pavan Seshadri, Junyoung Lee

Via arXiv

"Notic My Speech" -- Blending Speech Patterns With Multimedia

Jun 12, 2020
Dhruva Sahrawat, Yaman Kumar, Shashwat Aggarwal, Yifang Yin, Rajiv Ratn Shah, Roger Zimmermann

Via arXiv