Alert button

"speech": models, code, and papers
Alert button

Dialogs Re-enacted Across Languages

Nov 18, 2022
Nigel G. Ward, Jonathan E. Avila, Emilia Rivas

Figure 1 for Dialogs Re-enacted Across Languages
Figure 2 for Dialogs Re-enacted Across Languages
Figure 3 for Dialogs Re-enacted Across Languages
Figure 4 for Dialogs Re-enacted Across Languages
Viaarxiv icon

Neural Architecture Search: Insights from 1000 Papers

Jan 25, 2023
Colin White, Mahmoud Safari, Rhea Sukthanker, Binxin Ru, Thomas Elsken, Arber Zela, Debadeepta Dey, Frank Hutter

Figure 1 for Neural Architecture Search: Insights from 1000 Papers
Figure 2 for Neural Architecture Search: Insights from 1000 Papers
Figure 3 for Neural Architecture Search: Insights from 1000 Papers
Figure 4 for Neural Architecture Search: Insights from 1000 Papers
Viaarxiv icon

Filterbank Learning for Small-Footprint Keyword Spotting Robust to Noise

Nov 19, 2022
Iván López-Espejo, Ram C. M. C. Shekar, Zheng-Hua Tan, Jesper Jensen, John H. L. Hansen

Figure 1 for Filterbank Learning for Small-Footprint Keyword Spotting Robust to Noise
Figure 2 for Filterbank Learning for Small-Footprint Keyword Spotting Robust to Noise
Figure 3 for Filterbank Learning for Small-Footprint Keyword Spotting Robust to Noise
Figure 4 for Filterbank Learning for Small-Footprint Keyword Spotting Robust to Noise
Viaarxiv icon

On the Impact of Noises in Crowd-Sourced Data for Speech Translation

Jul 01, 2022
Siqi Ouyang, Rong Ye, Lei Li

Figure 1 for On the Impact of Noises in Crowd-Sourced Data for Speech Translation
Figure 2 for On the Impact of Noises in Crowd-Sourced Data for Speech Translation
Figure 3 for On the Impact of Noises in Crowd-Sourced Data for Speech Translation
Figure 4 for On the Impact of Noises in Crowd-Sourced Data for Speech Translation
Viaarxiv icon

DDS: A new device-degraded speech dataset for speech enhancement

Sep 28, 2021
Haoyu Li, Junichi Yamagishi

Figure 1 for DDS: A new device-degraded speech dataset for speech enhancement
Figure 2 for DDS: A new device-degraded speech dataset for speech enhancement
Figure 3 for DDS: A new device-degraded speech dataset for speech enhancement
Figure 4 for DDS: A new device-degraded speech dataset for speech enhancement
Viaarxiv icon

Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data

May 30, 2022
Sungwon Kim, Heeseung Kim, Sungroh Yoon

Figure 1 for Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data
Figure 2 for Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data
Figure 3 for Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data
Figure 4 for Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data
Viaarxiv icon

Analyzing Hate Speech Data along Racial, Gender and Intersectional Axes

May 18, 2022
Antonis Maronikolakis, Philip Baader, Hinrich Schütze

Figure 1 for Analyzing Hate Speech Data along Racial, Gender and Intersectional Axes
Figure 2 for Analyzing Hate Speech Data along Racial, Gender and Intersectional Axes
Figure 3 for Analyzing Hate Speech Data along Racial, Gender and Intersectional Axes
Figure 4 for Analyzing Hate Speech Data along Racial, Gender and Intersectional Axes
Viaarxiv icon

Multitask Learning from Augmented Auxiliary Data for Improving Speech Emotion Recognition

Jul 12, 2022
Siddique Latif, Rajib Rana, Sara Khalifa, Raja Jurdak, Björn W. Schuller

Figure 1 for Multitask Learning from Augmented Auxiliary Data for Improving Speech Emotion Recognition
Figure 2 for Multitask Learning from Augmented Auxiliary Data for Improving Speech Emotion Recognition
Figure 3 for Multitask Learning from Augmented Auxiliary Data for Improving Speech Emotion Recognition
Figure 4 for Multitask Learning from Augmented Auxiliary Data for Improving Speech Emotion Recognition
Viaarxiv icon

Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers

Mar 30, 2022
Zhenhao Jin, Xiang Hao, Xiangdong Su

Figure 1 for Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers
Figure 2 for Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers
Figure 3 for Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers
Figure 4 for Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers
Viaarxiv icon

The Role of Voice Persona in Expressive Communication:An Argument for Relevance in Speech Synthesis Design

Sep 06, 2022
Camille Noufi, Lloyd May, Jonathan Berger

Figure 1 for The Role of Voice Persona in Expressive Communication:An Argument for Relevance in Speech Synthesis Design
Figure 2 for The Role of Voice Persona in Expressive Communication:An Argument for Relevance in Speech Synthesis Design
Viaarxiv icon