"speech": models, code, and papers

A Fast and Accurate Pitch Estimation Algorithm Based on the Pseudo Wigner-Ville Distribution

Oct 27, 2022
Yisi Liu, Peter Wu, Alan W Black, Gopala K. Anumanchipalli

Vocal effort modeling in neural TTS for improving the intelligibility of synthetic speech in noise

Mar 29, 2022
Tuomo Raitio, Petko Petkov, Jiangchuan Li, Muhammed Shifas, Andrea Davis, Yannis Stylianou

Guided-TTS 2: A Diffusion Model for High-quality Adaptive Text-to-Speech with Untranscribed Data

May 30, 2022
Sungwon Kim, Heeseung Kim, Sungroh Yoon

Articulation GAN: Unsupervised modeling of articulatory learning

Oct 27, 2022
Gašper Beguš, Alan Zhou, Peter Wu, Gopala K Anumanchipalli

Visual Speech-Aware Perceptual 3D Facial Expression Reconstruction from Videos

Jul 22, 2022
Panagiotis P. Filntisis, George Retsinas, Foivos Paraperas-Papantoniou, Athanasios Katsamanis, Anastasios Roussos, Petros Maragos

Can Self-Supervised Learning solve the problem of child speech recognition?

Apr 06, 2022
Rishabh Jain, Mariam Yiwere, Dan Bigioi, Peter Corcoran

Multi-Channel Speech Denoising for Machine Ears

Feb 17, 2022
Cong Han, E. Merve Kaya, Kyle Hoefer, Malcolm Slaney, Simon Carlile

VLSP2022-EVJVQA Challenge: Multilingual Visual Question Answering

Feb 28, 2023
Ngan Luu-Thuy Nguyen, Nghia Hieu Nguyen, Duong T. D Vo, Khanh Quoc Tran, Kiet Van Nguyen

Chaotic Variational Auto encoder-based Adversarial Machine Learning

Feb 25, 2023
Pavan Venkata Sainadh Reddy, Yelleti Vivek, Gopi Pranay, Vadlamani Ravi

Listening to Affected Communities to Define Extreme Speech: Dataset and Experiments

Mar 22, 2022
Antonis Maronikolakis, Axel Wisiorek, Leah Nann, Haris Jabbar, Sahana Udupa, Hinrich Schuetze
