Picture for Tomoki Toda

Tomoki Toda

The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains

Add code
Oct 07, 2023
Figure 1 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 2 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 3 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 4 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Viaarxiv icon

Improving severity preservation of healthy-to-pathological voice conversion with global style tokens

Add code
Oct 04, 2023
Viaarxiv icon

Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders

Add code
Sep 18, 2023
Figure 1 for Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders
Figure 2 for Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders
Figure 3 for Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders
Figure 4 for Electrolaryngeal Speech Intelligibility Enhancement Through Robust Linguistic Encoders
Viaarxiv icon

AAS-VC: On the Generalization Ability of Automatic Alignment Search based Non-autoregressive Sequence-to-sequence Voice Conversion

Add code
Sep 15, 2023
Figure 1 for AAS-VC: On the Generalization Ability of Automatic Alignment Search based Non-autoregressive Sequence-to-sequence Voice Conversion
Figure 2 for AAS-VC: On the Generalization Ability of Automatic Alignment Search based Non-autoregressive Sequence-to-sequence Voice Conversion
Figure 3 for AAS-VC: On the Generalization Ability of Automatic Alignment Search based Non-autoregressive Sequence-to-sequence Voice Conversion
Figure 4 for AAS-VC: On the Generalization Ability of Automatic Alignment Search based Non-autoregressive Sequence-to-sequence Voice Conversion
Viaarxiv icon

Audio Difference Learning for Audio Captioning

Add code
Sep 15, 2023
Figure 1 for Audio Difference Learning for Audio Captioning
Figure 2 for Audio Difference Learning for Audio Captioning
Figure 3 for Audio Difference Learning for Audio Captioning
Viaarxiv icon

Evaluating Methods for Ground-Truth-Free Foreign Accent Conversion

Add code
Sep 05, 2023
Viaarxiv icon

Preference-based training framework for automatic speech quality assessment using deep neural network

Add code
Aug 29, 2023
Viaarxiv icon

The Singing Voice Conversion Challenge 2023

Add code
Jul 06, 2023
Figure 1 for The Singing Voice Conversion Challenge 2023
Figure 2 for The Singing Voice Conversion Challenge 2023
Figure 3 for The Singing Voice Conversion Challenge 2023
Figure 4 for The Singing Voice Conversion Challenge 2023
Viaarxiv icon

An Analysis of Personalized Speech Recognition System Development for the Deaf and Hard-of-Hearing

Add code
Jun 24, 2023
Viaarxiv icon

Text-to-speech synthesis based on latent variable conversion using diffusion probabilistic model and variational autoencoder

Add code
Dec 16, 2022
Viaarxiv icon