Tomoki Hayashi

ESPnet2-TTS: Extending the Edge of TTS Research

Oct 15, 2021
Via arXiv

S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations

Oct 12, 2021
Via arXiv

On Prosody Modeling for ASR+TTS based Voice Conversion

Jul 20, 2021
Via arXiv

Anomalous Sound Detection Using a Binary Classification Model and Class Centroids

Jun 11, 2021
Via arXiv

Non-autoregressive sequence-to-sequence voice conversion

Apr 14, 2021
Via arXiv

crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder

Mar 04, 2021
Via arXiv

The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans

Dec 23, 2020
Via arXiv

Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations

Oct 23, 2020
Via arXiv

The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS

Oct 06, 2020
Via arXiv

Pretraining Techniques for Sequence-to-Sequence Voice Conversion

Aug 07, 2020
Via arXiv