Alert button
Picture for Tomoki Hayashi

Tomoki Hayashi

Alert button

ESPnet2-TTS: Extending the Edge of TTS Research

Add code
Bookmark button
Alert button
Oct 15, 2021
Tomoki Hayashi, Ryuichi Yamamoto, Takenori Yoshimura, Peter Wu, Jiatong Shi, Takaaki Saeki, Yooncheol Ju, Yusuke Yasuda, Shinnosuke Takamichi, Shinji Watanabe

Figure 1 for ESPnet2-TTS: Extending the Edge of TTS Research
Figure 2 for ESPnet2-TTS: Extending the Edge of TTS Research
Figure 3 for ESPnet2-TTS: Extending the Edge of TTS Research
Figure 4 for ESPnet2-TTS: Extending the Edge of TTS Research
Viaarxiv icon

S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations

Add code
Bookmark button
Alert button
Oct 12, 2021
Wen-Chin Huang, Shu-Wen Yang, Tomoki Hayashi, Hung-Yi Lee, Shinji Watanabe, Tomoki Toda

Figure 1 for S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations
Figure 2 for S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations
Figure 3 for S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations
Figure 4 for S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations
Viaarxiv icon

On Prosody Modeling for ASR+TTS based Voice Conversion

Add code
Bookmark button
Alert button
Jul 20, 2021
Wen-Chin Huang, Tomoki Hayashi, Xinjian Li, Shinji Watanabe, Tomoki Toda

Figure 1 for On Prosody Modeling for ASR+TTS based Voice Conversion
Figure 2 for On Prosody Modeling for ASR+TTS based Voice Conversion
Figure 3 for On Prosody Modeling for ASR+TTS based Voice Conversion
Figure 4 for On Prosody Modeling for ASR+TTS based Voice Conversion
Viaarxiv icon

Anomalous Sound Detection Using a Binary Classification Model and Class Centroids

Add code
Bookmark button
Alert button
Jun 11, 2021
Ibuki Kuroyanagi, Tomoki Hayashi, Kazuya Takeda, Tomoki Toda

Figure 1 for Anomalous Sound Detection Using a Binary Classification Model and Class Centroids
Figure 2 for Anomalous Sound Detection Using a Binary Classification Model and Class Centroids
Figure 3 for Anomalous Sound Detection Using a Binary Classification Model and Class Centroids
Viaarxiv icon

Non-autoregressive sequence-to-sequence voice conversion

Add code
Bookmark button
Alert button
Apr 14, 2021
Tomoki Hayashi, Wen-Chin Huang, Kazuhiro Kobayashi, Tomoki Toda

Figure 1 for Non-autoregressive sequence-to-sequence voice conversion
Figure 2 for Non-autoregressive sequence-to-sequence voice conversion
Figure 3 for Non-autoregressive sequence-to-sequence voice conversion
Figure 4 for Non-autoregressive sequence-to-sequence voice conversion
Viaarxiv icon

crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder

Add code
Bookmark button
Alert button
Mar 04, 2021
Kazuhiro Kobayashi, Wen-Chin Huang, Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Tomoki Toda

Figure 1 for crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder
Figure 2 for crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder
Viaarxiv icon

The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans

Add code
Bookmark button
Alert button
Dec 23, 2020
Shinji Watanabe, Florian Boyer, Xuankai Chang, Pengcheng Guo, Tomoki Hayashi, Yosuke Higuchi, Takaaki Hori, Wen-Chin Huang, Hirofumi Inaguma, Naoyuki Kamo, Shigeki Karita, Chenda Li, Jing Shi, Aswin Shanmugam Subramanian, Wangyou Zhang

Figure 1 for The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
Figure 2 for The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
Viaarxiv icon

Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations

Add code
Bookmark button
Alert button
Oct 23, 2020
Wen-Chin Huang, Yi-Chiao Wu, Tomoki Hayashi, Tomoki Toda

Figure 1 for Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations
Figure 2 for Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations
Figure 3 for Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations
Figure 4 for Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations
Viaarxiv icon

The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS

Add code
Bookmark button
Alert button
Oct 06, 2020
Wen-Chin Huang, Tomoki Hayashi, Shinji Watanabe, Tomoki Toda

Figure 1 for The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS
Figure 2 for The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS
Figure 3 for The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS
Figure 4 for The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS
Viaarxiv icon

Pretraining Techniques for Sequence-to-Sequence Voice Conversion

Add code
Bookmark button
Alert button
Aug 07, 2020
Wen-Chin Huang, Tomoki Hayashi, Yi-Chiao Wu, Hirokazu Kameoka, Tomoki Toda

Figure 1 for Pretraining Techniques for Sequence-to-Sequence Voice Conversion
Figure 2 for Pretraining Techniques for Sequence-to-Sequence Voice Conversion
Figure 3 for Pretraining Techniques for Sequence-to-Sequence Voice Conversion
Figure 4 for Pretraining Techniques for Sequence-to-Sequence Voice Conversion
Viaarxiv icon