Tomoki Toda

Investigation of Japanese PnG BERT language model in text-to-speech synthesis for pitch accent language

Dec 16, 2022
Yusuke Yasuda, Tomoki Toda

Music Similarity Calculation of Individual Instrumental Sounds Using Metric Learning

Nov 15, 2022
Yuka Hashizume, Li Li, Tomoki Toda

Analysis of Noisy-target Training for DNN-based speech enhancement

Nov 02, 2022
Takuya Fujimura, Tomoki Toda

Intermediate Fine-Tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition

Nov 02, 2022
Lester Phillip Violeta, Ding Ma, Wen-Chin Huang, Tomoki Toda

Source-Filter HiFi-GAN: Fast and Pitch Controllable High-Fidelity Neural Vocoder

Oct 31, 2022
Reo Yoneyama, Yi-Chiao Wu, Tomoki Toda

NNSVS: A Neural Network-Based Singing Voice Synthesis Toolkit

Oct 28, 2022
Ryuichi Yamamoto, Reo Yoneyama, Tomoki Toda

Two-stage training method for Japanese electrolaryngeal speech enhancement based on sequence-to-sequence voice conversion

Oct 19, 2022
Ding Ma, Lester Phillip Violeta, Kazuhiro Kobayashi, Tomoki Toda

A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System

Jul 13, 2022
Yi-Chiao Wu, Patrick Lumban Tobing, Kazuki Yasuhara, Noriyuki Matsunaga, Yamato Ohtani, Tomoki Toda

A Comparative Study of Self-supervised Speech Representation Based Voice Conversion

Jul 10, 2022
Wen-Chin Huang, Shu-Wen Yang, Tomoki Hayashi, Tomoki Toda

An Evaluation of Three-Stage Voice Conversion Framework for Noisy and Reverberant Conditions

Jun 30, 2022
Yeonjong Choi, Chao Xie, Tomoki Toda
