Alert button
Picture for Tomoki Toda

Tomoki Toda

Alert button

Learning Multidimensional Disentangled Representations of Instrumental Sounds for Musical Similarity Assessment

Add code
Bookmark button
Alert button
Apr 10, 2024
Yuka Hashizume, Li Li, Atsushi Miyashita, Tomoki Toda

Viaarxiv icon

Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection

Add code
Bookmark button
Alert button
Mar 18, 2024
Takuya Fujimura, Keisuke Imoto, Tomoki Toda

Figure 1 for Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection
Figure 2 for Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection
Figure 3 for Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection
Figure 4 for Discriminative Neighborhood Smoothing for Generative Anomalous Sound Detection
Viaarxiv icon

Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment

Add code
Bookmark button
Alert button
Mar 10, 2024
Yusuke Yasuda, Tomoki Toda

Figure 1 for Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Figure 2 for Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Figure 3 for Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Figure 4 for Automatic design optimization of preference-based subjective evaluation with online learning in crowdsourcing environment
Viaarxiv icon

MF-AED-AEC: Speech Emotion Recognition by Leveraging Multimodal Fusion, ASR Error Detection, and ASR Error Correction

Add code
Bookmark button
Alert button
Jan 24, 2024
Jiajun He, Xiaohan Shi, Xingfeng Li, Tomoki Toda

Viaarxiv icon

On the Effectiveness of ASR Representations in Real-world Noisy Speech Emotion Recognition

Add code
Bookmark button
Alert button
Nov 14, 2023
Xiaohan Shi, Jiajun He, Xingfeng Li, Tomoki Toda

Viaarxiv icon

A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023

Add code
Bookmark button
Alert button
Oct 08, 2023
Ryuichi Yamamoto, Reo Yoneyama, Lester Phillip Violeta, Wen-Chin Huang, Tomoki Toda

Figure 1 for A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023
Figure 2 for A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023
Figure 3 for A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023
Figure 4 for A Comparative Study of Voice Conversion Models with Large-Scale Speech and Singing Data: The T13 Systems for the Singing Voice Conversion Challenge 2023
Viaarxiv icon

ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correction

Add code
Bookmark button
Alert button
Oct 08, 2023
Jiajun He, Zekun Yang, Tomoki Toda

Figure 1 for ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correction
Figure 2 for ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correction
Figure 3 for ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correction
Figure 4 for ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correction
Viaarxiv icon

The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains

Add code
Bookmark button
Alert button
Oct 07, 2023
Erica Cooper, Wen-Chin Huang, Yu Tsao, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi

Figure 1 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 2 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 3 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Figure 4 for The VoiceMOS Challenge 2023: Zero-shot Subjective Speech Quality Prediction for Multiple Domains
Viaarxiv icon

Improving severity preservation of healthy-to-pathological voice conversion with global style tokens

Add code
Bookmark button
Alert button
Oct 04, 2023
Bence Mark Halpern, Wen-Chin Huang, Lester Phillip Violeta, R. J. J. H. van Son, Tomoki Toda

Figure 1 for Improving severity preservation of healthy-to-pathological voice conversion with global style tokens
Figure 2 for Improving severity preservation of healthy-to-pathological voice conversion with global style tokens
Figure 3 for Improving severity preservation of healthy-to-pathological voice conversion with global style tokens
Figure 4 for Improving severity preservation of healthy-to-pathological voice conversion with global style tokens
Viaarxiv icon