Wen-Chin Huang

The Singing Voice Conversion Challenge 2023

Jun 26, 2023
Wen-Chin Huang, Lester Phillip Violeta, Songxiang Liu, Jiatong Shi, Yusuke Yasuda, Tomoki Toda

A Holistic Cascade System, Benchmark, and Human Evaluation Protocol for Expressive Speech-to-Speech Translation

Jan 25, 2023
Wen-Chin Huang, Benjamin Peloquin, Justine Kao, Changhan Wang, Hongyu Gong, Elizabeth Salesky, Yossi Adi, Ann Lee, Peng-Jen Chen

Intermediate Fine-Tuning Using Imperfect Synthetic Speech for Improving Electrolaryngeal Speech Recognition

Nov 02, 2022
Lester Phillip Violeta, Ding Ma, Wen-Chin Huang, Tomoki Toda

A Comparative Study of Self-supervised Speech Representation Based Voice Conversion

Jul 10, 2022
Wen-Chin Huang, Shu-Wen Yang, Tomoki Hayashi, Tomoki Toda

Investigating Self-supervised Pretraining Frameworks for Pathological Speech Recognition

Mar 30, 2022
Lester Phillip Violeta, Wen-Chin Huang, Tomoki Toda

The VoiceMOS Challenge 2022

Mar 28, 2022
Wen-Chin Huang, Erica Cooper, Yu Tsao, Hsin-Min Wang, Tomoki Toda, Junichi Yamagishi

SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities

Mar 14, 2022
Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Jeff Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee

Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion

Nov 13, 2021
Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing, Wen-Chin Huang, Tomoki Toda

LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech

Oct 18, 2021
Wen-Chin Huang, Erica Cooper, Junichi Yamagishi, Tomoki Toda

Generalization Ability of MOS Prediction Networks

Oct 18, 2021
Erica Cooper, Wen-Chin Huang, Tomoki Toda, Junichi Yamagishi
