Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Wen-Chin Huang

Non-autoregressive sequence-to-sequence voice conversion


Apr 14, 2021
Tomoki Hayashi, Wen-Chin Huang, Kazuhiro Kobayashi, Tomoki Toda

* Accepted to ICASSP2021. Demo HP: https://kan-bayashi.github.io/NonARSeq2SeqVC/ 

  Access Paper or Ask Questions

The AS-NU System for the M2VoC Challenge


Apr 07, 2021
Cheng-Hung Hu, Yi-Chiao Wu, Wen-Chin Huang, Yu-Huai Peng, Yu-Wen Chen, Pin-Jui Ku, Tomoki Toda, Yu Tsao, Hsin-Min Wang


  Access Paper or Ask Questions

crank: An Open-Source Software for Nonparallel Voice Conversion Based on Vector-Quantized Variational Autoencoder


Mar 04, 2021
Kazuhiro Kobayashi, Wen-Chin Huang, Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Tomoki Toda

* Accepted to ICASSP 2021 

  Access Paper or Ask Questions

EMA2S: An End-to-End Multimodal Articulatory-to-Speech System


Feb 07, 2021
Yu-Wen Chen, Kuo-Hsuan Hung, Shang-Yi Chuang, Jonathan Sherman, Wen-Chin Huang, Xugang Lu, Yu Tsao


  Access Paper or Ask Questions

Speech Recognition by Simply Fine-tuning BERT


Jan 30, 2021
Wen-Chin Huang, Chia-Hua Wu, Shang-Bao Luo, Kuan-Yu Chen, Hsin-Min Wang, Tomoki Toda

* Accepted to ICASSP 2021 

  Access Paper or Ask Questions

The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans


Dec 23, 2020
Shinji Watanabe, Florian Boyer, Xuankai Chang, Pengcheng Guo, Tomoki Hayashi, Yosuke Higuchi, Takaaki Hori, Wen-Chin Huang, Hirofumi Inaguma, Naoyuki Kamo, Shigeki Karita, Chenda Li, Jing Shi, Aswin Shanmugam Subramanian, Wangyou Zhang


  Access Paper or Ask Questions

Any-to-One Sequence-to-Sequence Voice Conversion using Self-Supervised Discrete Speech Representations


Oct 23, 2020
Wen-Chin Huang, Yi-Chiao Wu, Tomoki Hayashi, Tomoki Toda

* Submitted to ICASSP 2021 

  Access Paper or Ask Questions

The NU Voice Conversion System for the Voice Conversion Challenge 2020: On the Effectiveness of Sequence-to-sequence Models and Autoregressive Neural Vocoders


Oct 09, 2020
Wen-Chin Huang, Patrick Lumban Tobing, Yi-Chiao Wu, Kazuhiro Kobayashi, Tomoki Toda

* Accepted to the ISCA Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020 

  Access Paper or Ask Questions

The Sequence-to-Sequence Baseline for the Voice Conversion Challenge 2020: Cascading ASR and TTS


Oct 06, 2020
Wen-Chin Huang, Tomoki Hayashi, Shinji Watanabe, Tomoki Toda

* Accepted to the ISCA Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge 2020 

  Access Paper or Ask Questions

Pretraining Techniques for Sequence-to-Sequence Voice Conversion


Aug 07, 2020
Wen-Chin Huang, Tomoki Hayashi, Yi-Chiao Wu, Hirokazu Kameoka, Tomoki Toda

* Preprint. Under review 

  Access Paper or Ask Questions

Many-to-Many Voice Transformer Network


Jun 07, 2020
Hirokazu Kameoka, Wen-Chin Huang, Kou Tanaka, Takuhiro Kaneko, Nobukatsu Hojo, Tomoki Toda

* submitted to IEEE/ACM Trans. ASLP 

  Access Paper or Ask Questions

Unsupervised Representation Disentanglement using Cross Domain Features and Adversarial Learning in Variational Autoencoder based Voice Conversion


Feb 07, 2020
Wen-Chin Huang, Hao Luo, Hsin-Te Hwang, Chen-Chou Lo, Yu-Huai Peng, Yu Tsao, Hsin-Min Wang

* Accepted to IEEE Transactions on Emerging Topics in Computational Intelligence 

  Access Paper or Ask Questions

Voice Transformer Network: Sequence-to-Sequence Voice Conversion Using Transformer with Text-to-Speech Pretraining


Dec 14, 2019
Wen-Chin Huang, Tomoki Hayashi, Yi-Chiao Wu, Hirokazu Kameoka, Tomoki Toda

* Preprint. Work in progress 

  Access Paper or Ask Questions

MOSNet: Deep Learning based Objective Assessment for Voice Conversion


Apr 26, 2019
Chen-Chou Lo, Szu-Wei Fu, Wen-Chin Huang, Xin Wang, Junichi Yamagishi, Yu Tsao, Hsin-Min Wang

* Submitted to Interspeech2019 

  Access Paper or Ask Questions

Refined WaveNet Vocoder for Variational Autoencoder Based Voice Conversion


Nov 27, 2018
Wen-Chin Huang, Yi-Chiao Wu, Hsin-Te Hwang, Patrick Lumban Tobing, Tomoki Hayashi, Kazuhiro Kobayashi, Tomoki Toda, Yu Tsao, Hsin-Min Wang

* 5 pages, 5 figures, 2 tables. Submitted to IEEE ICASSP 2019 

  Access Paper or Ask Questions

Voice Conversion Based on Cross-Domain Features Using Variational Auto Encoders


Aug 29, 2018
Wen-Chin Huang, Hsin-Te Hwang, Yu-Huai Peng, Yu Tsao, Hsin-Min Wang

* Accepted to ISCSLP 2018 

  Access Paper or Ask Questions