Alert button
Picture for Yi-Chiao Wu

Yi-Chiao Wu

Alert button

ScoreDec: A Phase-preserving High-Fidelity Audio Codec with A Generalized Score-based Diffusion Post-filter

Jan 22, 2024
Yi-Chiao Wu, Dejan Marković, Steven Krenn, Israel D. Gebru, Alexander Richard

Viaarxiv icon

Audiobox: Unified Audio Generation with Natural Language Prompts

Dec 25, 2023
Apoorv Vyas, Bowen Shi, Matthew Le, Andros Tjandra, Yi-Chiao Wu, Baishan Guo, Jiemin Zhang, Xinyue Zhang, Robert Adkins, William Ngan, Jeff Wang, Ivan Cruz, Bapi Akula, Akinniyi Akinyemi, Brian Ellis, Rashel Moritz, Yael Yungster, Alice Rakotoarison, Liang Tan, Chris Summers, Carleigh Wood, Joshua Lane, Mary Williamson, Wei-Ning Hsu

Viaarxiv icon

AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec

May 26, 2023
Yi-Chiao Wu, Israel D. Gebru, Dejan Marković, Alexander Richard

Figure 1 for AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec
Figure 2 for AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec
Figure 3 for AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec
Figure 4 for AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec
Viaarxiv icon

Source-Filter HiFi-GAN: Fast and Pitch Controllable High-Fidelity Neural Vocoder

Oct 31, 2022
Reo Yoneyama, Yi-Chiao Wu, Tomoki Toda

Figure 1 for Source-Filter HiFi-GAN: Fast and Pitch Controllable High-Fidelity Neural Vocoder
Figure 2 for Source-Filter HiFi-GAN: Fast and Pitch Controllable High-Fidelity Neural Vocoder
Viaarxiv icon

A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System

Jul 13, 2022
Yi-Chiao Wu, Patrick Lumban Tobing, Kazuki Yasuhara, Noriyuki Matsunaga, Yamato Ohtani, Tomoki Toda

Figure 1 for A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System
Figure 2 for A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System
Figure 3 for A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System
Figure 4 for A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System
Viaarxiv icon

Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation

May 12, 2022
Reo Yoneyama, Yi-Chiao Wu, Tomoki Toda

Figure 1 for Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation
Figure 2 for Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation
Figure 3 for Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation
Figure 4 for Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation
Viaarxiv icon

Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion

Nov 13, 2021
Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing, Wen-Chin Huang, Tomoki Toda

Figure 1 for Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion
Figure 2 for Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion
Figure 3 for Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion
Figure 4 for Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion
Viaarxiv icon

HASA-net: A non-intrusive hearing-aid speech assessment network

Nov 10, 2021
Hsin-Tien Chiang, Yi-Chiao Wu, Cheng Yu, Tomoki Toda, Hsin-Min Wang, Yih-Chun Hu, Yu Tsao

Figure 1 for HASA-net: A non-intrusive hearing-aid speech assessment network
Figure 2 for HASA-net: A non-intrusive hearing-aid speech assessment network
Figure 3 for HASA-net: A non-intrusive hearing-aid speech assessment network
Figure 4 for HASA-net: A non-intrusive hearing-aid speech assessment network
Viaarxiv icon

Noisy-to-Noisy Voice Conversion Framework with Denoising Model

Sep 22, 2021
Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing, Wen-Chin Huang, Tomoki Toda

Figure 1 for Noisy-to-Noisy Voice Conversion Framework with Denoising Model
Figure 2 for Noisy-to-Noisy Voice Conversion Framework with Denoising Model
Figure 3 for Noisy-to-Noisy Voice Conversion Framework with Denoising Model
Figure 4 for Noisy-to-Noisy Voice Conversion Framework with Denoising Model
Viaarxiv icon

Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder

Jun 10, 2021
Yi-Chiao Wu, Cheng-Hung Hu, Hung-Shin Lee, Yu-Huai Peng, Wen-Chin Huang, Yu Tsao, Hsin-Min Wang, Tomoki Toda

Figure 1 for Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder
Figure 2 for Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder
Figure 3 for Relational Data Selection for Data Augmentation of Speaker-dependent Multi-band MelGAN Vocoder
Viaarxiv icon