Picture for Yi-Chiao Wu

Yi-Chiao Wu

EMO-Codec: A Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations

Add code
Jul 24, 2024
Figure 1 for EMO-Codec: A Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Figure 2 for EMO-Codec: A Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Figure 3 for EMO-Codec: A Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Figure 4 for EMO-Codec: A Depth Look at Emotion Preservation Capacity of Legacy and Neural Codec Models With Subjective and Objective Evaluations
Viaarxiv icon

EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation

Add code
Jun 11, 2024
Figure 1 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Figure 2 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Figure 3 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Figure 4 for EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation
Viaarxiv icon

Multi-speaker Text-to-speech Training with Speaker Anonymized Data

Add code
May 20, 2024
Viaarxiv icon

ScoreDec: A Phase-preserving High-Fidelity Audio Codec with A Generalized Score-based Diffusion Post-filter

Add code
Jan 22, 2024
Viaarxiv icon

Audiobox: Unified Audio Generation with Natural Language Prompts

Add code
Dec 25, 2023
Figure 1 for Audiobox: Unified Audio Generation with Natural Language Prompts
Figure 2 for Audiobox: Unified Audio Generation with Natural Language Prompts
Figure 3 for Audiobox: Unified Audio Generation with Natural Language Prompts
Figure 4 for Audiobox: Unified Audio Generation with Natural Language Prompts
Viaarxiv icon

AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec

Add code
May 26, 2023
Figure 1 for AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec
Figure 2 for AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec
Figure 3 for AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec
Figure 4 for AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec
Viaarxiv icon

Source-Filter HiFi-GAN: Fast and Pitch Controllable High-Fidelity Neural Vocoder

Add code
Oct 31, 2022
Figure 1 for Source-Filter HiFi-GAN: Fast and Pitch Controllable High-Fidelity Neural Vocoder
Figure 2 for Source-Filter HiFi-GAN: Fast and Pitch Controllable High-Fidelity Neural Vocoder
Viaarxiv icon

A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System

Add code
Jul 13, 2022
Figure 1 for A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System
Figure 2 for A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System
Figure 3 for A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System
Figure 4 for A Cyclical Approach to Synthetic and Natural Speech Mismatch Refinement of Neural Post-filter for Low-cost Text-to-speech System
Viaarxiv icon

Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation

Add code
May 12, 2022
Figure 1 for Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation
Figure 2 for Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation
Figure 3 for Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation
Figure 4 for Unified Source-Filter GAN with Harmonic-plus-Noise Source Excitation Generation
Viaarxiv icon

Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion

Add code
Nov 13, 2021
Figure 1 for Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion
Figure 2 for Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion
Figure 3 for Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion
Figure 4 for Direct Noisy Speech Modeling for Noisy-to-Noisy Voice Conversion
Viaarxiv icon