Hemin Yang

E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS

Jun 26, 2024

An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS

Jun 09, 2024

Total-Duration-Aware Duration Modeling for Text-to-Speech Systems

Jun 06, 2024

Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like

Feb 12, 2024

Diffusion Conditional Expectation Model for Efficient and Robust Target Speech Extraction

Sep 25, 2023

Real-Time Audio-Visual End-to-End Speech Enhancement

Mar 13, 2023

Human Listening and Live Captioning: Multi-Task Training for Speech Enhancement

Jun 05, 2021