Picture for Yu Tsao

Yu Tsao

Graduate Program of Data Science, National Taiwan University and Academia Sinica, Taipei, Taiwan, Research Center for Information Technology Innovation, Academia Sinica, Taipei, Taiwan

TTSlow: Slow Down Text-to-Speech with Efficiency Robustness Evaluations

Add code
Jul 02, 2024
Figure 1 for TTSlow: Slow Down Text-to-Speech with Efficiency Robustness Evaluations
Figure 2 for TTSlow: Slow Down Text-to-Speech with Efficiency Robustness Evaluations
Figure 3 for TTSlow: Slow Down Text-to-Speech with Efficiency Robustness Evaluations
Figure 4 for TTSlow: Slow Down Text-to-Speech with Efficiency Robustness Evaluations
Viaarxiv icon

Unsupervised Face-Mask Speech Enhancement Using Generative Adversarial Networks with Human-in-the-Loop Assessment Metrics

Add code
Jul 02, 2024
Viaarxiv icon

Bridging the Gap: Integrating Pre-trained Speech Enhancement and Recognition Models for Robust Speech Recognition

Add code
Jun 18, 2024
Viaarxiv icon

SVSNet+: Enhancing Speaker Voice Similarity Assessment Models with Representations from Speech Foundation Models

Add code
Jun 12, 2024
Viaarxiv icon

Abnormal Respiratory Sound Identification Using Audio-Spectrogram Vision Transformer

Add code
May 14, 2024
Viaarxiv icon

An Investigation of Incorporating Mamba for Speech Enhancement

Add code
May 10, 2024
Viaarxiv icon

Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes

Add code
May 07, 2024
Figure 1 for Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes
Figure 2 for Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes
Figure 3 for Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes
Figure 4 for Unmasking Illusions: Understanding Human Perception of Audiovisual Deepfakes
Viaarxiv icon

Towards Environmental Preference Based Speech Enhancement For Individualised Multi-Modal Hearing Aids

Add code
Feb 26, 2024
Figure 1 for Towards Environmental Preference Based Speech Enhancement For Individualised Multi-Modal Hearing Aids
Figure 2 for Towards Environmental Preference Based Speech Enhancement For Individualised Multi-Modal Hearing Aids
Figure 3 for Towards Environmental Preference Based Speech Enhancement For Individualised Multi-Modal Hearing Aids
Figure 4 for Towards Environmental Preference Based Speech Enhancement For Individualised Multi-Modal Hearing Aids
Viaarxiv icon

Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech

Add code
Feb 26, 2024
Viaarxiv icon

Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based Contextual Cues

Add code
Feb 26, 2024
Viaarxiv icon