Picture for Haibin Wu

Haibin Wu

Laugh Now Cry Later: Controlling Time-Varying Emotional States of Flow-Matching-Based Zero-Shot Text-to-Speech

Add code
Jul 17, 2024
Viaarxiv icon

CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems

Add code
Jun 11, 2024
Figure 1 for CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems
Figure 2 for CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems
Figure 3 for CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems
Figure 4 for CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems
Viaarxiv icon

Neural Codec-based Adversarial Sample Detection for Speaker Verification

Add code
Jun 07, 2024
Viaarxiv icon

Emo-bias: A Large Scale Evaluation of Social Bias on Speech Emotion Recognition

Add code
Jun 07, 2024
Viaarxiv icon

Singing Voice Graph Modeling for SingFake Detection

Add code
Jun 05, 2024
Viaarxiv icon

A Large-Scale Evaluation of Speech Foundation Models

Add code
Apr 15, 2024
Figure 1 for A Large-Scale Evaluation of Speech Foundation Models
Figure 2 for A Large-Scale Evaluation of Speech Foundation Models
Figure 3 for A Large-Scale Evaluation of Speech Foundation Models
Figure 4 for A Large-Scale Evaluation of Speech Foundation Models
Viaarxiv icon

EMO-SUPERB: An In-depth Look at Speech Emotion Recognition

Add code
Feb 22, 2024
Figure 1 for EMO-SUPERB: An In-depth Look at Speech Emotion Recognition
Figure 2 for EMO-SUPERB: An In-depth Look at Speech Emotion Recognition
Figure 3 for EMO-SUPERB: An In-depth Look at Speech Emotion Recognition
Figure 4 for EMO-SUPERB: An In-depth Look at Speech Emotion Recognition
Viaarxiv icon

Towards audio language modeling -- an overview

Add code
Feb 20, 2024
Figure 1 for Towards audio language modeling -- an overview
Figure 2 for Towards audio language modeling -- an overview
Figure 3 for Towards audio language modeling -- an overview
Figure 4 for Towards audio language modeling -- an overview
Viaarxiv icon

Codec-SUPERB: An In-Depth Analysis of Sound Codec Models

Add code
Feb 20, 2024
Figure 1 for Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
Figure 2 for Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
Figure 3 for Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
Figure 4 for Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
Viaarxiv icon

Scalable Ensemble-based Detection Method against Adversarial Attacks for speaker verification

Add code
Dec 14, 2023
Figure 1 for Scalable Ensemble-based Detection Method against Adversarial Attacks for speaker verification
Figure 2 for Scalable Ensemble-based Detection Method against Adversarial Attacks for speaker verification
Figure 3 for Scalable Ensemble-based Detection Method against Adversarial Attacks for speaker verification
Figure 4 for Scalable Ensemble-based Detection Method against Adversarial Attacks for speaker verification
Viaarxiv icon