Hung-yi Lee

CodecFake: Enhancing Anti-Spoofing Models Against Deepfake Audios from Codec-Based Speech Synthesis Systems

Jun 11, 2024
Via arXiv

Do Prompts Really Prompt? Exploring the Prompt Understanding Capability of Whisper

Jun 09, 2024
Via arXiv

DAISY: Data Adaptive Self-Supervised Early Exit for Speech Representation Models

Jun 08, 2024
Via arXiv

Emo-bias: A Large Scale Evaluation of Social Bias on Speech Emotion Recognition

Jun 07, 2024
Via arXiv

Neural Codec-based Adversarial Sample Detection for Speaker Verification

Jun 07, 2024
Via arXiv

On the social bias of speech self-supervised models

Jun 07, 2024
Via arXiv

Singing Voice Graph Modeling for SingFake Detection

Jun 05, 2024
Via arXiv

Dataset-Distillation Generative Model for Speech Emotion Recognition

Jun 05, 2024
Via arXiv

SYN2REAL: Leveraging Task Arithmetic for Mitigating Synthetic-Real Discrepancies in ASR Domain Adaptation

Jun 05, 2024
Via arXiv

InstructionCP: A fast approach to transfer Large Language Models into target language

May 30, 2024
Via arXiv