Picture for Jiangyan Yi

Jiangyan Yi

Enhancing Partially Spoofed Audio Localization with Boundary-aware Attention Mechanism

Add code
Jul 31, 2024
Viaarxiv icon

An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio

Add code
Jul 11, 2024
Figure 1 for An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio
Figure 2 for An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio
Figure 3 for An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio
Figure 4 for An Unsupervised Domain Adaptation Method for Locating Manipulated Region in partially fake Audio
Viaarxiv icon

Frequency-mix Knowledge Distillation for Fake Speech Detection

Add code
Jun 14, 2024
Figure 1 for Frequency-mix Knowledge Distillation for Fake Speech Detection
Figure 2 for Frequency-mix Knowledge Distillation for Fake Speech Detection
Figure 3 for Frequency-mix Knowledge Distillation for Fake Speech Detection
Figure 4 for Frequency-mix Knowledge Distillation for Fake Speech Detection
Viaarxiv icon

RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection

Add code
Jun 10, 2024
Figure 1 for RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection
Figure 2 for RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection
Figure 3 for RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection
Figure 4 for RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection
Viaarxiv icon

TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking

Add code
Jun 07, 2024
Figure 1 for TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking
Figure 2 for TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking
Figure 3 for TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking
Figure 4 for TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking
Viaarxiv icon

EVDA: Evolving Deepfake Audio Detection Continual Learning Benchmark

Add code
May 15, 2024
Viaarxiv icon

MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition

Add code
Apr 29, 2024
Figure 1 for MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition
Figure 2 for MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition
Figure 3 for MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition
Figure 4 for MER 2024: Semi-Supervised Learning, Noise Robustness, and Open-Vocabulary Multimodal Emotion Recognition
Viaarxiv icon

What to Remember: Self-Adaptive Continual Learning for Audio Deepfake Detection

Add code
Dec 15, 2023
Viaarxiv icon

Learning to Behave Like Clean Speech: Dual-Branch Knowledge Distillation for Noise-Robust Fake Audio Detection

Add code
Oct 13, 2023
Figure 1 for Learning to Behave Like Clean Speech: Dual-Branch Knowledge Distillation for Noise-Robust Fake Audio Detection
Figure 2 for Learning to Behave Like Clean Speech: Dual-Branch Knowledge Distillation for Noise-Robust Fake Audio Detection
Figure 3 for Learning to Behave Like Clean Speech: Dual-Branch Knowledge Distillation for Noise-Robust Fake Audio Detection
Figure 4 for Learning to Behave Like Clean Speech: Dual-Branch Knowledge Distillation for Noise-Robust Fake Audio Detection
Viaarxiv icon

Controllable Residual Speaker Representation for Voice Conversion

Add code
Sep 15, 2023
Viaarxiv icon