Xiao-Lei Zhang

MDD: a Mask Diffusion Detector to Protect Speaker Verification Systems from Adversarial Perturbations

Aug 26, 2025
Via arXiv

PadAug: Robust Speaker Verification with Simple Waveform-Level Silence Padding

Aug 20, 2025
Via arXiv

Angle-distance decomposition based on deep learning for active sonar detection

Jul 28, 2025
Via arXiv

Bridging the Gap between Continuous and Informative Discrete Representations by Random Product Quantization

Apr 07, 2025
Via arXiv

DualSpec: Text-to-spatial-audio Generation via Dual-Spectrogram Guided Diffusion Model

Feb 26, 2025
Via arXiv

UniForm: A Unified Diffusion Transformer for Audio-Video Generation

Feb 08, 2025
Via arXiv

Enhancing Intelligibility for Generative Target Speech Extraction via Joint Optimization with Target Speaker ASR

Jan 24, 2025
Via arXiv

Speaker Contrastive Learning for Source Speaker Tracing

Sep 16, 2024
Via arXiv

Rethinking the Output Architecture for Sound Source Localization

Nov 21, 2023
Via arXiv

Diffusion-Based Adversarial Purification for Speaker Verification

Oct 24, 2023
Via arXiv