Xueyao Zhang

The Singing Voice Conversion Challenge 2025: From Singer Identity Conversion To Singing Style Conversion

Sep 19, 2025

AnyAccomp: Generalizable Accompaniment Generation via Quantized Melodic Bottleneck

Sep 17, 2025

Audio Deepfake Verification

Sep 10, 2025

Multi-Metric Preference Alignment for Generative Speech Restoration

Aug 24, 2025

From Judgment to Interference: Early Stopping LLM Harmful Outputs via Streaming Content Monitoring

Jun 11, 2025

SingNet: Towards a Large-Scale, Diverse, and In-the-Wild Singing Voice Dataset

May 14, 2025

Advancing Zero-shot Text-to-Speech Intelligibility across Diverse Domains via Preference Alignment

May 07, 2025

Vevo: Controllable Zero-Shot Voice Imitation with Self-Supervised Disentanglement

Feb 11, 2025

Metis: A Foundation Speech Generation Model with Masked Generative Pre-training

Feb 05, 2025

Overview of the Amphion Toolkit (v0.2)

Jan 26, 2025