Jixun Yao

EASY: Emotion-aware Speaker Anonymization via Factorized Distillation

May 21, 2025

ClapFM-EVC: High-Fidelity and Flexible Emotional Voice Conversion with Dual Control from Natural Language and Speech

May 20, 2025

SongEval: A Benchmark Dataset for Song Aesthetics Evaluation

May 16, 2025

DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

Mar 03, 2025

GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling

Feb 05, 2025

Fine-grained Preference Optimization Improves Zero-shot Text-to-Speech

Feb 05, 2025

DiffAttack: Diffusion-based Timbre-reserved Adversarial Attack in Speaker Identification

Jan 09, 2025

StableVC: Style Controllable Zero-Shot Voice Conversion with Conditional Flow Matching

Dec 10, 2024

CTEFM-VC: Zero-Shot Voice Conversion Based on Content-Aware Timbre Ensemble Modeling and Flow Matching

Nov 04, 2024

The NPU-HWC System for the ISCSLP 2024 Inspirational and Convincing Audio Generation Challenge

Oct 31, 2024