Picture for Zhizheng Wu

Zhizheng Wu

Over-the-Air Adversarial Attack Detection: from Datasets to Defenses

Add code
Sep 11, 2025
Viaarxiv icon

Audio Deepfake Verification

Add code
Sep 10, 2025
Viaarxiv icon

Multi-Metric Preference Alignment for Generative Speech Restoration

Add code
Aug 24, 2025
Viaarxiv icon

NVSpeech: An Integrated and Scalable Pipeline for Human-Like Speech Modeling with Paralinguistic Vocalizations

Add code
Aug 06, 2025
Viaarxiv icon

AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models

Add code
May 22, 2025
Viaarxiv icon

Neurodyne: Neural Pitch Manipulation with Representation Learning and Cycle-Consistency GAN

Add code
May 21, 2025
Viaarxiv icon

DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec for Speech Generation

Add code
May 19, 2025
Viaarxiv icon

SingNet: Towards a Large-Scale, Diverse, and In-the-Wild Singing Voice Dataset

Add code
May 14, 2025
Viaarxiv icon

Advancing Zero-shot Text-to-Speech Intelligibility across Diverse Domains via Preference Alignment

Add code
May 07, 2025
Viaarxiv icon

Diff-SSL-G-Comp: Towards a Large-Scale and Diverse Dataset for Virtual Analog Modeling

Add code
Apr 06, 2025
Figure 1 for Diff-SSL-G-Comp: Towards a Large-Scale and Diverse Dataset for Virtual Analog Modeling
Figure 2 for Diff-SSL-G-Comp: Towards a Large-Scale and Diverse Dataset for Virtual Analog Modeling
Figure 3 for Diff-SSL-G-Comp: Towards a Large-Scale and Diverse Dataset for Virtual Analog Modeling
Figure 4 for Diff-SSL-G-Comp: Towards a Large-Scale and Diverse Dataset for Virtual Analog Modeling
Viaarxiv icon