Zhizheng Wu

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

Mar 05, 2024
Zeqian Ju, Yuancheng Wang, Kai Shen, Xu Tan, Detai Xin, Dongchao Yang, Yanqing Liu, Yichong Leng, Kaitao Song, Siliang Tang, Zhizheng Wu, Tao Qin, Xiang-Yang Li, Wei Ye, Shikun Zhang, Jiang Bian, Lei He, Jinyu Li, Sheng Zhao

SingVisio: Visual Analytics of Diffusion Model for Singing Voice Conversion

Feb 20, 2024
Liumeng Xue, Chaoren Wang, Mingxuan Wang, Xueyao Zhang, Jun Han, Zhizheng Wu

CoAVT: A Cognition-Inspired Unified Audio-Visual-Text Pre-Training Model for Multimodal Processing

Jan 22, 2024
Xianghu Yue, Xiaohai Tian, Malu Zhang, Zhizheng Wu, Haizhou Li

Amphion: An Open-Source Audio, Music and Speech Generation Toolkit

Dec 15, 2023
Xueyao Zhang, Liumeng Xue, Yuancheng Wang, Yicheng Gu, Xi Chen, Zihao Fang, Haopeng Chen, Lexiao Zou, Chaoren Wang, Jun Han, Kai Chen, Haizhou Li, Zhizheng Wu

Multi-Scale Sub-Band Constant-Q Transform Discriminator for High-Fidelity Vocoder

Nov 25, 2023
Yicheng Gu, Xueyao Zhang, Liumeng Xue, Zhizheng Wu

Leveraging Content-based Features from Multiple Acoustic Models for Singing Voice Conversion

Oct 17, 2023
Xueyao Zhang, Yicheng Gu, Haopeng Chen, Zihao Fang, Lexiao Zou, Liumeng Xue, Zhizheng Wu

An Initial Investigation of Neural Replay Simulator for Over-the-Air Adversarial Perturbations to Automatic Speaker Verification

Oct 12, 2023
Jiaqi Li, Li Wang, Liumeng Xue, Lei Wang, Zhizheng Wu

Audio compression-assisted feature extraction for voice replay attack detection

Oct 10, 2023
Xiangyu Shi, Yuhao Luo, Li Wang, Haorui He, Hao Li, Lei Wang, Zhizheng Wu

AdvSV: An Over-the-Air Adversarial Attack Dataset for Speaker Verification

Oct 09, 2023
Li Wang, Jiaqi Li, Yuhao Luo, Jiahao Zheng, Lei Wang, Hao Li, Ke Xu, Chengfang Fang, Jie Shi, Zhizheng Wu
