Jixun Yao

SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation

Oct 08, 2023
Yuanjun Lv, Jixun Yao, Peikun Chen, Hongbin Zhou, Heng Lu, Lei Xie

DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion

Sep 27, 2023
Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Shuai Wang, Jixun Yao, Lei Xie, Mengxiao Bi

PromptVC: Flexible Stylistic Voice Conversion in Latent Space Driven by Natural Language Prompts

Sep 17, 2023
Jixun Yao, Yuguang Yang, Yi Lei, Ziqian Ning, Yanni Hu, Yu Pan, Jingjing Yin, Hongbin Zhou, Heng Lu, Lei Xie

Timbre-reserved Adversarial Attack in Speaker Identification

Sep 02, 2023
Qing Wang, Jixun Yao, Li Zhang, Pengcheng Guo, Lei Xie

GEmo-CLAP: Gender-Attribute-Enhanced Contrastive Language-Audio Pretraining for Speech Emotion Recognition

Jun 16, 2023
Yu Pan, Yanni Hu, Yuguang Yang, Jixun Yao, Wen Fei, Lei Ma, Heng Lu

Pseudo-Siamese Network based Timbre-reserved Black-box Adversarial Attack in Speaker Identification

May 30, 2023
Qing Wang, Jixun Yao, Ziqian Wang, Pengcheng Guo, Lei Xie

DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding

May 21, 2023
Ziqian Ning, Yuepeng Jiang, Pengcheng Zhu, Jixun Yao, Shuai Wang, Lei Xie, Mengxiao Bi

UniSyn: An End-to-End Unified Model for Text-to-Speech and Singing Voice Synthesis

Dec 06, 2022
Yi Lei, Shan Yang, Xinsheng Wang, Qicong Xie, Jixun Yao, Lei Xie, Dan Su

Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features

Nov 09, 2022
Ziqian Ning, Qicong Xie, Pengcheng Zhu, Zhichao Wang, Liumeng Xue, Jixun Yao, Lei Xie, Mengxiao Bi
