Picture for Mengxiao Bi

Mengxiao Bi

HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects

Add code
Jul 17, 2024
Viaarxiv icon

DualVC 3: Leveraging Language Model Generated Pseudo Context for End-to-end Low Latency Streaming Voice Conversion

Add code
Jun 12, 2024
Viaarxiv icon

EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis

Add code
Apr 02, 2024
Figure 1 for EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis
Figure 2 for EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis
Figure 3 for EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis
Figure 4 for EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis
Viaarxiv icon

DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion

Add code
Sep 27, 2023
Figure 1 for DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion
Figure 2 for DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion
Figure 3 for DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion
Figure 4 for DualVC 2: Dynamic Masked Convolution for Unified Streaming and Non-Streaming Voice Conversion
Viaarxiv icon

Multi-GradSpeech: Towards Diffusion-based Multi-Speaker Text-to-speech Using Consistent Diffusion Models

Add code
Aug 31, 2023
Figure 1 for Multi-GradSpeech: Towards Diffusion-based Multi-Speaker Text-to-speech Using Consistent Diffusion Models
Figure 2 for Multi-GradSpeech: Towards Diffusion-based Multi-Speaker Text-to-speech Using Consistent Diffusion Models
Viaarxiv icon

DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding

Add code
May 21, 2023
Figure 1 for DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding
Figure 2 for DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding
Figure 3 for DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding
Figure 4 for DualVC: Dual-mode Voice Conversion using Intra-model Knowledge Distillation and Hybrid Predictive Coding
Viaarxiv icon

Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features

Add code
Nov 09, 2022
Figure 1 for Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features
Figure 2 for Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features
Figure 3 for Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features
Figure 4 for Expressive-VC: Highly Expressive Voice Conversion with Attention Fusion of Bottleneck and Perturbation Features
Viaarxiv icon

Learn2Sing 2.0: Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher

Add code
Mar 30, 2022
Figure 1 for Learn2Sing 2.0: Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher
Figure 2 for Learn2Sing 2.0: Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher
Figure 3 for Learn2Sing 2.0: Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher
Figure 4 for Learn2Sing 2.0: Diffusion and Mutual Information-Based Target Speaker SVS by Learning from Singing Teacher
Viaarxiv icon

Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis

Add code
Jan 20, 2022
Figure 1 for Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis
Figure 2 for Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis
Figure 3 for Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis
Figure 4 for Opencpop: A High-Quality Open Source Chinese Popular Song Corpus for Singing Voice Synthesis
Viaarxiv icon

One-shot Voice Conversion For Style Transfer Based On Speaker Adaptation

Add code
Nov 24, 2021
Viaarxiv icon