Picture for Yusheng Tian

Yusheng Tian

User-Driven Voice Generation and Editing through Latent Space Navigation

Add code
Aug 30, 2024
Figure 1 for User-Driven Voice Generation and Editing through Latent Space Navigation
Figure 2 for User-Driven Voice Generation and Editing through Latent Space Navigation
Figure 3 for User-Driven Voice Generation and Editing through Latent Space Navigation
Figure 4 for User-Driven Voice Generation and Editing through Latent Space Navigation
Viaarxiv icon

Creating Personalized Synthetic Voices from Articulation Impaired Speech Using Augmented Reconstruction Loss

Add code
Jan 08, 2024
Figure 1 for Creating Personalized Synthetic Voices from Articulation Impaired Speech Using Augmented Reconstruction Loss
Figure 2 for Creating Personalized Synthetic Voices from Articulation Impaired Speech Using Augmented Reconstruction Loss
Figure 3 for Creating Personalized Synthetic Voices from Articulation Impaired Speech Using Augmented Reconstruction Loss
Viaarxiv icon

Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models

Add code
May 27, 2023
Figure 1 for Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models
Figure 2 for Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models
Figure 3 for Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models
Figure 4 for Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models
Viaarxiv icon

Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data

Add code
May 18, 2023
Viaarxiv icon

Convolution-Based Channel-Frequency Attention for Text-Independent Speaker Verification

Add code
Oct 31, 2022
Figure 1 for Convolution-Based Channel-Frequency Attention for Text-Independent Speaker Verification
Figure 2 for Convolution-Based Channel-Frequency Attention for Text-Independent Speaker Verification
Figure 3 for Convolution-Based Channel-Frequency Attention for Text-Independent Speaker Verification
Figure 4 for Convolution-Based Channel-Frequency Attention for Text-Independent Speaker Verification
Viaarxiv icon

Transport-Oriented Feature Aggregation for Speaker Embedding Learning

Add code
Jun 26, 2022
Figure 1 for Transport-Oriented Feature Aggregation for Speaker Embedding Learning
Figure 2 for Transport-Oriented Feature Aggregation for Speaker Embedding Learning
Figure 3 for Transport-Oriented Feature Aggregation for Speaker Embedding Learning
Figure 4 for Transport-Oriented Feature Aggregation for Speaker Embedding Learning
Viaarxiv icon

Learnable Frequency Filters for Speech Feature Extraction in Speaker Verification

Add code
Jun 15, 2022
Figure 1 for Learnable Frequency Filters for Speech Feature Extraction in Speaker Verification
Figure 2 for Learnable Frequency Filters for Speech Feature Extraction in Speaker Verification
Figure 3 for Learnable Frequency Filters for Speech Feature Extraction in Speaker Verification
Figure 4 for Learnable Frequency Filters for Speech Feature Extraction in Speaker Verification
Viaarxiv icon

Improving End-to-End Speech-to-Intent Classification with Reptile

Add code
Aug 05, 2020
Figure 1 for Improving End-to-End Speech-to-Intent Classification with Reptile
Figure 2 for Improving End-to-End Speech-to-Intent Classification with Reptile
Figure 3 for Improving End-to-End Speech-to-Intent Classification with Reptile
Figure 4 for Improving End-to-End Speech-to-Intent Classification with Reptile
Viaarxiv icon