Picture for Hsin-Min Wang

Hsin-Min Wang

Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GAN

Add code
Sep 21, 2022
Figure 1 for Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GAN
Figure 2 for Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GAN
Figure 3 for Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GAN
Figure 4 for Mandarin Singing Voice Synthesis with Denoising Diffusion Probabilistic Wasserstein GAN
Viaarxiv icon

NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling

Add code
Jun 18, 2022
Figure 1 for NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling
Figure 2 for NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling
Figure 3 for NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling
Figure 4 for NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling
Viaarxiv icon

A Study of Using Cepstrogram for Countermeasure Against Replay Attacks

Add code
Apr 09, 2022
Figure 1 for A Study of Using Cepstrogram for Countermeasure Against Replay Attacks
Figure 2 for A Study of Using Cepstrogram for Countermeasure Against Replay Attacks
Figure 3 for A Study of Using Cepstrogram for Countermeasure Against Replay Attacks
Figure 4 for A Study of Using Cepstrogram for Countermeasure Against Replay Attacks
Viaarxiv icon

MTI-Net: A Multi-Target Speech Intelligibility Prediction Model

Add code
Apr 07, 2022
Figure 1 for MTI-Net: A Multi-Target Speech Intelligibility Prediction Model
Figure 2 for MTI-Net: A Multi-Target Speech Intelligibility Prediction Model
Figure 3 for MTI-Net: A Multi-Target Speech Intelligibility Prediction Model
Figure 4 for MTI-Net: A Multi-Target Speech Intelligibility Prediction Model
Viaarxiv icon

MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids

Add code
Apr 07, 2022
Figure 1 for MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids
Figure 2 for MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids
Figure 3 for MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids
Figure 4 for MBI-Net: A Non-Intrusive Multi-Branched Speech Intelligibility Prediction Model for Hearing Aids
Viaarxiv icon

Filter-based Discriminative Autoencoders for Children Speech Recognition

Add code
Apr 01, 2022
Figure 1 for Filter-based Discriminative Autoencoders for Children Speech Recognition
Figure 2 for Filter-based Discriminative Autoencoders for Children Speech Recognition
Figure 3 for Filter-based Discriminative Autoencoders for Children Speech Recognition
Figure 4 for Filter-based Discriminative Autoencoders for Children Speech Recognition
Viaarxiv icon

Generation of Speaker Representations Using Heterogeneous Training Batch Assembly

Add code
Mar 30, 2022
Figure 1 for Generation of Speaker Representations Using Heterogeneous Training Batch Assembly
Figure 2 for Generation of Speaker Representations Using Heterogeneous Training Batch Assembly
Figure 3 for Generation of Speaker Representations Using Heterogeneous Training Batch Assembly
Figure 4 for Generation of Speaker Representations Using Heterogeneous Training Batch Assembly
Viaarxiv icon

Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks

Add code
Mar 30, 2022
Figure 1 for Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks
Figure 2 for Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks
Figure 3 for Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks
Figure 4 for Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks
Viaarxiv icon

Multi-Target Filter and Detector for Speaker Diarization

Add code
Mar 30, 2022
Figure 1 for Multi-Target Filter and Detector for Speaker Diarization
Figure 2 for Multi-Target Filter and Detector for Speaker Diarization
Figure 3 for Multi-Target Filter and Detector for Speaker Diarization
Figure 4 for Multi-Target Filter and Detector for Speaker Diarization
Viaarxiv icon

Chain-based Discriminative Autoencoders for Speech Recognition

Add code
Mar 28, 2022
Figure 1 for Chain-based Discriminative Autoencoders for Speech Recognition
Figure 2 for Chain-based Discriminative Autoencoders for Speech Recognition
Figure 3 for Chain-based Discriminative Autoencoders for Speech Recognition
Viaarxiv icon