Picture for Kou Tanaka

Kou Tanaka

FastVoiceGrad: One-step Diffusion-Based Voice Conversion with Adversarial Conditional Diffusion Distillation

Add code
Sep 03, 2024
Viaarxiv icon

Training Generative Adversarial Network-Based Vocoder with Limited Data Using Augmentation-Conditional Discriminator

Add code
Mar 25, 2024
Viaarxiv icon

iSTFTNet2: Faster and More Lightweight iSTFT-Based Neural Vocoder Using 1D-2D CNN

Add code
Aug 14, 2023
Viaarxiv icon

Wave-U-Net Discriminator: Fast and Lightweight Discriminator for Generative Adversarial Network-Based Speech Synthesis

Add code
Mar 24, 2023
Viaarxiv icon

iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform

Add code
Mar 04, 2022
Figure 1 for iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform
Figure 2 for iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform
Figure 3 for iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform
Figure 4 for iSTFTNet: Fast and Lightweight Mel-Spectrogram Vocoder Incorporating Inverse Short-Time Fourier Transform
Viaarxiv icon

FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion

Add code
Apr 14, 2021
Figure 1 for FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion
Figure 2 for FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion
Figure 3 for FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion
Figure 4 for FastS2S-VC: Streaming Non-Autoregressive Sequence-to-Sequence Voice Conversion
Viaarxiv icon

MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames

Add code
Feb 25, 2021
Figure 1 for MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames
Figure 2 for MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames
Figure 3 for MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames
Figure 4 for MaskCycleGAN-VC: Learning Non-parallel Voice Conversion with Filling in Frames
Viaarxiv icon

CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion

Add code
Oct 22, 2020
Figure 1 for CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion
Figure 2 for CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion
Figure 3 for CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion
Figure 4 for CycleGAN-VC3: Examining and Improving CycleGAN-VCs for Mel-spectrogram Conversion
Viaarxiv icon

Non-Parallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks

Add code
Sep 11, 2020
Figure 1 for Non-Parallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks
Figure 2 for Non-Parallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks
Figure 3 for Non-Parallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks
Figure 4 for Non-Parallel Voice Conversion with Augmented Classifier Star Generative Adversarial Networks
Viaarxiv icon

Many-to-Many Voice Transformer Network

Add code
Jun 07, 2020
Figure 1 for Many-to-Many Voice Transformer Network
Figure 2 for Many-to-Many Voice Transformer Network
Figure 3 for Many-to-Many Voice Transformer Network
Figure 4 for Many-to-Many Voice Transformer Network
Viaarxiv icon