Hieu-Thi Luong

LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example

Oct 11, 2021
Hieu-Thi Luong, Junichi Yamagishi

Via arXiv

Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance

Jun 25, 2021
Hieu-Thi Luong, Junichi Yamagishi

Via arXiv

Latent linguistic embedding for cross-lingual text-to-speech and voice conversion

Oct 08, 2020
Hieu-Thi Luong, Junichi Yamagishi

Via arXiv

NAUTILUS: a Versatile Voice Cloning System

May 22, 2020
Hieu-Thi Luong, Junichi Yamagishi

Via arXiv

Bootstrapping non-parallel voice conversion from speaker-adaptive text-to-speech

Sep 14, 2019
Hieu-Thi Luong, Junichi Yamagishi

Via arXiv

A Unified Speaker Adaptation Method for Speech Synthesis using Transcribed and Untranscribed Speech with Backpropagation

Jun 18, 2019
Hieu-Thi Luong, Junichi Yamagishi

Via arXiv

Training Multi-Speaker Neural Text-to-Speech Systems using Speaker-Imbalanced Speech Corpora

Apr 07, 2019
Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa

Via arXiv

Scaling and bias codes for modeling speaker-adaptive DNN-based speech synthesis systems

Oct 01, 2018
Hieu-Thi Luong, Junichi Yamagishi

Via arXiv

Multimodal speech synthesis architecture for unsupervised speaker adaptation

Aug 20, 2018
Hieu-Thi Luong, Junichi Yamagishi

Via arXiv

Investigating accuracy of pitch-accent annotations in neural network-based speech synthesis and denoising effects

Aug 02, 2018
Hieu-Thi Luong, Xin Wang, Junichi Yamagishi, Nobuyuki Nishizawa

Via arXiv