Alert button
Picture for Eunwoo Song

Eunwoo Song

Alert button

Unified Speech-Text Pretraining for Spoken Dialog Modeling

Add code
Bookmark button
Alert button
Feb 08, 2024
Heeseung Kim, Soonshin Seo, Kyeongseok Jeong, Ohsung Kwon, Jungwhan Kim, Jaehong Lee, Eunwoo Song, Myungwoo Oh, Sungroh Yoon, Kang Min Yoo

Viaarxiv icon

Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech

Add code
Bookmark button
Alert button
Aug 28, 2023
Hyungchan Yoon, Changhwan Kim, Eunwoo Song, Hyun-Wook Yoon, Hong-Goo Kang

Figure 1 for Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech
Figure 2 for Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech
Figure 3 for Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech
Figure 4 for Pruning Self-Attention for Zero-Shot Multi-Speaker Text-to-Speech
Viaarxiv icon

Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis

Add code
Bookmark button
Alert button
Oct 28, 2022
Yuma Shirahata, Ryuichi Yamamoto, Eunwoo Song, Ryo Terashima, Jae-Min Kim, Kentaro Tachibana

Figure 1 for Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis
Figure 2 for Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis
Figure 3 for Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis
Figure 4 for Period VITS: Variational Inference with Explicit Pitch Modeling for End-to-end Emotional Speech Synthesis
Viaarxiv icon

Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems

Add code
Bookmark button
Alert button
Jul 01, 2022
Hyun-Wook Yoon, Ohsung Kwon, Hoyeon Lee, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim, Min-Jae Hwang

Figure 1 for Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
Figure 2 for Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
Figure 3 for Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
Figure 4 for Language Model-Based Emotion Prediction Methods for Emotional Speech Synthesis Systems
Viaarxiv icon

TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder

Add code
Bookmark button
Alert button
Jun 30, 2022
Eunwoo Song, Ryuichi Yamamoto, Ohsung Kwon, Chan-Ho Song, Min-Jae Hwang, Suhyeon Oh, Hyun-Wook Yoon, Jin-Seob Kim, Jae-Min Kim

Figure 1 for TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder
Figure 2 for TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder
Figure 3 for TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder
Figure 4 for TTS-by-TTS 2: Data-selective augmentation for neural speech synthesis using ranking support vector machine with variational autoencoder
Viaarxiv icon

Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation

Add code
Bookmark button
Alert button
Apr 21, 2022
Ryo Terashima, Ryuichi Yamamoto, Eunwoo Song, Yuma Shirahata, Hyun-Wook Yoon, Jae-Min Kim, Kentaro Tachibana

Figure 1 for Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation
Figure 2 for Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation
Figure 3 for Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation
Figure 4 for Cross-Speaker Emotion Transfer for Low-Resource Text-to-Speech Using Non-Parallel Voice Conversion with Pitch-Shift Data Augmentation
Viaarxiv icon

Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss

Add code
Bookmark button
Alert button
Jan 19, 2021
Eunwoo Song, Ryuichi Yamamoto, Min-Jae Hwang, Jin-Seob Kim, Ohsung Kwon, Jae-Min Kim

Figure 1 for Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss
Figure 2 for Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss
Figure 3 for Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss
Figure 4 for Improved parallel WaveGAN vocoder with perceptually weighted spectrogram loss
Viaarxiv icon

Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators

Add code
Bookmark button
Alert button
Oct 27, 2020
Ryuichi Yamamoto, Eunwoo Song, Min-Jae Hwang, Jae-Min Kim

Figure 1 for Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators
Figure 2 for Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators
Figure 3 for Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators
Figure 4 for Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators
Viaarxiv icon

Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram

Add code
Bookmark button
Alert button
Oct 25, 2019
Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim

Figure 1 for Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram
Figure 2 for Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram
Figure 3 for Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram
Figure 4 for Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram
Viaarxiv icon