Picture for Konstantinos Markopoulos

Konstantinos Markopoulos

Improved Text Emotion Prediction Using Combined Valence and Arousal Ordinal Classification

Apr 02, 2024
Viaarxiv icon

Generating Gender-Ambiguous Text-to-Speech Voices

Add code
Nov 01, 2022
Figure 1 for Generating Gender-Ambiguous Text-to-Speech Voices
Figure 2 for Generating Gender-Ambiguous Text-to-Speech Voices
Figure 3 for Generating Gender-Ambiguous Text-to-Speech Voices
Figure 4 for Generating Gender-Ambiguous Text-to-Speech Voices
Viaarxiv icon

Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation

Add code
Oct 31, 2022
Figure 1 for Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation
Figure 2 for Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation
Figure 3 for Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation
Figure 4 for Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation
Viaarxiv icon

Fine-grained Noise Control for Multispeaker Speech Synthesis

Add code
Apr 11, 2022
Figure 1 for Fine-grained Noise Control for Multispeaker Speech Synthesis
Figure 2 for Fine-grained Noise Control for Multispeaker Speech Synthesis
Figure 3 for Fine-grained Noise Control for Multispeaker Speech Synthesis
Figure 4 for Fine-grained Noise Control for Multispeaker Speech Synthesis
Viaarxiv icon

Karaoker: Alignment-free singing voice synthesis with speech training data

Add code
Apr 08, 2022
Figure 1 for Karaoker: Alignment-free singing voice synthesis with speech training data
Figure 2 for Karaoker: Alignment-free singing voice synthesis with speech training data
Figure 3 for Karaoker: Alignment-free singing voice synthesis with speech training data
Viaarxiv icon

Self supervised learning for robust voice cloning

Add code
Apr 07, 2022
Figure 1 for Self supervised learning for robust voice cloning
Figure 2 for Self supervised learning for robust voice cloning
Figure 3 for Self supervised learning for robust voice cloning
Viaarxiv icon

Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control

Add code
Nov 19, 2021
Figure 1 for Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control
Figure 2 for Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control
Figure 3 for Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control
Figure 4 for Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control
Viaarxiv icon

Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control

Add code
Nov 17, 2021
Figure 1 for Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control
Figure 2 for Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control
Figure 3 for Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control
Figure 4 for Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control
Viaarxiv icon

Cross-lingual Low Resource Speaker Adaptation Using Phonological Features

Add code
Nov 17, 2021
Figure 1 for Cross-lingual Low Resource Speaker Adaptation Using Phonological Features
Figure 2 for Cross-lingual Low Resource Speaker Adaptation Using Phonological Features
Figure 3 for Cross-lingual Low Resource Speaker Adaptation Using Phonological Features
Viaarxiv icon

High Quality Streaming Speech Synthesis with Low, Sentence-Length-Independent Latency

Add code
Nov 17, 2021
Figure 1 for High Quality Streaming Speech Synthesis with Low, Sentence-Length-Independent Latency
Figure 2 for High Quality Streaming Speech Synthesis with Low, Sentence-Length-Independent Latency
Figure 3 for High Quality Streaming Speech Synthesis with Low, Sentence-Length-Independent Latency
Viaarxiv icon