Picture for Panos Kakoulidis

Panos Kakoulidis

MambaRate: Speech Quality Assessment Across Different Sampling Rates

Add code
Jul 16, 2025
Viaarxiv icon

Improved Text Emotion Prediction Using Combined Valence and Arousal Ordinal Classification

Add code
Apr 02, 2024
Figure 1 for Improved Text Emotion Prediction Using Combined Valence and Arousal Ordinal Classification
Figure 2 for Improved Text Emotion Prediction Using Combined Valence and Arousal Ordinal Classification
Figure 3 for Improved Text Emotion Prediction Using Combined Valence and Arousal Ordinal Classification
Figure 4 for Improved Text Emotion Prediction Using Combined Valence and Arousal Ordinal Classification
Viaarxiv icon

Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations

Add code
Feb 02, 2024
Figure 1 for Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations
Figure 2 for Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations
Figure 3 for Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations
Figure 4 for Low-Resource Cross-Domain Singing Voice Synthesis via Reduced Self-Supervised Speech Representations
Viaarxiv icon

Generating Gender-Ambiguous Text-to-Speech Voices

Add code
Nov 01, 2022
Figure 1 for Generating Gender-Ambiguous Text-to-Speech Voices
Figure 2 for Generating Gender-Ambiguous Text-to-Speech Voices
Figure 3 for Generating Gender-Ambiguous Text-to-Speech Voices
Figure 4 for Generating Gender-Ambiguous Text-to-Speech Voices
Viaarxiv icon

Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation

Add code
Oct 31, 2022
Figure 1 for Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation
Figure 2 for Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation
Figure 3 for Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation
Figure 4 for Cross-lingual Text-To-Speech with Flow-based Voice Conversion for Improved Pronunciation
Viaarxiv icon

Karaoker: Alignment-free singing voice synthesis with speech training data

Add code
Apr 08, 2022
Figure 1 for Karaoker: Alignment-free singing voice synthesis with speech training data
Figure 2 for Karaoker: Alignment-free singing voice synthesis with speech training data
Figure 3 for Karaoker: Alignment-free singing voice synthesis with speech training data
Viaarxiv icon

Self supervised learning for robust voice cloning

Add code
Apr 07, 2022
Figure 1 for Self supervised learning for robust voice cloning
Figure 2 for Self supervised learning for robust voice cloning
Figure 3 for Self supervised learning for robust voice cloning
Viaarxiv icon

Prosodic Clustering for Phoneme-level Prosody Control in End-to-End Speech Synthesis

Add code
Nov 19, 2021
Figure 1 for Prosodic Clustering for Phoneme-level Prosody Control in End-to-End Speech Synthesis
Figure 2 for Prosodic Clustering for Phoneme-level Prosody Control in End-to-End Speech Synthesis
Figure 3 for Prosodic Clustering for Phoneme-level Prosody Control in End-to-End Speech Synthesis
Figure 4 for Prosodic Clustering for Phoneme-level Prosody Control in End-to-End Speech Synthesis
Viaarxiv icon

Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control

Add code
Nov 19, 2021
Figure 1 for Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control
Figure 2 for Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control
Figure 3 for Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control
Figure 4 for Improved Prosodic Clustering for Multispeaker and Speaker-independent Phoneme-level Prosody Control
Viaarxiv icon

Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control

Add code
Nov 17, 2021
Figure 1 for Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control
Figure 2 for Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control
Figure 3 for Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control
Figure 4 for Rapping-Singing Voice Synthesis based on Phoneme-level Prosody Control
Viaarxiv icon