Picture for Lucas H. Ueda

Lucas H. Ueda

Crab: Multi Layer Contrastive Supervision to Improve Speech Emotion Recognition Under Both Acted and Natural Speech Condition

Add code
Mar 24, 2026
Viaarxiv icon

SelfTTS: cross-speaker style transfer through explicit embedding disentanglement and self-refinement using self-augmentation

Add code
Mar 23, 2026
Viaarxiv icon

Improving Data Augmentation-based Cross-Speaker Style Transfer for TTS with Singing Voice, Style Filtering, and F0 Matching

Add code
Oct 08, 2024
Figure 1 for Improving Data Augmentation-based Cross-Speaker Style Transfer for TTS with Singing Voice, Style Filtering, and F0 Matching
Figure 2 for Improving Data Augmentation-based Cross-Speaker Style Transfer for TTS with Singing Voice, Style Filtering, and F0 Matching
Figure 3 for Improving Data Augmentation-based Cross-Speaker Style Transfer for TTS with Singing Voice, Style Filtering, and F0 Matching
Figure 4 for Improving Data Augmentation-based Cross-Speaker Style Transfer for TTS with Singing Voice, Style Filtering, and F0 Matching
Viaarxiv icon

Exploring synthetic data for cross-speaker style transfer in style representation based TTS

Add code
Sep 25, 2024
Figure 1 for Exploring synthetic data for cross-speaker style transfer in style representation based TTS
Figure 2 for Exploring synthetic data for cross-speaker style transfer in style representation based TTS
Figure 3 for Exploring synthetic data for cross-speaker style transfer in style representation based TTS
Figure 4 for Exploring synthetic data for cross-speaker style transfer in style representation based TTS
Viaarxiv icon