Picture for Zhenhua Ling

Zhenhua Ling

Pinhole Effect on Linkability and Dispersion in Speaker Anonymization

Add code
Aug 23, 2025
Viaarxiv icon

Select to Know: An Internal-External Knowledge Self-Selection Framework for Domain-Specific Question Answering

Add code
Aug 21, 2025
Viaarxiv icon

RISE: Reasoning Enhancement via Iterative Self-Exploration in Multi-hop Question Answering

Add code
May 28, 2025
Viaarxiv icon

UDDETTS: Unifying Discrete and Dimensional Emotions for Controllable Emotional Text-to-Speech

Add code
May 15, 2025
Viaarxiv icon

DiffStyleTTS: Diffusion-based Hierarchical Prosody Modeling for Text-to-Speech with Diverse and Controllable Styles

Add code
Dec 04, 2024
Figure 1 for DiffStyleTTS: Diffusion-based Hierarchical Prosody Modeling for Text-to-Speech with Diverse and Controllable Styles
Figure 2 for DiffStyleTTS: Diffusion-based Hierarchical Prosody Modeling for Text-to-Speech with Diverse and Controllable Styles
Figure 3 for DiffStyleTTS: Diffusion-based Hierarchical Prosody Modeling for Text-to-Speech with Diverse and Controllable Styles
Figure 4 for DiffStyleTTS: Diffusion-based Hierarchical Prosody Modeling for Text-to-Speech with Diverse and Controllable Styles
Viaarxiv icon

Refining Self-Supervised Learnt Speech Representation using Brain Activations

Add code
Jun 12, 2024
Figure 1 for Refining Self-Supervised Learnt Speech Representation using Brain Activations
Figure 2 for Refining Self-Supervised Learnt Speech Representation using Brain Activations
Figure 3 for Refining Self-Supervised Learnt Speech Representation using Brain Activations
Figure 4 for Refining Self-Supervised Learnt Speech Representation using Brain Activations
Viaarxiv icon

Adversarial speech for voice privacy protection from Personalized Speech generation

Add code
Jan 22, 2024
Viaarxiv icon

Pre-training Language Model as a Multi-perspective Course Learner

Add code
May 06, 2023
Figure 1 for Pre-training Language Model as a Multi-perspective Course Learner
Figure 2 for Pre-training Language Model as a Multi-perspective Course Learner
Figure 3 for Pre-training Language Model as a Multi-perspective Course Learner
Figure 4 for Pre-training Language Model as a Multi-perspective Course Learner
Viaarxiv icon

Decoupled Pronunciation and Prosody Modeling in Meta-Learning-Based Multilingual Speech Synthesis

Add code
Sep 14, 2022
Figure 1 for Decoupled Pronunciation and Prosody Modeling in Meta-Learning-Based Multilingual Speech Synthesis
Figure 2 for Decoupled Pronunciation and Prosody Modeling in Meta-Learning-Based Multilingual Speech Synthesis
Figure 3 for Decoupled Pronunciation and Prosody Modeling in Meta-Learning-Based Multilingual Speech Synthesis
Figure 4 for Decoupled Pronunciation and Prosody Modeling in Meta-Learning-Based Multilingual Speech Synthesis
Viaarxiv icon

Speaker Adaption with Intuitive Prosodic Features for Statistical Parametric Speech Synthesis

Add code
Mar 02, 2022
Figure 1 for Speaker Adaption with Intuitive Prosodic Features for Statistical Parametric Speech Synthesis
Figure 2 for Speaker Adaption with Intuitive Prosodic Features for Statistical Parametric Speech Synthesis
Figure 3 for Speaker Adaption with Intuitive Prosodic Features for Statistical Parametric Speech Synthesis
Figure 4 for Speaker Adaption with Intuitive Prosodic Features for Statistical Parametric Speech Synthesis
Viaarxiv icon