Picture for Zhen-Hua Ling

Zhen-Hua Ling

The Voice Timbre Attribute Detection 2025 Challenge Evaluation Plan

Add code
May 14, 2025
Viaarxiv icon

Introducing voice timbre attribute detection

Add code
May 14, 2025
Viaarxiv icon

Weakly Supervised Temporal Sentence Grounding via Positive Sample Mining

Add code
May 10, 2025
Viaarxiv icon

Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models

Add code
Feb 09, 2025
Viaarxiv icon

RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation

Add code
Jan 23, 2025
Viaarxiv icon

Unispeaker: A Unified Approach for Multimodality-driven Speaker Generation

Add code
Jan 11, 2025
Viaarxiv icon

Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis

Add code
Dec 22, 2024
Viaarxiv icon

On the Generation and Removal of Speaker Adversarial Perturbation for Voice-Privacy Protection

Add code
Dec 12, 2024
Figure 1 for On the Generation and Removal of Speaker Adversarial Perturbation for Voice-Privacy Protection
Figure 2 for On the Generation and Removal of Speaker Adversarial Perturbation for Voice-Privacy Protection
Figure 3 for On the Generation and Removal of Speaker Adversarial Perturbation for Voice-Privacy Protection
Figure 4 for On the Generation and Removal of Speaker Adversarial Perturbation for Voice-Privacy Protection
Viaarxiv icon

Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection

Add code
Dec 09, 2024
Viaarxiv icon

A Neural Denoising Vocoder for Clean Waveform Generation from Noisy Mel-Spectrogram based on Amplitude and Phase Predictions

Add code
Nov 19, 2024
Viaarxiv icon