Picture for Helin Wang

Helin Wang

DreamVoice: Text-Guided Voice Conversion

Add code
Jun 24, 2024
Viaarxiv icon

Noise-robust Speech Separation with Fast Generative Correction

Add code
Jun 11, 2024
Figure 1 for Noise-robust Speech Separation with Fast Generative Correction
Figure 2 for Noise-robust Speech Separation with Fast Generative Correction
Figure 3 for Noise-robust Speech Separation with Fast Generative Correction
Viaarxiv icon

Enhancing Zero-shot Text-to-Speech Synthesis with Human Feedback

Add code
Jun 02, 2024
Viaarxiv icon

Asynchronous and Segmented Bidirectional Encoding for NMT

Add code
Feb 19, 2024
Viaarxiv icon

Efficient Reinforcemen Learning via Decoupling Exploration and Utilization

Add code
Jan 17, 2024
Viaarxiv icon

Improving fairness for spoken language understanding in atypical speech with Text-to-Speech

Add code
Nov 16, 2023
Viaarxiv icon

DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction

Add code
Oct 10, 2023
Viaarxiv icon

DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model

Add code
Jun 18, 2023
Figure 1 for DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model
Figure 2 for DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model
Figure 3 for DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model
Figure 4 for DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model
Viaarxiv icon

Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset

Add code
Jun 08, 2023
Figure 1 for Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset
Figure 2 for Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset
Figure 3 for Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset
Figure 4 for Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset
Viaarxiv icon

NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS

Add code
Nov 04, 2022
Figure 1 for NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS
Figure 2 for NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS
Figure 3 for NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS
Viaarxiv icon