Helin Wang

DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction

Oct 10, 2023
Via arXiv

DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model

Jun 18, 2023
Via arXiv

Benchmarking Large Language Models on CMExam -- A Comprehensive Chinese Medical Exam Dataset

Jun 08, 2023
Via arXiv

NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS

Nov 04, 2022
Via arXiv

Diffsound: Discrete Diffusion Model for Text-to-sound Generation

Jul 20, 2022
Via arXiv

Calibrate and Refine! A Novel and Agile Framework for ASR-error Robust Intent Detection

May 23, 2022
Via arXiv

Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training

Apr 27, 2022
Via arXiv

RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection

Apr 05, 2022
Via arXiv

A Two-student Learning Framework for Mixed Supervised Target Sound Detection

Apr 05, 2022
Via arXiv

Improving Target Sound Extraction with Timestamp Information

Apr 02, 2022
Via arXiv