Hui-Peng Du

Vision-Integrated High-Quality Neural Speech Coding

May 29, 2025

Improving Noise Robustness of LLM-based Zero-shot TTS via Discrete Acoustic Token Denoising

May 22, 2025

Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis

Dec 22, 2024

A Neural Denoising Vocoder for Clean Waveform Generation from Noisy Mel-Spectrogram based on Amplitude and Phase Predictions

Nov 19, 2024

SAMOS: A Neural MOS Prediction Model Leveraging Semantic Representations and Acoustic Features

Nov 18, 2024

ESTVocoder: An Excitation-Spectral-Transformed Neural Vocoder Conditioned on Mel Spectrogram

Nov 18, 2024

Pitch-and-Spectrum-Aware Singing Quality Assessment with Bias Correction and Model Fusion

Nov 17, 2024

MDCTCodec: A Lightweight MDCT-based Neural Audio Codec towards High Sampling Rate and Low Bitrate Scenarios

Nov 01, 2024

APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Compression-Rate Neural Audio Codec with Staged Training Paradigm

Oct 30, 2024

ERVQ: Enhanced Residual Vector Quantization with Intra-and-Inter-Codebook Optimization for Neural Audio Codecs

Oct 16, 2024