Picture for Rui-Chen Zheng

Rui-Chen Zheng

CodeSep: Low-Bitrate Codec-Driven Speech Separation with Base-Token Disentanglement and Auxiliary-Token Serial Prediction

Add code
Jan 19, 2026
Viaarxiv icon

Say More with Less: Variable-Frame-Rate Speech Tokenization via Adaptive Clustering and Implicit Duration Coding

Add code
Sep 04, 2025
Viaarxiv icon

Is GAN Necessary for Mel-Spectrogram-based Neural Vocoder?

Add code
Aug 11, 2025
Viaarxiv icon

Vision-Integrated High-Quality Neural Speech Coding

Add code
May 29, 2025
Figure 1 for Vision-Integrated High-Quality Neural Speech Coding
Figure 2 for Vision-Integrated High-Quality Neural Speech Coding
Figure 3 for Vision-Integrated High-Quality Neural Speech Coding
Figure 4 for Vision-Integrated High-Quality Neural Speech Coding
Viaarxiv icon

MDCTCodec: A Lightweight MDCT-based Neural Audio Codec towards High Sampling Rate and Low Bitrate Scenarios

Add code
Nov 01, 2024
Figure 1 for MDCTCodec: A Lightweight MDCT-based Neural Audio Codec towards High Sampling Rate and Low Bitrate Scenarios
Figure 2 for MDCTCodec: A Lightweight MDCT-based Neural Audio Codec towards High Sampling Rate and Low Bitrate Scenarios
Figure 3 for MDCTCodec: A Lightweight MDCT-based Neural Audio Codec towards High Sampling Rate and Low Bitrate Scenarios
Figure 4 for MDCTCodec: A Lightweight MDCT-based Neural Audio Codec towards High Sampling Rate and Low Bitrate Scenarios
Viaarxiv icon

APCodec+: A Spectrum-Coding-Based High-Fidelity and High-Compression-Rate Neural Audio Codec with Staged Training Paradigm

Add code
Oct 30, 2024
Viaarxiv icon

ERVQ: Enhanced Residual Vector Quantization with Intra-and-Inter-Codebook Optimization for Neural Audio Codecs

Add code
Oct 16, 2024
Viaarxiv icon

Stage-Wise and Prior-Aware Neural Speech Phase Prediction

Add code
Oct 07, 2024
Figure 1 for Stage-Wise and Prior-Aware Neural Speech Phase Prediction
Figure 2 for Stage-Wise and Prior-Aware Neural Speech Phase Prediction
Figure 3 for Stage-Wise and Prior-Aware Neural Speech Phase Prediction
Figure 4 for Stage-Wise and Prior-Aware Neural Speech Phase Prediction
Viaarxiv icon

Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement

Add code
Sep 19, 2023
Viaarxiv icon

Incorporating Ultrasound Tongue Images for Audio-Visual Speech Enhancement through Knowledge Distillation

Add code
May 24, 2023
Viaarxiv icon