Speech Enhancement


Speech enhancement is the process of improving the quality of speech signals by removing noise and other distortions.

ArrayDPS-Refine: Generative Refinement of Discriminative Multi-Channel Speech Enhancement

Add code
Mar 25, 2026
Viaarxiv icon

Unified Diffusion Refinement for Multi-Channel Speech Enhancement and Separation

Add code
Mar 25, 2026
Viaarxiv icon

AdaLTM: Adaptive Layer-wise Task Vector Merging for Categorical Speech Emotion Recognition with ASR Knowledge Integration

Add code
Mar 26, 2026
Viaarxiv icon

Cinematic Audio Source Separation Using Visual Cues

Add code
Mar 27, 2026
Viaarxiv icon

When AVSR Meets Video Conferencing: Dataset, Degradation, and the Hidden Mechanism Behind Performance Collapse

Add code
Mar 24, 2026
Viaarxiv icon

DiT-Flow: Speech Enhancement Robust to Multiple Distortions based on Flow Matching in Latent Space and Diffusion Transformers

Add code
Mar 23, 2026
Viaarxiv icon

Autoregressive Guidance of Deep Spatially Selective Filters using Bayesian Tracking for Efficient Extraction of Moving Speakers

Add code
Mar 24, 2026
Viaarxiv icon

Robust Multilingual Text-to-Pictogram Mapping for Scalable Reading Rehabilitation

Add code
Mar 25, 2026
Viaarxiv icon

SelfTTS: cross-speaker style transfer through explicit embedding disentanglement and self-refinement using self-augmentation

Add code
Mar 23, 2026
Viaarxiv icon

InterDyad: Interactive Dyadic Speech-to-Video Generation by Querying Intermediate Visual Guidance

Add code
Mar 24, 2026
Viaarxiv icon