Picture for Reinhold Haeb-Umbach

Reinhold Haeb-Umbach

Loose coupling of spectral and spatial models for multi-channel diarization and enhancement of meetings in dynamic environments

Add code
Jan 22, 2026
Viaarxiv icon

On the Role of Spatial Features in Foundation-Model-Based Speaker Diarization

Add code
Jan 05, 2026
Viaarxiv icon

Synthesizing speech with selected perceptual voice qualities - A case study with creaky voice

Add code
Nov 07, 2025
Viaarxiv icon

On the Application of Diffusion Models for Simultaneous Denoising and Dereverberation

Add code
Aug 26, 2025
Viaarxiv icon

Towards Frame-level Quality Predictions of Synthetic Speech

Add code
Aug 14, 2025
Viaarxiv icon

30+ Years of Source Separation Research: Achievements and Future Challenges

Add code
Jan 21, 2025
Viaarxiv icon

Speech Synthesis along Perceptual Voice Quality Dimensions

Add code
Jan 15, 2025
Figure 1 for Speech Synthesis along Perceptual Voice Quality Dimensions
Figure 2 for Speech Synthesis along Perceptual Voice Quality Dimensions
Figure 3 for Speech Synthesis along Perceptual Voice Quality Dimensions
Figure 4 for Speech Synthesis along Perceptual Voice Quality Dimensions
Viaarxiv icon

Microphone Array Signal Processing and Deep Learning for Speech Enhancement

Add code
Jan 13, 2025
Figure 1 for Microphone Array Signal Processing and Deep Learning for Speech Enhancement
Figure 2 for Microphone Array Signal Processing and Deep Learning for Speech Enhancement
Figure 3 for Microphone Array Signal Processing and Deep Learning for Speech Enhancement
Figure 4 for Microphone Array Signal Processing and Deep Learning for Speech Enhancement
Viaarxiv icon

Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models

Add code
Oct 28, 2024
Figure 1 for Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models
Figure 2 for Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models
Figure 3 for Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models
Viaarxiv icon

Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder

Add code
Sep 05, 2024
Viaarxiv icon