Jan Østergaard

Aalborg University

Exploring Resolution-Wise Shared Attention in Hybrid Mamba-U-Nets for Improved Cross-Corpus Speech Enhancement

Oct 02, 2025

A Steered Response Power Method for Sound Source Localization With Generic Acoustic Models

Sep 19, 2025

Learning Robust Spatial Representations from Binaural Audio through Feature Distillation

Aug 28, 2025

Head-steered channel selection method for hearing aid applications using remote microphones

Aug 09, 2025

MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement

Jul 01, 2025

xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement

Jan 10, 2025

Noise-Robust Target-Speaker Voice Activity Detection Through Self-Supervised Pretraining

Jan 06, 2025

Room impulse response prototyping using receiver distance estimations for high quality room equalisation algorithms

Sep 16, 2024

The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems

Jun 10, 2024

Deep low-latency joint speech transmission and enhancement over a gaussian channel

Apr 30, 2024