Title: Monaural source separation: From anechoic to reverberant environments

Nov 15, 2021

Tobias Cord-Landwehr, Christoph Boeddeker, Thilo von Neumann, Catalin Zorila, Rama Doddipatla, Reinhold Haeb-Umbach

Abstract: Impressive progress in neural network-based single-channel speech source separation has been made in recent years. But those improvements have been mostly reported on anechoic data, a condition that is rarely met in practice. Taking the SepFormer as a starting point, which achieves state-of-the-art performance on anechoic mixtures, we gradually modify it to optimize its performance on reverberant mixtures. Although this leads to a word error rate improvement of 8 percentage points compared to the standard SepFormer implementation, the system ends up with only marginally better performance than our improved PIT-BLSTM separation system, which is optimized with rather straightforward means. This is surprising and at the same time sobering, challenging the practical usefulness of many improvements reported in recent years for monaural source separation on nonreverberant data.

* Submitted for ICASSP 2022
