Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Monaural source enhancement maximizing source-to-distortion ratio via automatic differentiation

Jun 15, 2018

Hiroaki Nakajima, Yu Takahashi, Kazunobu Kondo, Yuji Hisaminato

Figure 1 for Monaural source enhancement maximizing source-to-distortion ratio via automatic differentiation

Figure 2 for Monaural source enhancement maximizing source-to-distortion ratio via automatic differentiation

Figure 3 for Monaural source enhancement maximizing source-to-distortion ratio via automatic differentiation

Figure 4 for Monaural source enhancement maximizing source-to-distortion ratio via automatic differentiation

Share this with someone who'll enjoy it:

Abstract:Recently, deep neural network (DNN) has made a breakthrough in monaural source enhancement. Through a training step by using a large amount of data, DNN estimates a mapping between mixed signals and clean signals. At this time, we use an objective function that numerically expresses the quality of a mapping by DNN. In the conventional methods, L1 norm, L2 norm, and Itakura-Saito divergence are often used as objective functions. Recently, an objective function based on short-time objective intelligibility (STOI) has also been proposed. However, these functions only indicate similarity between the clean signal and the estimated signal by DNN. In other words, they do not show the quality of noise reduction or source enhancement. Motivated by the fact, this paper adopts signal-to-distortion ratio (SDR) as the objective function. Since SDR virtually shows signal-to-noise ratio (SNR), maximizing SDR solves the above problem. The experimental results revealed that the proposed method achieved better performance than the conventional methods.

* This paper is submitted to 16th International Workshop on Acoustic Signal Enhancement (IWAENC)

View paper on

Share this with someone who'll enjoy it:

Title:Monaural source enhancement maximizing source-to-distortion ratio via automatic differentiation

Paper and Code