Alert button
Picture for Kohei Yatabe

Kohei Yatabe

Alert button

SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping

Add code
Bookmark button
Alert button
Mar 31, 2022
Yuma Koizumi, Heiga Zen, Kohei Yatabe, Nanxin Chen, Michiel Bacchiani

Figure 1 for SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping
Figure 2 for SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping
Figure 3 for SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping
Figure 4 for SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping
Viaarxiv icon

Wearable SELD dataset: Dataset for sound event localization and detection using wearable devices around head

Add code
Bookmark button
Alert button
Feb 17, 2022
Kento Nagatomo, Masahiro Yasuda, Kohei Yatabe, Shoichiro Saito, Yasuhiro Oikawa

Figure 1 for Wearable SELD dataset: Dataset for sound event localization and detection using wearable devices around head
Figure 2 for Wearable SELD dataset: Dataset for sound event localization and detection using wearable devices around head
Figure 3 for Wearable SELD dataset: Dataset for sound event localization and detection using wearable devices around head
Figure 4 for Wearable SELD dataset: Dataset for sound event localization and detection using wearable devices around head
Viaarxiv icon

APPLADE: Adjustable Plug-and-play Audio Declipper Combining DNN with Sparse Optimization

Add code
Bookmark button
Alert button
Feb 16, 2022
Tomoro Tanaka, Kohei Yatabe, Masahiro Yasuda, Yasuhiro Oikawa

Figure 1 for APPLADE: Adjustable Plug-and-play Audio Declipper Combining DNN with Sparse Optimization
Figure 2 for APPLADE: Adjustable Plug-and-play Audio Declipper Combining DNN with Sparse Optimization
Figure 3 for APPLADE: Adjustable Plug-and-play Audio Declipper Combining DNN with Sparse Optimization
Figure 4 for APPLADE: Adjustable Plug-and-play Audio Declipper Combining DNN with Sparse Optimization
Viaarxiv icon

Safeguarding test signals for acoustic measurement using arbitrary sounds

Add code
Bookmark button
Alert button
Dec 21, 2021
Hideki Kawahara, Kohei Yatabe

Figure 1 for Safeguarding test signals for acoustic measurement using arbitrary sounds
Figure 2 for Safeguarding test signals for acoustic measurement using arbitrary sounds
Figure 3 for Safeguarding test signals for acoustic measurement using arbitrary sounds
Figure 4 for Safeguarding test signals for acoustic measurement using arbitrary sounds
Viaarxiv icon

Objective measurement of pitch extractors' responses to frequency modulated sounds and two reference pitch extraction methods for analyzing voice pitch responses to auditory stimulation

Add code
Bookmark button
Alert button
Nov 05, 2021
Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Tatsuya Kitamura, Hideki Banno, Masanori Morise

Figure 1 for Objective measurement of pitch extractors' responses to frequency modulated sounds and two reference pitch extraction methods for analyzing voice pitch responses to auditory stimulation
Figure 2 for Objective measurement of pitch extractors' responses to frequency modulated sounds and two reference pitch extraction methods for analyzing voice pitch responses to auditory stimulation
Figure 3 for Objective measurement of pitch extractors' responses to frequency modulated sounds and two reference pitch extraction methods for analyzing voice pitch responses to auditory stimulation
Figure 4 for Objective measurement of pitch extractors' responses to frequency modulated sounds and two reference pitch extraction methods for analyzing voice pitch responses to auditory stimulation
Viaarxiv icon

Design of Tight Minimum-Sidelobe Windows by Riemannian Newton's Method

Add code
Bookmark button
Alert button
Nov 02, 2021
Daichi Kitahara, Kohei Yatabe

Figure 1 for Design of Tight Minimum-Sidelobe Windows by Riemannian Newton's Method
Figure 2 for Design of Tight Minimum-Sidelobe Windows by Riemannian Newton's Method
Viaarxiv icon

Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method

Add code
Bookmark button
Alert button
May 10, 2021
Koichi Saito, Tomohiko Nakamura, Kohei Yatabe, Yuma Koizumi, Hiroshi Saruwatari

Figure 1 for Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method
Figure 2 for Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method
Figure 3 for Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method
Figure 4 for Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method
Viaarxiv icon

Sparse time-frequency representation via atomic norm minimization

Add code
Bookmark button
Alert button
May 07, 2021
Tsubasa Kusano, Kohei Yatabe, Yasuhiro Oikawa

Figure 1 for Sparse time-frequency representation via atomic norm minimization
Figure 2 for Sparse time-frequency representation via atomic norm minimization
Figure 3 for Sparse time-frequency representation via atomic norm minimization
Figure 4 for Sparse time-frequency representation via atomic norm minimization
Viaarxiv icon

Mixture of orthogonal sequences made from extended time-stretched pulses enables measurement of involuntary voice fundamental frequency response to pitch perturbation

Add code
Bookmark button
Alert button
Apr 03, 2021
Hideki Kawahara, Toshie Matsui, Kohei Yatabe, Ken-Ichi Sakakibara, Minoru Tsuzaki, Masanori Morise, Toshio Irino

Figure 1 for Mixture of orthogonal sequences made from extended time-stretched pulses enables measurement of involuntary voice fundamental frequency response to pitch perturbation
Figure 2 for Mixture of orthogonal sequences made from extended time-stretched pulses enables measurement of involuntary voice fundamental frequency response to pitch perturbation
Figure 3 for Mixture of orthogonal sequences made from extended time-stretched pulses enables measurement of involuntary voice fundamental frequency response to pitch perturbation
Figure 4 for Mixture of orthogonal sequences made from extended time-stretched pulses enables measurement of involuntary voice fundamental frequency response to pitch perturbation
Viaarxiv icon

Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech

Add code
Bookmark button
Alert button
Jan 21, 2021
Takuya Fujimura, Yuma Koizumi, Kohei Yatabe, Ryoichi Miyazaki

Figure 1 for Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech
Figure 2 for Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech
Figure 3 for Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech
Figure 4 for Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech
Viaarxiv icon