Alert button
Picture for Yuma Koizumi

Yuma Koizumi

Alert button

Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method

Add code
Bookmark button
Alert button
May 10, 2021
Koichi Saito, Tomohiko Nakamura, Kohei Yatabe, Yuma Koizumi, Hiroshi Saruwatari

Figure 1 for Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method
Figure 2 for Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method
Figure 3 for Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method
Figure 4 for Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method
Viaarxiv icon

Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech

Add code
Bookmark button
Alert button
Jan 21, 2021
Takuya Fujimura, Yuma Koizumi, Kohei Yatabe, Ryoichi Miyazaki

Figure 1 for Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech
Figure 2 for Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech
Figure 3 for Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech
Figure 4 for Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech
Viaarxiv icon

Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval

Add code
Bookmark button
Alert button
Dec 14, 2020
Yuma Koizumi, Yasunori Ohishi, Daisuke Niizumi, Daiki Takeuchi, Masahiro Yasuda

Figure 1 for Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval
Figure 2 for Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval
Figure 3 for Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval
Figure 4 for Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval
Viaarxiv icon

Effects of Word-frequency based Pre- and Post- Processings for Audio Captioning

Add code
Bookmark button
Alert button
Sep 24, 2020
Daiki Takeuchi, Yuma Koizumi, Yasunori Ohishi, Noboru Harada, Kunio Kashino

Figure 1 for Effects of Word-frequency based Pre- and Post- Processings for Audio Captioning
Figure 2 for Effects of Word-frequency based Pre- and Post- Processings for Audio Captioning
Figure 3 for Effects of Word-frequency based Pre- and Post- Processings for Audio Captioning
Viaarxiv icon

The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning with Keywords and Sentence Length Estimation

Add code
Bookmark button
Alert button
Jul 01, 2020
Yuma Koizumi, Daiki Takeuchi, Yasunori Ohishi, Noboru Harada, Kunio Kashino

Figure 1 for The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning with Keywords and Sentence Length Estimation
Figure 2 for The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning with Keywords and Sentence Length Estimation
Figure 3 for The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning with Keywords and Sentence Length Estimation
Viaarxiv icon

A Transformer-based Audio Captioning Model with Keyword Estimation

Add code
Bookmark button
Alert button
Jul 01, 2020
Yuma Koizumi, Ryo Masumura, Kyosuke Nishida, Masahiro Yasuda, Shoichiro Saito

Figure 1 for A Transformer-based Audio Captioning Model with Keyword Estimation
Figure 2 for A Transformer-based Audio Captioning Model with Keyword Estimation
Figure 3 for A Transformer-based Audio Captioning Model with Keyword Estimation
Viaarxiv icon

Description and Discussion on DCASE2020 Challenge Task2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring

Add code
Bookmark button
Alert button
Jun 10, 2020
Yuma Koizumi, Yohei Kawaguchi, Keisuke Imoto, Toshiki Nakamura, Yuki Nikaido, Ryo Tanabe, Harsh Purohit, Kaori Suefusa, Takashi Endo, Masahiro Yasuda, Noboru Harada

Figure 1 for Description and Discussion on DCASE2020 Challenge Task2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring
Viaarxiv icon

Listen to What You Want: Neural Network-based Universal Sound Selector

Add code
Bookmark button
Alert button
Jun 10, 2020
Tsubasa Ochiai, Marc Delcroix, Yuma Koizumi, Hiroaki Ito, Keisuke Kinoshita, Shoko Araki

Figure 1 for Listen to What You Want: Neural Network-based Universal Sound Selector
Figure 2 for Listen to What You Want: Neural Network-based Universal Sound Selector
Figure 3 for Listen to What You Want: Neural Network-based Universal Sound Selector
Figure 4 for Listen to What You Want: Neural Network-based Universal Sound Selector
Viaarxiv icon

Stable Training of DNN for Speech Enhancement based on Perceptually-Motivated Black-Box Cost Function

Add code
Bookmark button
Alert button
Feb 14, 2020
Masaki Kawanaka, Yuma Koizumi, Ryoichi Miyazaki, Kohei Yatabe

Figure 1 for Stable Training of DNN for Speech Enhancement based on Perceptually-Motivated Black-Box Cost Function
Figure 2 for Stable Training of DNN for Speech Enhancement based on Perceptually-Motivated Black-Box Cost Function
Figure 3 for Stable Training of DNN for Speech Enhancement based on Perceptually-Motivated Black-Box Cost Function
Figure 4 for Stable Training of DNN for Speech Enhancement based on Perceptually-Motivated Black-Box Cost Function
Viaarxiv icon

Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention

Add code
Bookmark button
Alert button
Feb 14, 2020
Yuma Koizumi, Kohei Yatabe, Marc Delcroix, Yoshiki Masuyama, Daiki Takeuchi

Figure 1 for Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention
Figure 2 for Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention
Figure 3 for Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention
Figure 4 for Speech Enhancement using Self-Adaptation and Multi-Head Self-Attention
Viaarxiv icon