Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Description and Discussion on DCASE 2022 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring Applying Domain Generalization Techniques



Kota Dohi , Keisuke Imoto , Noboru Harada , Daisuke Niizumi , Yuma Koizumi , Tomoya Nishida , Harsh Purohit , Takashi Endo , Masaaki Yamamoto , Yohei Kawaguchi

* arXiv admin note: substantial text overlap with arXiv:2106.04492 

   Access Paper or Ask Questions

Mask scalar prediction for improving robust automatic speech recognition



Arun Narayanan , James Walker , Sankaran Panchapagesan , Nathan Howard , Yuma Koizumi

* Submitted to Interspeech 2022 

   Access Paper or Ask Questions

SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping



Yuma Koizumi , Heiga Zen , Kohei Yatabe , Nanxin Chen , Michiel Bacchiani

* Submitted to Interspeech 2022 

   Access Paper or Ask Questions

SNRi Target Training for Joint Speech Enhancement and Recognition



Yuma Koizumi , Shigeki Karita , Arun Narayanan , Sankaran Panchapagesan , Michiel Bacchiani

* Submitted to ICASSP 2022 

   Access Paper or Ask Questions

DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement



Yuma Koizumi , Shigeki Karita , Scott Wisdom , Hakan Erdogan , John R. Hershey , Llion Jones , Michiel Bacchiani

* 5 pages, 2 figure. submitted to WASPAA 2021 

   Access Paper or Ask Questions

Description and Discussion on DCASE 2021 Challenge Task 2: Unsupervised Anomalous Sound Detection for Machine Condition Monitoring under Domain Shifted Conditions



Yohei Kawaguchi , Keisuke Imoto , Yuma Koizumi , Noboru Harada , Daisuke Niizumi , Kota Dohi , Ryo Tanabe , Harsh Purohit , Takashi Endo

* Submitted to DCASE 2021 Workshop. arXiv admin note: text overlap with arXiv:2006.05822 

   Access Paper or Ask Questions

Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant Method



Koichi Saito , Tomohiko Nakamura , Kohei Yatabe , Yuma Koizumi , Hiroshi Saruwatari

* 5 pages, 3 figures, accepted for European Signal Processing Conference 2021 (EUSIPCO 2021) 

   Access Paper or Ask Questions

Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech



Takuya Fujimura , Yuma Koizumi , Kohei Yatabe , Ryoichi Miyazaki


   Access Paper or Ask Questions

Audio Captioning using Pre-Trained Large-Scale Language Model Guided by Audio-based Similar Caption Retrieval



Yuma Koizumi , Yasunori Ohishi , Daisuke Niizumi , Daiki Takeuchi , Masahiro Yasuda

* Submitted to ICASSP 2021 

   Access Paper or Ask Questions

Effects of Word-frequency based Pre- and Post- Processings for Audio Captioning



Daiki Takeuchi , Yuma Koizumi , Yasunori Ohishi , Noboru Harada , Kunio Kashino

* Accepted to DCASE2020 Workshop 

   Access Paper or Ask Questions

1
2
3
>>