Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Mask-based Neural Beamforming for Moving Speakers with Self-Attention-based Tracking



Tsubasa Ochiai , Marc Delcroix , Tomohiro Nakatani , Shoko Araki

* 11 pages, 7 figures, Submitted to IEEE/ACM Trans. Audio, Speech, and Language Processing 

   Access Paper or Ask Questions

Listen only to me! How well can target speech extraction handle false alarms?



Marc Delcroix , Keisuke Kinoshita , Tsubasa Ochiai , Katerina Zmolikova , Hiroshi Sato , Tomohiro Nakatani

* Submitted to Inerspeech 2022 

   Access Paper or Ask Questions

Subjective intelligibility of speech sounds enhanced by ideal ratio mask via crowdsourced remote experiments with effective data screening



Ayako Yamamoto , Toshio Irino , Shoko Araki , Kenichi Arai , Atsunori Ogawa , Keisuke Kinoshita , Tomohiro Nakatani

* This paper was submitted to Interspeech 2022 (http://www.interspeech2022.org

   Access Paper or Ask Questions

$\text{ISS}_2$: An Extension of Iterative Source Steering Algorithm for Majorization-Minimization-Based Independent Vector Analysis



Rintaro Ikeshita , Tomohiro Nakatani


   Access Paper or Ask Questions

Switching Independent Vector Analysis and Its Extension to Blind and Spatially Guided Convolutional Beamforming Algorithm



Tomohiro Nakatani , Rintaro Ikeshita , Keisuke Kinoshita , Hiroshi Sawada , Naoyuki Kamo , Shoko Araki

* Submitted to IEEE/ACM Trans. Audio, Speech, and Language Processing on 27 July 2021 

   Access Paper or Ask Questions

Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation



Tomohiro Nakatani , Rintaro Ikeshita , Keisuke Kinoshita , Hiroshi Sawada , Shoko Araki

* Accepted by IEEE ICASSP 2021 

   Access Paper or Ask Questions

Independent Deeply Learned Tensor Analysis for Determined Audio Source Separation



Naoki Narisawa , Rintaro Ikeshita , Norihiro Takamune , Daichi Kitamura , Tomohiko Nakamura , Hiroshi Saruwatari , Tomohiro Nakatani

* 5 pages, 2 figures, accepted for European Signal Processing Conference 2021 (EUSIPCO 2021) 

   Access Paper or Ask Questions

PILOT: Introducing Transformers for Probabilistic Sound Event Localization



Christopher Schymura , Benedikt Bönninghoff , Tsubasa Ochiai , Marc Delcroix , Keisuke Kinoshita , Tomohiro Nakatani , Shoko Araki , Dorothea Kolossa

* Accepted at INTERSPEECH 2021 

   Access Paper or Ask Questions

Comparison of remote experiments using crowdsourcing and laboratory experiments on speech intelligibility



Ayako Yamamoto , Toshio Irino , Kenichi Arai , Shoko Araki , Atsunori Ogawa , Keisuke Kinoshita , Tomohiro Nakatani

* This paper was submitted to Interspeech2021 

   Access Paper or Ask Questions

Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event Localization



Christopher Schymura , Tsubasa Ochiai , Marc Delcroix , Keisuke Kinoshita , Tomohiro Nakatani , Shoko Araki , Dorothea Kolossa

* Published in Proceedings of the 28th European Signal Processing Conference (EUSIPCO), 2020 

   Access Paper or Ask Questions

1
2
3
>>