Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations



Hiroshi Sato , Tsubasa Ochiai , Marc Delcroix , Keisuke Kinoshita , Takafumi Moriya , Naoki Makishima , Mana Ihori , Tomohiro Tanaka , Ryo Masumura

* 5 pages, 2 figures, 3 tables Submitted to Interspeech 2022 

   Access Paper or Ask Questions

Listen only to me! How well can target speech extraction handle false alarms?



Marc Delcroix , Keisuke Kinoshita , Tsubasa Ochiai , Katerina Zmolikova , Hiroshi Sato , Tomohiro Nakatani

* Submitted to Inerspeech 2022 

   Access Paper or Ask Questions

SoundBeam: Target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learning



Marc Delcroix , Jorge Bennasar Vázquez , Tsubasa Ochiai , Keisuke Kinoshita , Yasunori Ohishi , Shoko Araki

* Submitted to IEEE/ACM Trans. Audio, Speech, and Language Processing 

   Access Paper or Ask Questions

Subjective intelligibility of speech sounds enhanced by ideal ratio mask via crowdsourced remote experiments with effective data screening



Ayako Yamamoto , Toshio Irino , Shoko Araki , Kenichi Arai , Atsunori Ogawa , Keisuke Kinoshita , Tomohiro Nakatani

* This paper was submitted to Interspeech 2022 (http://www.interspeech2022.org

   Access Paper or Ask Questions

Tight integration of neural- and clustering-based diarization through deep unfolding of infinite Gaussian mixture model



Keisuke Kinoshita , Marc Delcroix , Tomoharu Iwata

* Accepted to IEEE ICASSP-2022, 5 pages, 2 figures 

   Access Paper or Ask Questions

Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition



Hiroshi Sato , Tsubasa Ochiai , Marc Delcroix , Keisuke Kinoshita , Naoyuki Kamo , Takafumi Moriya

* 5 pages, 2 figures 

   Access Paper or Ask Questions

Switching Independent Vector Analysis and Its Extension to Blind and Spatially Guided Convolutional Beamforming Algorithm



Tomohiro Nakatani , Rintaro Ikeshita , Keisuke Kinoshita , Hiroshi Sawada , Naoyuki Kamo , Shoko Araki

* Submitted to IEEE/ACM Trans. Audio, Speech, and Language Processing on 27 July 2021 

   Access Paper or Ask Questions

SA-SDR: A novel loss function for separation of meeting style data



Thilo von Neumann , Keisuke Kinoshita , Christoph Boeddeker , Marc Delcroix , Reinhold Haeb-Umbach

* submitted to ICASSP 2022 

   Access Paper or Ask Questions

Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation



Tomohiro Nakatani , Rintaro Ikeshita , Keisuke Kinoshita , Hiroshi Sawada , Shoko Araki

* Accepted by IEEE ICASSP 2021 

   Access Paper or Ask Questions

Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers



Thilo von Neumann , Keisuke Kinoshita , Christoph Boeddeker , Marc Delcroix , Reinhold Haeb-Umbach

* Accepted at INTERSPEECH 2021 

   Access Paper or Ask Questions

1
2
3
4
>>