Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Shinji Watanabe

On Prosody Modeling for ASR+TTS based Voice Conversion


Jul 20, 2021
Wen-Chin Huang, Tomoki Hayashi, Xinjian Li, Shinji Watanabe, Tomoki Toda

* Submitted to ASRU2021. Under review 

  Access Paper or Ask Questions

Streaming End-to-End ASR based on Blockwise Non-Autoregressive Models


Jul 20, 2021
Tianzi Wang, Yuya Fujita, Xuankai Chang, Shinji Watanabe

* 5 pages, 1 figures, Interspeech21 conference 

  Access Paper or Ask Questions

Speech Representation Learning Combining Conformer CPC with Deep Cluster for the ZeroSpeech Challenge 2021


Jul 13, 2021
Takashi Maekaku, Xuankai Chang, Yuya Fujita, Li-Wei Chen, Shinji Watanabe, Alexander Rudnicky


  Access Paper or Ask Questions

ESPnet-ST IWSLT 2021 Offline Speech Translation System


Jul 06, 2021
Hirofumi Inaguma, Brian Yan, Siddharth Dalmia, Pengcheng Guo, Jiatong Shi, Kevin Duh, Shinji Watanabe

* IWSLT 2021 

  Access Paper or Ask Questions

Towards Neural Diarization for Unlimited Numbers of Speakers Using Global and Local Attractors


Jul 04, 2021
Shota Horiguchi, Shinji Watanabe, Paola Garcia, Yawen Xue, Yuki Takashima, Yohei Kawaguchi


  Access Paper or Ask Questions

Rethinking End-to-End Evaluation of Decomposable Tasks: A Case Study on Spoken Language Understanding


Jun 29, 2021
Siddhant Arora, Alissa Ostapenko, Vijay Viswanathan, Siddharth Dalmia, Florian Metze, Shinji Watanabe, Alan W Black

* INTERSPEECH 2021 

  Access Paper or Ask Questions

Encoder-Decoder Based Attractor Calculation for End-to-End Neural Diarization


Jun 20, 2021
Shota Horiguchi, Yusuke Fujita, Shinji Watanabe, Yawen Xue, Paola Garcia

* Submitted to IEEE TASLP. This article is based on our previous conference paper arxiv:2005.09921 

  Access Paper or Ask Questions

Multi-mode Transformer Transducer with Stochastic Future Context


Jun 17, 2021
Kwangyoun Kim, Felix Wu, Prashant Sridhar, Kyu J. Han, Shinji Watanabe

* Accepted to Interspeech 2021 

  Access Paper or Ask Questions

Layer Pruning on Demand with Intermediate CTC


Jun 17, 2021
Jaesong Lee, Jingu Kang, Shinji Watanabe

* Interspeech 2021 

  Access Paper or Ask Questions

Multi-Speaker ASR Combining Non-Autoregressive Conformer CTC and Conditional Speaker Chain


Jun 16, 2021
Pengcheng Guo, Xuankai Chang, Shinji Watanabe, Lei Xie

* Accepted by Interspeech 2021 

  Access Paper or Ask Questions

GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio


Jun 13, 2021
Guoguo Chen, Shuzhou Chai, Guanbo Wang, Jiayu Du, Wei-Qiang Zhang, Chao Weng, Dan Su, Daniel Povey, Jan Trmal, Junbo Zhang, Mingjie Jin, Sanjeev Khudanpur, Shinji Watanabe, Shuaijiang Zhao, Wei Zou, Xiangang Li, Xuchen Yao, Yongqing Wang, Yujun Wang, Zhao You, Zhiyong Yan


  Access Paper or Ask Questions

Leveraging Pre-trained Language Model for Speech Sentiment Analysis


Jun 11, 2021
Suwon Shon, Pablo Brusco, Jing Pan, Kyu J. Han, Shinji Watanabe

* To appear in Interspeech 2021 

  Access Paper or Ask Questions

Semi-Supervised Training with Pseudo-Labeling for End-to-End Neural Diarization


Jun 09, 2021
Yuki Takashima, Yusuke Fujita, Shota Horiguchi, Shinji Watanabe, Paola Garc铆a, Kenji Nagamatsu

* Accepted for Interspeech 2021 

  Access Paper or Ask Questions

End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection


Jun 08, 2021
Yuki Takashima, Yusuke Fujita, Shinji Watanabe, Shota Horiguchi, Paola Garc铆a, Kenji Nagamatsu

* IEEE Spoken Language Technology Workshop (SLT), 2021, pp. 849-856 
* Accepted for SLT 2021 

  Access Paper or Ask Questions

Data Augmentation Methods for End-to-end Speech Recognition on Distant-Talk Scenarios


Jun 07, 2021
Emiru Tsunoo, Kentaro Shibata, Chaitanya Narisetty, Yosuke Kashiwagi, Shinji Watanabe

* Accepted for Interspeech2021 

  Access Paper or Ask Questions

Self-Guided Curriculum Learning for Neural Machine Translation


May 15, 2021
Lei Zhou, Liang Ding, Kevin Duh, Shinji Watanabe, Ryohei Sasano, Koichi Takeda

* Work in progress 

  Access Paper or Ask Questions

End-to-End Diarization for Variable Number of Speakers with Local-Global Networks and Discriminative Speaker Embeddings


May 05, 2021
Soumi Maiti, Hakan Erdogan, Kevin Wilson, Scott Wisdom, Shinji Watanabe, John R. Hershey

* ICASSP 2021, SPE-54.1 
* 5 pages, 2 figures, ICASSP 2021 

  Access Paper or Ask Questions

SUPERB: Speech processing Universal PERformance Benchmark


May 03, 2021
Shu-wen Yang, Po-Han Chi, Yung-Sung Chuang, Cheng-I Jeff Lai, Kushal Lakhotia, Yist Y. Lin, Andy T. Liu, Jiatong Shi, Xuankai Chang, Guan-Ting Lin, Tzu-Hsien Huang, Wei-Cheng Tseng, Ko-tik Lee, Da-Rong Liu, Zili Huang, Shuyan Dong, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee

* Submitted to Interspeech 2021 

  Access Paper or Ask Questions

Searchable Hidden Intermediates for End-to-End Models of Decomposable Sequence Tasks


May 02, 2021
Siddharth Dalmia, Brian Yan, Vikas Raunak, Florian Metze, Shinji Watanabe

* NAACL 2021. All code and models are released as part of the ESPnet toolkit: https://github.com/espnet/espnet 

  Access Paper or Ask Questions

EAT: Enhanced ASR-TTS for Self-supervised Speech Recognition


Apr 13, 2021
Murali Karthick Baskar, Luk谩拧 Burget, Shinji Watanabe, Ramon Fernandez Astudillo, Jan "Honza'' 膶ernock媒


  Access Paper or Ask Questions

Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation


Apr 13, 2021
Hirofumi Inaguma, Tatsuya Kawahara, Shinji Watanabe

* Accepted at NAACL-HLT 2021 (short paper) 

  Access Paper or Ask Questions

SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition


Apr 06, 2021
Patrick K. O'Neill, Vitaly Lavrukhin, Somshubra Majumdar, Vahid Noroozi, Yuekai Zhang, Oleksii Kuchaiev, Jagadeesh Balam, Yuliya Dovzhenko, Keenan Freyberg, Michael D. Shulman, Boris Ginsburg, Shinji Watanabe, Georg Kucsko

* 5 pages, 1 figure. Submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing


Apr 02, 2021
Wei Rao, Yihui Fu, Yanxin Hu, Xin Xu, Yvkai Jv, Jiangyu Han, Zhongjie Jiang, Lei Xie, Yannan Wang, Shinji Watanabe, Zheng-Hua Tan, Hui Bu, Tao Yu, Shidong Shang

* 5 pages, submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

Leveraging End-to-End ASR for Endangered Language Documentation: An Empirical Study on Yolox贸chitl Mixtec


Feb 26, 2021
Jiatong Shi. Jonathan D. Amith, Rey Castillo Garc铆a, Esteban Guadalupe Sierra, Kevin Duh, Shinji Watanabe

* Accepted by EACL2021 

  Access Paper or Ask Questions

Dual-Path Modeling for Long Recording Speech Separation in Meetings


Feb 23, 2021
Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita, Marc Delcroix, Shinji Watanabe, Yanmin Qian

* Accepted by ICASSP 2021 

  Access Paper or Ask Questions

End-to-End Dereverberation, Beamforming, and Speech Recognition with Improved Numerical Stability and Advanced Frontend


Feb 23, 2021
Wangyou Zhang, Christoph Boeddeker, Shinji Watanabe, Tomohiro Nakatani, Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Naoyuki Kamo, Reinhold Haeb-Umbach, Yanmin Qian

* 5 pages, 1 figure, accepted by ICASSP 2021 

  Access Paper or Ask Questions

Gaussian Kernelized Self-Attention for Long Sequence Data and Its Application to CTC-based Speech Recognition


Feb 18, 2021
Yosuke Kashiwagi, Emiru Tsunoo, Shinji Watanabe

* Accepted to ICASSP2021 

  Access Paper or Ask Questions

Deep Learning based Multi-Source Localization with Source Splitting and its Effectiveness in Multi-Talker Speech Recognition


Feb 16, 2021
Aswin Shanmugam Subramanian, Chao Weng, Shinji Watanabe, Meng Yu, Dong Yu

* Submitted to Computer Speech & Language 

  Access Paper or Ask Questions

Intermediate Loss Regularization for CTC-based Speech Recognition


Feb 05, 2021
Jaesong Lee, Shinji Watanabe

* Accepted at ICASSP 2021 

  Access Paper or Ask Questions

The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap


Feb 02, 2021
Shota Horiguchi, Nelson Yalta, Paola Garcia, Yuki Takashima, Yawen Xue, Desh Raj, Zili Huang, Yusuke Fujita, Shinji Watanabe, Sanjeev Khudanpur


  Access Paper or Ask Questions