Hirofumi Inaguma

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation


Oct 11, 2021
Yosuke Higuchi, Nanxin Chen, Yuya Fujita, Hirofumi Inaguma, Tatsuya Komatsu, Jaesong Lee, Jumon Nozaki, Tianzi Wang, Shinji Watanabe

* Accepted to ASRU2021 


ASR Rescoring and Confidence Estimation with ELECTRA


Oct 05, 2021
Hayato Futami, Hirofumi Inaguma, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara

* Accepted in ASRU2021 


Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates


Sep 27, 2021
Hirofumi Inaguma, Siddharth Dalmia, Brian Yan, Shinji Watanabe

* Accepted at IEEE ASRU 2021 


Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring


Sep 09, 2021
Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe



VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording


Jul 15, 2021
Hirofumi Inaguma, Tatsuya Kawahara

* Accepted at Interspeech 2021 


StableEmit: Selection Probability Discount for Reducing Emission Latency of Streaming Monotonic Attention ASR


Jul 15, 2021
Hirofumi Inaguma, Tatsuya Kawahara

* Accepted at Interspeech 2021 


ESPnet-ST IWSLT 2021 Offline Speech Translation System


Jul 06, 2021
Hirofumi Inaguma, Brian Yan, Siddharth Dalmia, Pengcheng Guo, Jiatong Shi, Kevin Duh, Shinji Watanabe

* IWSLT 2021 


Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation


Apr 13, 2021
Hirofumi Inaguma, Tatsuya Kawahara, Shinji Watanabe

* Accepted at NAACL-HLT 2021 (short paper) 


Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition


Feb 28, 2021
Hirofumi Inaguma, Tatsuya Kawahara



The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans


Dec 23, 2020
Shinji Watanabe, Florian Boyer, Xuankai Chang, Pengcheng Guo, Tomoki Hayashi, Yosuke Higuchi, Takaaki Hori, Wen-Chin Huang, Hirofumi Inaguma, Naoyuki Kamo, Shigeki Karita, Chenda Li, Jing Shi, Aswin Shanmugam Subramanian, Wangyou Zhang



Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder


Nov 06, 2020
Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe



Improved Mask-CTC for Non-Autoregressive End-to-End ASR


Oct 26, 2020
Yosuke Higuchi, Hirofumi Inaguma, Shinji Watanabe, Tetsuji Ogawa, Tetsunori Kobayashi

* Submitted to ICASSP2021 


Distilling the Knowledge of BERT for Sequence-to-Sequence ASR


Aug 09, 2020
Hayato Futami, Hirofumi Inaguma, Sei Ueno, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara

* Accepted in INTERSPEECH2020 


Enhancing Monotonic Multihead Attention for Streaming ASR


May 23, 2020
Hirofumi Inaguma, Masato Mimura, Tatsuya Kawahara

* Corrected AISHELL-1 results 


CTC-synchronous Training for Monotonic Attention Model


May 17, 2020
Hirofumi Inaguma, Masato Mimura, Tatsuya Kawahara



Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR


May 15, 2020
Hirofumi Inaguma, Yashesh Gaur, Liang Lu, Jinyu Li, Yifan Gong

* Accepted at IEEE ICASSP 2020 


End-to-end speech-to-dialog-act recognition


Apr 23, 2020
Viet-Trung Dang, Tianyu Zhao, Sei Ueno, Hirofumi Inaguma, Tatsuya Kawahara



ESPnet-ST: All-in-One Speech Translation Toolkit


Apr 21, 2020
Hirofumi Inaguma, Shun Kiyono, Kevin Duh, Shigeki Karita, Nelson Enrique Yalta Soplin, Tomoki Hayashi, Shinji Watanabe

* Accepted at ACL 2020 System Demonstration 


Multilingual End-to-End Speech Translation


Oct 31, 2019
Hirofumi Inaguma, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe

* Accepted to ASRU 2019 


A Comparative Study on Transformer vs RNN in Speech Applications


Sep 28, 2019
Shigeki Karita, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang

* IEEE Automatic Speech Recognition and Understanding Workshop 2019 
* Accepted at ASRU 2019 


Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR


Sep 22, 2019
Hirofumi Inaguma, Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara

* SLT2018 


Transfer learning of language-independent end-to-end ASR with language model fusion


Nov 06, 2018
Hirofumi Inaguma, Jaejin Cho, Murali Karthick Baskar, Tatsuya Kawahara, Shinji Watanabe

