Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Takaaki Hori

Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition


Jun 16, 2021
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori

* Accepted to Interspeech 2021 

  Access Paper or Ask Questions

Advanced Long-context End-to-end Speech Recognition Using Context-expanded Transformers


Apr 19, 2021
Takaaki Hori, Niko Moritz, Chiori Hori, Jonathan Le Roux

* Submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

Capturing Multi-Resolution Context by Dilated Self-Attention


Apr 07, 2021
Niko Moritz, Takaaki Hori, Jonathan Le Roux

* In Proc. ICASSP 2021 

  Access Paper or Ask Questions

The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans


Dec 23, 2020
Shinji Watanabe, Florian Boyer, Xuankai Chang, Pengcheng Guo, Tomoki Hayashi, Yosuke Higuchi, Takaaki Hori, Wen-Chin Huang, Hirofumi Inaguma, Naoyuki Kamo, Shigeki Karita, Chenda Li, Jing Shi, Aswin Shanmugam Subramanian, Wangyou Zhang


  Access Paper or Ask Questions

Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training


Nov 26, 2020
Sameer Khurana, Niko Moritz, Takaaki Hori, Jonathan Le Roux

* Submitted to ICASSP 2021 

  Access Paper or Ask Questions

Semi-Supervised Speech Recognition via Graph-based Temporal Classification


Oct 29, 2020
Niko Moritz, Takaaki Hori, Jonathan Le Roux

* Submitted to ICASSP 2021 

  Access Paper or Ask Questions

Multi-Pass Transformer for Machine Translation


Sep 23, 2020
Peng Gao, Chiori Hori, Shijie Geng, Takaaki Hori, Jonathan Le Roux

* 10 pages, 5 figures and 2 tables 

  Access Paper or Ask Questions

Unsupervised Speaker Adaptation using Attention-based Speaker Memory for End-to-End ASR


Feb 14, 2020
Leda Sarı, Niko Moritz, Takaaki Hori, Jonathan Le Roux

* To appear in Proc. ICASSP 2020 

  Access Paper or Ask Questions

Streaming automatic speech recognition with the transformer model


Jan 09, 2020
Niko Moritz, Takaaki Hori, Jonathan Le Roux


  Access Paper or Ask Questions

A Comparative Study on Transformer vs RNN in Speech Applications


Sep 28, 2019
Shigeki Karita, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang

* IEEE Automatic Speech Recognition and Understanding Workshop 2019 
* Accepted at ASRU 2019 

  Access Paper or Ask Questions

Multi-Stream End-to-End Speech Recognition


Jun 17, 2019
Ruizhi Li, Xiaofei Wang, Sri Harish Mallidi, Shinji Watanabe, Takaaki Hori, Hynek Hermansky

* submitted to IEEE TASLP. arXiv admin note: substantial text overlap with arXiv:1811.04897, arXiv:1811.04903 

  Access Paper or Ask Questions

Self-supervised Sequence-to-sequence ASR using Unpaired Speech and Text


Apr 30, 2019
Murali Karthick Baskar, Shinji Watanabe, Ramon Astudillo, Takaaki Hori, Lukáš Burget, Jan Černocký


  Access Paper or Ask Questions

Stream attention-based multi-array end-to-end speech recognition


Nov 12, 2018
Xiaofei Wang, Ruizhi Li, Sri Harish Mallid, Takaaki Hori, Shinji Watanabe, Hynek Hermansky


  Access Paper or Ask Questions

Multi-encoder multi-resolution framework for end-to-end speech recognition


Nov 12, 2018
Ruizhi Li, Xiaofei Wang, Sri Harish Mallidi, Takaaki Hori, Shinji Watanabe, Hynek Hermansky


  Access Paper or Ask Questions

Vectorization of hypotheses and speech for faster beam search in encoder decoder-based speech recognition


Nov 12, 2018
Hiroshi Seki, Takaaki Hori, Shinji Watanabe


  Access Paper or Ask Questions

Analysis of Multilingual Sequence-to-Sequence speech recognition systems


Nov 07, 2018
Martin Karafiát, Murali Karthick Baskar, Shinji Watanabe, Takaaki Hori, Matthew Wiesner, Jan "Honza'' Černocký

* arXiv admin note: text overlap with arXiv:1810.03459 

  Access Paper or Ask Questions

Promising Accurate Prefix Boosting for sequence-to-sequence ASR


Nov 07, 2018
Murali Karthick Baskar, Lukáš Burget, Shinji Watanabe, Martin Karafiát, Takaaki Hori, Jan Honza Černocký


  Access Paper or Ask Questions

CNN-based MultiChannel End-to-End Speech Recognition for everyday home environments


Nov 07, 2018
Nelson Yalta, Shinji Watanabe, Takaaki Hori, Kazuhiro Nakadai, Tetsuya Ogata

* 5 pages, 1 figure 

  Access Paper or Ask Questions

Cycle-consistency training for end-to-end speech recognition


Nov 02, 2018
Takaaki Hori, Ramon Astudillo, Tomoki Hayashi, Yu Zhang, Shinji Watanabe, Jonathan Le Roux

* Submitted to ICASSP'19 

  Access Paper or Ask Questions

End-to-end Speech Recognition with Word-based RNN Language Models


Aug 08, 2018
Takaaki Hori, Jaejin Cho, Shinji Watanabe


  Access Paper or Ask Questions

Back-Translation-Style Data Augmentation for End-to-End ASR


Jul 28, 2018
Tomoki Hayashi, Shinji Watanabe, Yu Zhang, Tomoki Toda, Takaaki Hori, Ramon Astudillo, Kazuya Takeda


  Access Paper or Ask Questions

End-to-End Audio Visual Scene-Aware Dialog using Multimodal Attention-Based Video Features


Jun 30, 2018
Chiori Hori, Huda Alamri, Jue Wang, Gordon Wichern, Takaaki Hori, Anoop Cherian, Tim K. Marks, Vincent Cartillier, Raphael Gontijo Lopes, Abhishek Das, Irfan Essa, Dhruv Batra, Devi Parikh

* A prototype system for the Audio Visual Scene-aware Dialog (AVSD) at DSTC7 

  Access Paper or Ask Questions

A Purely End-to-end System for Multi-speaker Speech Recognition


May 15, 2018
Hiroshi Seki, Takaaki Hori, Shinji Watanabe, Jonathan Le Roux, John R. Hershey

* ACL 2018 

  Access Paper or Ask Questions

ESPnet: End-to-End Speech Processing Toolkit


Mar 30, 2018
Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala, Tsubasa Ochiai


  Access Paper or Ask Questions

End-to-end Conversation Modeling Track in DSTC6


Jan 30, 2018
Chiori Hori, Takaaki Hori

* Spoken dialog systems, End-to-End, conversation modeling, DSTC, DSTC6 

  Access Paper or Ask Questions

Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM


Jun 08, 2017
Takaaki Hori, Shinji Watanabe, Yu Zhang, William Chan

* Accepted for INTERSPEECH 2017 

  Access Paper or Ask Questions

Multichannel End-to-end Speech Recognition


Mar 14, 2017
Tsubasa Ochiai, Shinji Watanabe, Takaaki Hori, John R. Hershey


  Access Paper or Ask Questions

Attention-Based Multimodal Fusion for Video Description


Mar 09, 2017
Chiori Hori, Takaaki Hori, Teng-Yok Lee, Kazuhiro Sumi, John R. Hershey, Tim K. Marks

* Resubmitted to the rebuttal for CVPR 2017 for review, 8 pages, 4 figures 

  Access Paper or Ask Questions