Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Shigeki Karita

DF-Conformer: Integrated architecture of Conv-TasNet and Conformer using linear complexity self-attention for speech enhancement


Jun 30, 2021
Yuma Koizumi, Shigeki Karita, Scott Wisdom, Hakan Erdogan, John R. Hershey, Llion Jones, Michiel Bacchiani

* 5 pages, 2 figure. submitted to WASPAA 2021 

  Access Paper or Ask Questions

A Comparative Study on Neural Architectures and Training Methods for Japanese Speech Recognition


Jun 09, 2021
Shigeki Karita, Yotaro Kubo, Michiel Adriaan Unico Bacchiani, Llion Jones

* to be published in INTERSPEECH2021 

  Access Paper or Ask Questions

The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans


Dec 23, 2020
Shinji Watanabe, Florian Boyer, Xuankai Chang, Pengcheng Guo, Tomoki Hayashi, Yosuke Higuchi, Takaaki Hori, Wen-Chin Huang, Hirofumi Inaguma, Naoyuki Kamo, Shigeki Karita, Chenda Li, Jing Shi, Aswin Shanmugam Subramanian, Wangyou Zhang


  Access Paper or Ask Questions

Unsupervised Learning of Disentangled Speech Content and Style Representation


Oct 24, 2020
Andros Tjandra, Ruoming Pang, Yu Zhang, Shigeki Karita

* Submitted to ICASSP 2021 

  Access Paper or Ask Questions

ESPnet-ST: All-in-One Speech Translation Toolkit


Apr 21, 2020
Hirofumi Inaguma, Shun Kiyono, Kevin Duh, Shigeki Karita, Nelson Enrique Yalta Soplin, Tomoki Hayashi, Shinji Watanabe

* Accepted at ACL 2020 System Demonstration 

  Access Paper or Ask Questions

A Comparative Study on Transformer vs RNN in Speech Applications


Sep 28, 2019
Shigeki Karita, Nanxin Chen, Tomoki Hayashi, Takaaki Hori, Hirofumi Inaguma, Ziyan Jiang, Masao Someki, Nelson Enrique Yalta Soplin, Ryuichi Yamamoto, Xiaofei Wang, Shinji Watanabe, Takenori Yoshimura, Wangyou Zhang

* IEEE Automatic Speech Recognition and Understanding Workshop 2019 
* Accepted at ASRU 2019 

  Access Paper or Ask Questions

ESPnet: End-to-End Speech Processing Toolkit


Mar 30, 2018
Shinji Watanabe, Takaaki Hori, Shigeki Karita, Tomoki Hayashi, Jiro Nishitoba, Yuya Unno, Nelson Enrique Yalta Soplin, Jahn Heymann, Matthew Wiesner, Nanxin Chen, Adithya Renduchintala, Tsubasa Ochiai


  Access Paper or Ask Questions