Alert button
Picture for Yosuke Higuchi

Yosuke Higuchi

Alert button

Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models

Jan 25, 2022
Keqi Deng, Zehui Yang, Shinji Watanabe, Yosuke Higuchi, Gaofeng Cheng, Pengyuan Zhang

Figure 1 for Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models
Figure 2 for Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models
Figure 3 for Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models
Figure 4 for Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models
Viaarxiv icon

An Investigation of Enhancing CTC Model for Triggered Attention-based Streaming ASR

Oct 20, 2021
Huaibo Zhao, Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi

Figure 1 for An Investigation of Enhancing CTC Model for Triggered Attention-based Streaming ASR
Figure 2 for An Investigation of Enhancing CTC Model for Triggered Attention-based Streaming ASR
Figure 3 for An Investigation of Enhancing CTC Model for Triggered Attention-based Streaming ASR
Viaarxiv icon

A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation

Oct 11, 2021
Yosuke Higuchi, Nanxin Chen, Yuya Fujita, Hirofumi Inaguma, Tatsuya Komatsu, Jaesong Lee, Jumon Nozaki, Tianzi Wang, Shinji Watanabe

Figure 1 for A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
Figure 2 for A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
Figure 3 for A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
Figure 4 for A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation
Viaarxiv icon

Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy

Oct 11, 2021
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori

Figure 1 for Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy
Figure 2 for Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy
Figure 3 for Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy
Viaarxiv icon

Hierarchical Conditional End-to-End ASR with CTC and Multi-Granular Subword Units

Oct 08, 2021
Yosuke Higuchi, Keita Karube, Tetsuji Ogawa, Tetsunori Kobayashi

Figure 1 for Hierarchical Conditional End-to-End ASR with CTC and Multi-Granular Subword Units
Figure 2 for Hierarchical Conditional End-to-End ASR with CTC and Multi-Granular Subword Units
Figure 3 for Hierarchical Conditional End-to-End ASR with CTC and Multi-Granular Subword Units
Figure 4 for Hierarchical Conditional End-to-End ASR with CTC and Multi-Granular Subword Units
Viaarxiv icon

Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring

Sep 09, 2021
Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe

Figure 1 for Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring
Figure 2 for Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring
Figure 3 for Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring
Figure 4 for Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring
Viaarxiv icon

Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition

Jun 16, 2021
Yosuke Higuchi, Niko Moritz, Jonathan Le Roux, Takaaki Hori

Figure 1 for Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition
Figure 2 for Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition
Viaarxiv icon

The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans

Dec 23, 2020
Shinji Watanabe, Florian Boyer, Xuankai Chang, Pengcheng Guo, Tomoki Hayashi, Yosuke Higuchi, Takaaki Hori, Wen-Chin Huang, Hirofumi Inaguma, Naoyuki Kamo, Shigeki Karita, Chenda Li, Jing Shi, Aswin Shanmugam Subramanian, Wangyou Zhang

Figure 1 for The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
Figure 2 for The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans
Viaarxiv icon

Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder

Nov 06, 2020
Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe

Figure 1 for Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder
Figure 2 for Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder
Figure 3 for Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder
Figure 4 for Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder
Viaarxiv icon

Improved Mask-CTC for Non-Autoregressive End-to-End ASR

Oct 26, 2020
Yosuke Higuchi, Hirofumi Inaguma, Shinji Watanabe, Tetsuji Ogawa, Tetsunori Kobayashi

Figure 1 for Improved Mask-CTC for Non-Autoregressive End-to-End ASR
Figure 2 for Improved Mask-CTC for Non-Autoregressive End-to-End ASR
Figure 3 for Improved Mask-CTC for Non-Autoregressive End-to-End ASR
Viaarxiv icon