Alert button
Picture for Yusuke Kida

Yusuke Kida

Alert button

Mask-CTC-based Encoder Pre-training for Streaming End-to-End Speech Recognition

Add code
Bookmark button
Alert button
Sep 09, 2023
Huaibo Zhao, Yosuke Higuchi, Yusuke Kida, Tetsuji Ogawa, Tetsunori Kobayashi

Figure 1 for Mask-CTC-based Encoder Pre-training for Streaming End-to-End Speech Recognition
Figure 2 for Mask-CTC-based Encoder Pre-training for Streaming End-to-End Speech Recognition
Figure 3 for Mask-CTC-based Encoder Pre-training for Streaming End-to-End Speech Recognition
Figure 4 for Mask-CTC-based Encoder Pre-training for Streaming End-to-End Speech Recognition
Viaarxiv icon

Neural Diarization with Non-autoregressive Intermediate Attractors

Add code
Bookmark button
Alert button
Mar 13, 2023
Yusuke Fujita, Tatsuya Komatsu, Robin Scheibler, Yusuke Kida, Tetsuji Ogawa

Figure 1 for Neural Diarization with Non-autoregressive Intermediate Attractors
Figure 2 for Neural Diarization with Non-autoregressive Intermediate Attractors
Figure 3 for Neural Diarization with Non-autoregressive Intermediate Attractors
Figure 4 for Neural Diarization with Non-autoregressive Intermediate Attractors
Viaarxiv icon

Conversation-oriented ASR with multi-look-ahead CBS architecture

Add code
Bookmark button
Alert button
Nov 02, 2022
Huaibo Zhao, Shinya Fujie, Tetsuji Ogawa, Jin Sakuma, Yusuke Kida, Tetsunori Kobayashi

Figure 1 for Conversation-oriented ASR with multi-look-ahead CBS architecture
Figure 2 for Conversation-oriented ASR with multi-look-ahead CBS architecture
Figure 3 for Conversation-oriented ASR with multi-look-ahead CBS architecture
Viaarxiv icon

Tourist Guidance Robot Based on HyperCLOVA

Add code
Bookmark button
Alert button
Oct 19, 2022
Takato Yamazaki, Katsumasa Yoshikawa, Toshiki Kawamoto, Masaya Ohagi, Tomoya Mizumoto, Shuta Ichimura, Yusuke Kida, Toshinori Sato

Figure 1 for Tourist Guidance Robot Based on HyperCLOVA
Figure 2 for Tourist Guidance Robot Based on HyperCLOVA
Figure 3 for Tourist Guidance Robot Based on HyperCLOVA
Figure 4 for Tourist Guidance Robot Based on HyperCLOVA
Viaarxiv icon

Better Intermediates Improve CTC Inference

Add code
Bookmark button
Alert button
Apr 01, 2022
Tatsuya Komatsu, Yusuke Fujita, Jaesong Lee, Lukas Lee, Shinji Watanabe, Yusuke Kida

Figure 1 for Better Intermediates Improve CTC Inference
Figure 2 for Better Intermediates Improve CTC Inference
Figure 3 for Better Intermediates Improve CTC Inference
Viaarxiv icon

Multi-sequence Intermediate Conditioning for CTC-based ASR

Add code
Bookmark button
Alert button
Apr 01, 2022
Yusuke Fujita, Tatsuya Komatsu, Yusuke Kida

Figure 1 for Multi-sequence Intermediate Conditioning for CTC-based ASR
Figure 2 for Multi-sequence Intermediate Conditioning for CTC-based ASR
Figure 3 for Multi-sequence Intermediate Conditioning for CTC-based ASR
Figure 4 for Multi-sequence Intermediate Conditioning for CTC-based ASR
Viaarxiv icon

InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR

Add code
Bookmark button
Alert button
Apr 01, 2022
Yu Nakagome, Tatsuya Komatsu, Yusuke Fujita, Shuta Ichimura, Yusuke Kida

Figure 1 for InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Figure 2 for InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Figure 3 for InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Figure 4 for InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Viaarxiv icon

Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers

Add code
Bookmark button
Alert button
Apr 21, 2021
Yusuke Kida, Tatsuya Komatsu, Masahito Togami

Figure 1 for Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers
Figure 2 for Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers
Figure 3 for Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers
Figure 4 for Label-Synchronous Speech-to-Text Alignment for ASR Using Forward and Backward Transformers
Viaarxiv icon

Speaker Selective Beamformer with Keyword Mask Estimation

Add code
Bookmark button
Alert button
Oct 25, 2018
Yusuke Kida, Dung Tran, Motoi Omachi, Toru Taniguchi, Yuya Fujita

Figure 1 for Speaker Selective Beamformer with Keyword Mask Estimation
Figure 2 for Speaker Selective Beamformer with Keyword Mask Estimation
Figure 3 for Speaker Selective Beamformer with Keyword Mask Estimation
Figure 4 for Speaker Selective Beamformer with Keyword Mask Estimation
Viaarxiv icon