Alert button
Picture for Yusuke Fujita

Yusuke Fujita

Alert button

Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers

Add code
Bookmark button
Alert button
Jan 22, 2024
Michael Hentschel, Yuta Nishikawa, Tatsuya Komatsu, Yusuke Fujita

Viaarxiv icon

Audio Difference Learning for Audio Captioning

Add code
Bookmark button
Alert button
Sep 15, 2023
Tatsuya Komatsu, Yusuke Fujita, Kazuya Takeda, Tomoki Toda

Viaarxiv icon

Neural Diarization with Non-autoregressive Intermediate Attractors

Add code
Bookmark button
Alert button
Mar 13, 2023
Yusuke Fujita, Tatsuya Komatsu, Robin Scheibler, Yusuke Kida, Tetsuji Ogawa

Figure 1 for Neural Diarization with Non-autoregressive Intermediate Attractors
Figure 2 for Neural Diarization with Non-autoregressive Intermediate Attractors
Figure 3 for Neural Diarization with Non-autoregressive Intermediate Attractors
Figure 4 for Neural Diarization with Non-autoregressive Intermediate Attractors
Viaarxiv icon

Better Intermediates Improve CTC Inference

Add code
Bookmark button
Alert button
Apr 01, 2022
Tatsuya Komatsu, Yusuke Fujita, Jaesong Lee, Lukas Lee, Shinji Watanabe, Yusuke Kida

Figure 1 for Better Intermediates Improve CTC Inference
Figure 2 for Better Intermediates Improve CTC Inference
Figure 3 for Better Intermediates Improve CTC Inference
Viaarxiv icon

Multi-sequence Intermediate Conditioning for CTC-based ASR

Add code
Bookmark button
Alert button
Apr 01, 2022
Yusuke Fujita, Tatsuya Komatsu, Yusuke Kida

Figure 1 for Multi-sequence Intermediate Conditioning for CTC-based ASR
Figure 2 for Multi-sequence Intermediate Conditioning for CTC-based ASR
Figure 3 for Multi-sequence Intermediate Conditioning for CTC-based ASR
Figure 4 for Multi-sequence Intermediate Conditioning for CTC-based ASR
Viaarxiv icon

InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR

Add code
Bookmark button
Alert button
Apr 01, 2022
Yu Nakagome, Tatsuya Komatsu, Yusuke Fujita, Shuta Ichimura, Yusuke Kida

Figure 1 for InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Figure 2 for InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Figure 3 for InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Figure 4 for InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Viaarxiv icon

Encoder-Decoder Based Attractor Calculation for End-to-End Neural Diarization

Add code
Bookmark button
Alert button
Jun 20, 2021
Shota Horiguchi, Yusuke Fujita, Shinji Watanabe, Yawen Xue, Paola Garcia

Figure 1 for Encoder-Decoder Based Attractor Calculation for End-to-End Neural Diarization
Figure 2 for Encoder-Decoder Based Attractor Calculation for End-to-End Neural Diarization
Figure 3 for Encoder-Decoder Based Attractor Calculation for End-to-End Neural Diarization
Figure 4 for Encoder-Decoder Based Attractor Calculation for End-to-End Neural Diarization
Viaarxiv icon

Semi-Supervised Training with Pseudo-Labeling for End-to-End Neural Diarization

Add code
Bookmark button
Alert button
Jun 09, 2021
Yuki Takashima, Yusuke Fujita, Shota Horiguchi, Shinji Watanabe, Paola García, Kenji Nagamatsu

Figure 1 for Semi-Supervised Training with Pseudo-Labeling for End-to-End Neural Diarization
Figure 2 for Semi-Supervised Training with Pseudo-Labeling for End-to-End Neural Diarization
Figure 3 for Semi-Supervised Training with Pseudo-Labeling for End-to-End Neural Diarization
Figure 4 for Semi-Supervised Training with Pseudo-Labeling for End-to-End Neural Diarization
Viaarxiv icon

End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection

Add code
Bookmark button
Alert button
Jun 08, 2021
Yuki Takashima, Yusuke Fujita, Shinji Watanabe, Shota Horiguchi, Paola García, Kenji Nagamatsu

Figure 1 for End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection
Figure 2 for End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection
Figure 3 for End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection
Figure 4 for End-to-End Speaker Diarization Conditioned on Speech Activity and Overlap Detection
Viaarxiv icon

The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap

Add code
Bookmark button
Alert button
Feb 02, 2021
Shota Horiguchi, Nelson Yalta, Paola Garcia, Yuki Takashima, Yawen Xue, Desh Raj, Zili Huang, Yusuke Fujita, Shinji Watanabe, Sanjeev Khudanpur

Figure 1 for The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap
Figure 2 for The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap
Figure 3 for The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap
Figure 4 for The Hitachi-JHU DIHARD III System: Competitive End-to-End Neural Diarization and X-Vector Clustering Systems Combined by DOVER-Lap
Viaarxiv icon