Hayato Futami

Phoneme-aware Encoding for Prefix-tree-based Contextual ASR

Dec 15, 2023
Hayato Futami, Emiru Tsunoo, Yosuke Kashiwagi, Hiroaki Ogawa, Siddhant Arora, Shinji Watanabe

Via arXiv

UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network

Oct 04, 2023
Siddhant Arora, Hayato Futami, Jee-weon Jung, Yifan Peng, Roshan Sharma, Yosuke Kashiwagi, Emiru Tsunoo, Shinji Watanabe

Via arXiv

Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation

Sep 16, 2023
Emiru Tsunoo, Hayato Futami, Yosuke Kashiwagi, Siddhant Arora, Shinji Watanabe

Via arXiv

Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition

Jul 24, 2023
Emiru Tsunoo, Hayato Futami, Yosuke Kashiwagi, Siddhant Arora, Shinji Watanabe

Via arXiv

Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding

Jul 20, 2023
Siddhant Arora, Hayato Futami, Yosuke Kashiwagi, Emiru Tsunoo, Brian Yan, Shinji Watanabe

Via arXiv

Tensor decomposition for minimization of E2E SLU model toward on-device processing

Jun 02, 2023
Yosuke Kashiwagi, Siddhant Arora, Hayato Futami, Jessica Huynh, Shih-Lun Wu, Yifan Peng, Brian Yan, Emiru Tsunoo, Shinji Watanabe

Via arXiv

The Pipeline System of ASR and NLU with MLM-based Data Augmentation toward STOP Low-resource Challenge

May 11, 2023
Hayato Futami, Jessica Huynh, Siddhant Arora, Shih-Lun Wu, Yosuke Kashiwagi, Yifan Peng, Brian Yan, Emiru Tsunoo, Shinji Watanabe

Via arXiv

A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge

May 06, 2023
Siddhant Arora, Hayato Futami, Shih-Lun Wu, Jessica Huynh, Yifan Peng, Yosuke Kashiwagi, Emiru Tsunoo, Brian Yan, Shinji Watanabe

Via arXiv