Hirofumi Inaguma

CTC-synchronous Training for Monotonic Attention Model

May 17, 2020

Minimum Latency Training Strategies for Streaming Sequence-to-Sequence ASR

May 15, 2020

End-to-end speech-to-dialog-act recognition

Apr 23, 2020

ESPnet-ST: All-in-One Speech Translation Toolkit

Apr 21, 2020

Multilingual End-to-End Speech Translation

Oct 31, 2019

A Comparative Study on Transformer vs RNN in Speech Applications

Sep 28, 2019

Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR

Sep 22, 2019

Transfer learning of language-independent end-to-end ASR with language model fusion

Nov 06, 2018