Hirofumi Inaguma

VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording

Jul 15, 2021
Via arXiv

StableEmit: Selection Probability Discount for Reducing Emission Latency of Streaming Monotonic Attention ASR

Jul 15, 2021
Via arXiv

ESPnet-ST IWSLT 2021 Offline Speech Translation System

Jul 06, 2021
Via arXiv

Source and Target Bidirectional Knowledge Distillation for End-to-end Speech Translation

Apr 13, 2021
Via arXiv

Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition

Feb 28, 2021
Via arXiv

The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans

Dec 23, 2020
Via arXiv

Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder

Nov 06, 2020
Via arXiv

Improved Mask-CTC for Non-Autoregressive End-to-End ASR

Oct 26, 2020
Via arXiv

Distilling the Knowledge of BERT for Sequence-to-Sequence ASR

Aug 09, 2020
Via arXiv

Enhancing Monotonic Multihead Attention for Streaming ASR

May 23, 2020
Via arXiv