Takaaki Hori

Dual Causal/Non-Causal Self-Attention for Streaming End-to-End Speech Recognition

Jul 02, 2021

Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition

Jun 16, 2021

Advanced Long-context End-to-end Speech Recognition Using Context-expanded Transformers

Apr 19, 2021

Capturing Multi-Resolution Context by Dilated Self-Attention

Apr 07, 2021

The 2020 ESPnet update: new features, broadened applications, performance improvements, and future plans

Dec 23, 2020

Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training

Nov 26, 2020

Semi-Supervised Speech Recognition via Graph-based Temporal Classification

Oct 29, 2020

Multi-Pass Transformer for Machine Translation

Sep 23, 2020

Unsupervised Speaker Adaptation using Attention-based Speaker Memory for End-to-End ASR

Feb 14, 2020

Streaming automatic speech recognition with the transformer model

Jan 09, 2020