Picture for Masato Mimura

Masato Mimura

Distilling the Knowledge of BERT for Sequence-to-Sequence ASR

Add code
Aug 09, 2020
Figure 1 for Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
Figure 2 for Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
Figure 3 for Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
Figure 4 for Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
Viaarxiv icon

Enhancing Monotonic Multihead Attention for Streaming ASR

Add code
May 23, 2020
Figure 1 for Enhancing Monotonic Multihead Attention for Streaming ASR
Figure 2 for Enhancing Monotonic Multihead Attention for Streaming ASR
Figure 3 for Enhancing Monotonic Multihead Attention for Streaming ASR
Figure 4 for Enhancing Monotonic Multihead Attention for Streaming ASR
Viaarxiv icon

Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition

Add code
May 19, 2020
Figure 1 for Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition
Figure 2 for Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition
Figure 3 for Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition
Figure 4 for Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition
Viaarxiv icon

CTC-synchronous Training for Monotonic Attention Model

Add code
May 17, 2020
Figure 1 for CTC-synchronous Training for Monotonic Attention Model
Figure 2 for CTC-synchronous Training for Monotonic Attention Model
Figure 3 for CTC-synchronous Training for Monotonic Attention Model
Figure 4 for CTC-synchronous Training for Monotonic Attention Model
Viaarxiv icon

Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language

Add code
Feb 19, 2020
Figure 1 for Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language
Figure 2 for Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language
Figure 3 for Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language
Figure 4 for Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language
Viaarxiv icon

Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR

Add code
Sep 22, 2019
Figure 1 for Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR
Figure 2 for Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR
Figure 3 for Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR
Figure 4 for Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR
Viaarxiv icon

Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition

Add code
Mar 31, 2019
Figure 1 for Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Figure 2 for Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Figure 3 for Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Figure 4 for Unsupervised Speech Enhancement Based on Multichannel NMF-Informed Beamforming for Noise-Robust Automatic Speech Recognition
Viaarxiv icon

Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization

Add code
Mar 19, 2018
Figure 1 for Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Figure 2 for Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Figure 3 for Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Figure 4 for Statistical Speech Enhancement Based on Probabilistic Integration of Variational Autoencoder and Non-Negative Matrix Factorization
Viaarxiv icon