Picture for Masato Mimura

Masato Mimura

SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling

Add code
Jul 01, 2024
Figure 1 for SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling
Figure 2 for SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling
Figure 3 for SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling
Viaarxiv icon

Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder

Add code
Mar 26, 2023
Figure 1 for Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder
Figure 2 for Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder
Figure 3 for Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder
Figure 4 for Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder
Viaarxiv icon

Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM

Add code
Sep 08, 2022
Figure 1 for Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
Figure 2 for Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
Figure 3 for Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
Figure 4 for Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM
Viaarxiv icon

Distilling the Knowledge of BERT for CTC-based ASR

Add code
Sep 05, 2022
Figure 1 for Distilling the Knowledge of BERT for CTC-based ASR
Figure 2 for Distilling the Knowledge of BERT for CTC-based ASR
Figure 3 for Distilling the Knowledge of BERT for CTC-based ASR
Figure 4 for Distilling the Knowledge of BERT for CTC-based ASR
Viaarxiv icon

ASR Rescoring and Confidence Estimation with ELECTRA

Add code
Oct 05, 2021
Figure 1 for ASR Rescoring and Confidence Estimation with ELECTRA
Figure 2 for ASR Rescoring and Confidence Estimation with ELECTRA
Figure 3 for ASR Rescoring and Confidence Estimation with ELECTRA
Figure 4 for ASR Rescoring and Confidence Estimation with ELECTRA
Viaarxiv icon

Distilling the Knowledge of BERT for Sequence-to-Sequence ASR

Add code
Aug 09, 2020
Figure 1 for Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
Figure 2 for Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
Figure 3 for Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
Figure 4 for Distilling the Knowledge of BERT for Sequence-to-Sequence ASR
Viaarxiv icon

Enhancing Monotonic Multihead Attention for Streaming ASR

Add code
May 23, 2020
Figure 1 for Enhancing Monotonic Multihead Attention for Streaming ASR
Figure 2 for Enhancing Monotonic Multihead Attention for Streaming ASR
Figure 3 for Enhancing Monotonic Multihead Attention for Streaming ASR
Figure 4 for Enhancing Monotonic Multihead Attention for Streaming ASR
Viaarxiv icon

Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition

Add code
May 19, 2020
Figure 1 for Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition
Figure 2 for Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition
Figure 3 for Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition
Figure 4 for Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition
Viaarxiv icon

CTC-synchronous Training for Monotonic Attention Model

Add code
May 17, 2020
Figure 1 for CTC-synchronous Training for Monotonic Attention Model
Figure 2 for CTC-synchronous Training for Monotonic Attention Model
Figure 3 for CTC-synchronous Training for Monotonic Attention Model
Figure 4 for CTC-synchronous Training for Monotonic Attention Model
Viaarxiv icon

Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language

Add code
Feb 19, 2020
Figure 1 for Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language
Figure 2 for Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language
Figure 3 for Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language
Figure 4 for Speech Corpus of Ainu Folklore and End-to-end Speech Recognition for Ainu Language
Viaarxiv icon