Tatsuya Kawahara

Diffusion-Based Speech Enhancement with Joint Generative and Predictive Decoders

May 18, 2023

Time-domain Speech Enhancement Assisted by Multi-resolution Frequency Encoder and Decoder

Mar 26, 2023

I Know Your Feelings Before You Do: Predicting Future Affective Reactions in Human-Computer Dialogue

Mar 17, 2023

Alzheimer's Dementia Detection through Spontaneous Dialogue with Proactive Robotic Listeners

Nov 15, 2022

Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM

Sep 08, 2022

Distilling the Knowledge of BERT for CTC-based ASR

Sep 05, 2022

End-to-end Speech-to-Punctuated-Text Recognition

Jul 07, 2022

ASR Rescoring and Confidence Estimation with ELECTRA

Oct 05, 2021

Non-autoregressive End-to-end Speech Translation with Parallel Autoregressive Rescoring

Sep 09, 2021

VAD-free Streaming Hybrid CTC/Attention ASR for Unsegmented Recording

Jul 15, 2021