Picture for Tatsuya Komatsu

Tatsuya Komatsu

Audio Fingerprinting with Holographic Reduced Representations

Add code
Jun 19, 2024
Viaarxiv icon

Universal Score-based Speech Enhancement with High Content Preservation

Add code
Jun 18, 2024
Viaarxiv icon

Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers

Add code
Jan 22, 2024
Figure 1 for Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers
Figure 2 for Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers
Figure 3 for Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers
Figure 4 for Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers
Viaarxiv icon

Audio Difference Learning for Audio Captioning

Add code
Sep 15, 2023
Viaarxiv icon

PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions

Add code
Sep 15, 2023
Figure 1 for PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions
Figure 2 for PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-to-Speech Using Natural Language Descriptions
Viaarxiv icon

Neural Diarization with Non-autoregressive Intermediate Attractors

Add code
Mar 13, 2023
Figure 1 for Neural Diarization with Non-autoregressive Intermediate Attractors
Figure 2 for Neural Diarization with Non-autoregressive Intermediate Attractors
Figure 3 for Neural Diarization with Non-autoregressive Intermediate Attractors
Figure 4 for Neural Diarization with Non-autoregressive Intermediate Attractors
Viaarxiv icon

How Information on Acoustic Scenes and Sound Events Mutually Benefits Event Detection and Scene Classification Tasks

Add code
Apr 05, 2022
Figure 1 for How Information on Acoustic Scenes and Sound Events Mutually Benefits Event Detection and Scene Classification Tasks
Figure 2 for How Information on Acoustic Scenes and Sound Events Mutually Benefits Event Detection and Scene Classification Tasks
Figure 3 for How Information on Acoustic Scenes and Sound Events Mutually Benefits Event Detection and Scene Classification Tasks
Figure 4 for How Information on Acoustic Scenes and Sound Events Mutually Benefits Event Detection and Scene Classification Tasks
Viaarxiv icon

Better Intermediates Improve CTC Inference

Add code
Apr 01, 2022
Figure 1 for Better Intermediates Improve CTC Inference
Figure 2 for Better Intermediates Improve CTC Inference
Figure 3 for Better Intermediates Improve CTC Inference
Viaarxiv icon

Multi-sequence Intermediate Conditioning for CTC-based ASR

Add code
Apr 01, 2022
Figure 1 for Multi-sequence Intermediate Conditioning for CTC-based ASR
Figure 2 for Multi-sequence Intermediate Conditioning for CTC-based ASR
Figure 3 for Multi-sequence Intermediate Conditioning for CTC-based ASR
Figure 4 for Multi-sequence Intermediate Conditioning for CTC-based ASR
Viaarxiv icon

InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR

Add code
Apr 01, 2022
Figure 1 for InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Figure 2 for InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Figure 3 for InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Figure 4 for InterAug: Augmenting Noisy Intermediate Predictions for CTC-based ASR
Viaarxiv icon