Picture for Takuya Yoshioka

Takuya Yoshioka

Streaming Multi-Talker ASR with Token-Level Serialized Output Training

Add code
Feb 05, 2022
Figure 1 for Streaming Multi-Talker ASR with Token-Level Serialized Output Training
Figure 2 for Streaming Multi-Talker ASR with Token-Level Serialized Output Training
Figure 3 for Streaming Multi-Talker ASR with Token-Level Serialized Output Training
Figure 4 for Streaming Multi-Talker ASR with Token-Level Serialized Output Training
Viaarxiv icon

PickNet: Real-Time Channel Selection for Ad Hoc Microphone Arrays

Add code
Jan 24, 2022
Figure 1 for PickNet: Real-Time Channel Selection for Ad Hoc Microphone Arrays
Figure 2 for PickNet: Real-Time Channel Selection for Ad Hoc Microphone Arrays
Figure 3 for PickNet: Real-Time Channel Selection for Ad Hoc Microphone Arrays
Figure 4 for PickNet: Real-Time Channel Selection for Ad Hoc Microphone Arrays
Viaarxiv icon

Separating Long-Form Speech with Group-Wise Permutation Invariant Training

Add code
Nov 17, 2021
Figure 1 for Separating Long-Form Speech with Group-Wise Permutation Invariant Training
Figure 2 for Separating Long-Form Speech with Group-Wise Permutation Invariant Training
Figure 3 for Separating Long-Form Speech with Group-Wise Permutation Invariant Training
Figure 4 for Separating Long-Form Speech with Group-Wise Permutation Invariant Training
Viaarxiv icon

WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing

Add code
Oct 29, 2021
Figure 1 for WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Figure 2 for WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Figure 3 for WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Figure 4 for WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing
Viaarxiv icon

Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction

Add code
Oct 28, 2021
Figure 1 for Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction
Figure 2 for Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction
Figure 3 for Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction
Figure 4 for Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction
Viaarxiv icon

Continuous Speech Separation with Recurrent Selective Attention Network

Add code
Oct 28, 2021
Figure 1 for Continuous Speech Separation with Recurrent Selective Attention Network
Figure 2 for Continuous Speech Separation with Recurrent Selective Attention Network
Figure 3 for Continuous Speech Separation with Recurrent Selective Attention Network
Figure 4 for Continuous Speech Separation with Recurrent Selective Attention Network
Viaarxiv icon

VarArray: Array-Geometry-Agnostic Continuous Speech Separation

Add code
Oct 26, 2021
Figure 1 for VarArray: Array-Geometry-Agnostic Continuous Speech Separation
Figure 2 for VarArray: Array-Geometry-Agnostic Continuous Speech Separation
Figure 3 for VarArray: Array-Geometry-Agnostic Continuous Speech Separation
Viaarxiv icon

One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement

Add code
Oct 20, 2021
Figure 1 for One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement
Figure 2 for One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement
Figure 3 for One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement
Figure 4 for One model to enhance them all: array geometry agnostic multi-channel personalized speech enhancement
Viaarxiv icon

Personalized Speech Enhancement: New Models and Comprehensive Evaluation

Add code
Oct 18, 2021
Figure 1 for Personalized Speech Enhancement: New Models and Comprehensive Evaluation
Figure 2 for Personalized Speech Enhancement: New Models and Comprehensive Evaluation
Figure 3 for Personalized Speech Enhancement: New Models and Comprehensive Evaluation
Viaarxiv icon

All-neural beamformer for continuous speech separation

Add code
Oct 13, 2021
Figure 1 for All-neural beamformer for continuous speech separation
Figure 2 for All-neural beamformer for continuous speech separation
Figure 3 for All-neural beamformer for continuous speech separation
Figure 4 for All-neural beamformer for continuous speech separation
Viaarxiv icon