Picture for Shota Horiguchi

Shota Horiguchi

SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling

Add code
Jul 01, 2024
Figure 1 for SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling
Figure 2 for SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling
Figure 3 for SpeakerBeam-SS: Real-time Target Speaker Extraction with Lightweight Conv-TasNet and State Space Modeling
Viaarxiv icon

Factor-Conditioned Speaking-Style Captioning

Add code
Jun 27, 2024
Viaarxiv icon

Thresholding Data Shapley for Data Cleansing Using Multi-Armed Bandits

Add code
Feb 13, 2024
Viaarxiv icon

Streaming Active Learning for Regression Problems Using Regression via Classification

Add code
Sep 02, 2023
Figure 1 for Streaming Active Learning for Regression Problems Using Regression via Classification
Figure 2 for Streaming Active Learning for Regression Problems Using Regression via Classification
Figure 3 for Streaming Active Learning for Regression Problems Using Regression via Classification
Viaarxiv icon

CAPTDURE: Captioned Sound Dataset of Single Sources

Add code
May 28, 2023
Figure 1 for CAPTDURE: Captioned Sound Dataset of Single Sources
Figure 2 for CAPTDURE: Captioned Sound Dataset of Single Sources
Figure 3 for CAPTDURE: Captioned Sound Dataset of Single Sources
Figure 4 for CAPTDURE: Captioned Sound Dataset of Single Sources
Viaarxiv icon

Spoofing Attacker Also Benefits from Self-Supervised Pretrained Model

Add code
May 24, 2023
Figure 1 for Spoofing Attacker Also Benefits from Self-Supervised Pretrained Model
Figure 2 for Spoofing Attacker Also Benefits from Self-Supervised Pretrained Model
Figure 3 for Spoofing Attacker Also Benefits from Self-Supervised Pretrained Model
Figure 4 for Spoofing Attacker Also Benefits from Self-Supervised Pretrained Model
Viaarxiv icon

Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization

Add code
Oct 07, 2022
Figure 1 for Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization
Figure 2 for Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization
Figure 3 for Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization
Figure 4 for Mutual Learning of Single- and Multi-Channel End-to-End Neural Diarization
Viaarxiv icon

Updating Only Encoders Prevents Catastrophic Forgetting of End-to-End ASR Models

Add code
Jul 01, 2022
Figure 1 for Updating Only Encoders Prevents Catastrophic Forgetting of End-to-End ASR Models
Figure 2 for Updating Only Encoders Prevents Catastrophic Forgetting of End-to-End ASR Models
Figure 3 for Updating Only Encoders Prevents Catastrophic Forgetting of End-to-End ASR Models
Figure 4 for Updating Only Encoders Prevents Catastrophic Forgetting of End-to-End ASR Models
Viaarxiv icon

Online Neural Diarization of Unlimited Numbers of Speakers

Add code
Jun 06, 2022
Figure 1 for Online Neural Diarization of Unlimited Numbers of Speakers
Figure 2 for Online Neural Diarization of Unlimited Numbers of Speakers
Figure 3 for Online Neural Diarization of Unlimited Numbers of Speakers
Figure 4 for Online Neural Diarization of Unlimited Numbers of Speakers
Viaarxiv icon

Rethinking Fano's Inequality in Ensemble Learning

Add code
May 25, 2022
Figure 1 for Rethinking Fano's Inequality in Ensemble Learning
Figure 2 for Rethinking Fano's Inequality in Ensemble Learning
Figure 3 for Rethinking Fano's Inequality in Ensemble Learning
Figure 4 for Rethinking Fano's Inequality in Ensemble Learning
Viaarxiv icon