Sanjeev Khudanpur

MERLIon CCS Challenge Evaluation Plan

May 31, 2023
Via arXiv

Investigating model performance in language identification: beyond simple error statistics

May 30, 2023
Via arXiv

MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization

May 30, 2023
Via arXiv

GPU-accelerated Guided Source Separation for Meeting Transcription

Dec 10, 2022
Via arXiv

EURO: ESPnet Unsupervised ASR Open-source Toolkit

Dec 01, 2022
Via arXiv

Adapting self-supervised models to multi-talker speech recognition using speaker embeddings

Nov 01, 2022
Via arXiv

Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization

Oct 26, 2022
Via arXiv

Defense against Adversarial Attacks on Hybrid Speech Recognition using Joint Adversarial Fine-tuning with Denoiser

Apr 08, 2022
Via arXiv

PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification

Mar 31, 2022
Via arXiv

Investigating self-supervised learning for speech enhancement and separation

Mar 15, 2022
Via arXiv