Picture for Andy W. H. Khong

Andy W. H. Khong

Aligning Speech to Languages to Enhance Code-switching Speech Recognition

Mar 09, 2024
Figure 1 for Aligning Speech to Languages to Enhance Code-switching Speech Recognition
Figure 2 for Aligning Speech to Languages to Enhance Code-switching Speech Recognition
Figure 3 for Aligning Speech to Languages to Enhance Code-switching Speech Recognition
Figure 4 for Aligning Speech to Languages to Enhance Code-switching Speech Recognition
Viaarxiv icon

Enhancing Code-switching Speech Recognition with Interactive Language Biases

Add code
Sep 29, 2023
Figure 1 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 2 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 3 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 4 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Viaarxiv icon

MERLIon CCS Challenge Evaluation Plan

Add code
May 31, 2023
Figure 1 for MERLIon CCS Challenge Evaluation Plan
Figure 2 for MERLIon CCS Challenge Evaluation Plan
Figure 3 for MERLIon CCS Challenge Evaluation Plan
Figure 4 for MERLIon CCS Challenge Evaluation Plan
Viaarxiv icon

Investigating model performance in language identification: beyond simple error statistics

Add code
May 30, 2023
Figure 1 for Investigating model performance in language identification: beyond simple error statistics
Figure 2 for Investigating model performance in language identification: beyond simple error statistics
Figure 3 for Investigating model performance in language identification: beyond simple error statistics
Figure 4 for Investigating model performance in language identification: beyond simple error statistics
Viaarxiv icon

MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization

Add code
May 30, 2023
Figure 1 for MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization
Figure 2 for MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization
Figure 3 for MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization
Figure 4 for MERLIon CCS Challenge: A English-Mandarin code-switching child-directed speech corpus for language identification and diarization
Viaarxiv icon

Improving performance of real-time full-band blind packet-loss concealment with predictive network

Add code
Nov 10, 2022
Figure 1 for Improving performance of real-time full-band blind packet-loss concealment with predictive network
Figure 2 for Improving performance of real-time full-band blind packet-loss concealment with predictive network
Figure 3 for Improving performance of real-time full-band blind packet-loss concealment with predictive network
Figure 4 for Improving performance of real-time full-band blind packet-loss concealment with predictive network
Viaarxiv icon

Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization

Add code
Oct 26, 2022
Figure 1 for Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Figure 2 for Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Figure 3 for Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Figure 4 for Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Viaarxiv icon

PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification

Add code
Mar 31, 2022
Figure 1 for PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification
Figure 2 for PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification
Figure 3 for PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification
Figure 4 for PHO-LID: A Unified Model Incorporating Acoustic-Phonetic and Phonotactic Information for Language Identification
Viaarxiv icon

Enhance Language Identification using Dual-mode Model with Knowledge Distillation

Mar 07, 2022
Figure 1 for Enhance Language Identification using Dual-mode Model with Knowledge Distillation
Figure 2 for Enhance Language Identification using Dual-mode Model with Knowledge Distillation
Figure 3 for Enhance Language Identification using Dual-mode Model with Knowledge Distillation
Figure 4 for Enhance Language Identification using Dual-mode Model with Knowledge Distillation
Viaarxiv icon

TUNet: A Block-online Bandwidth Extension Model based on Transformers and Self-supervised Pretraining

Add code
Oct 26, 2021
Figure 1 for TUNet: A Block-online Bandwidth Extension Model based on Transformers and Self-supervised Pretraining
Figure 2 for TUNet: A Block-online Bandwidth Extension Model based on Transformers and Self-supervised Pretraining
Figure 3 for TUNet: A Block-online Bandwidth Extension Model based on Transformers and Self-supervised Pretraining
Figure 4 for TUNet: A Block-online Bandwidth Extension Model based on Transformers and Self-supervised Pretraining
Viaarxiv icon