Eng Siong Chng

Aligning Speech to Languages to Enhance Code-switching Speech Recognition

Mar 09, 2024
Hexin Liu, Xiangyu Zhang, Leibny Paola Garcia, Andy W. H. Khong, Eng Siong Chng, Shinji Watanabe

Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model

Feb 16, 2024
Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola Garcia, Eng Siong Chng, Lina Yao

GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators

Feb 10, 2024
Yuchen Hu, Chen Chen, Chao-Han Huck Yang, Ruizhe Li, Dong Zhang, Zhehuai Chen, Eng Siong Chng

ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

Jan 07, 2024
He Wang, Pengcheng Guo, Yue Li, Ao Zhang, Jiayao Sun, Lei Xie, Wei Chen, Pan Zhou, Hui Bu, Xin Xu, Binbin Zhang, Zhuo Chen, Jian Wu, Longbiao Wang, Eng Siong Chng, Sun Li

Noise robust distillation of self-supervised speech models via correlation metrics

Dec 19, 2023
Fabian Ritter-Gutierrez, Kuan-Po Huang, Dianwen Ng, Jeremy H. M. Wong, Hung-yi Lee, Eng Siong Chng, Nancy F. Chen

Adapting OpenAI's Whisper for Speech Recognition on Code-Switch Mandarin-English SEAME and ASRU2019 Datasets

Nov 29, 2023
Yuhang Yang, Yizhou Peng, Xionghu Zhong, Hao Huang, Eng Siong Chng

Generative error correction for code-switching speech recognition using large language models

Oct 17, 2023
Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Hexin Liu, Sabato Marco Siniscalchi, Eng Siong Chng

HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models

Oct 16, 2023
Chen Chen, Yuchen Hu, Chao-Han Huck Yang, Sabato Marco Siniscalchi, Pin-Yu Chen, Eng Siong Chng

Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Automatic Speaker Verification

Sep 26, 2023
Duc-Tuan Truong, Ruijie Tao, Jia Qi Yip, Kong Aik Lee, Eng Siong Chng
