Alert button
Picture for Leibny Paola Garcia

Leibny Paola Garcia

Alert button

Aligning Speech to Languages to Enhance Code-switching Speech Recognition

Mar 09, 2024
Hexin Liu, Xiangyu Zhang, Leibny Paola Garcia, Andy W. H. Khong, Eng Siong Chng, Shinji Watanabe

Viaarxiv icon

Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model

Feb 16, 2024
Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola Garcia, Eng Siong Chng, Lina Yao

Viaarxiv icon

A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors

Nov 27, 2023
Shuyue Stella Li, Beining Xu, Xiangyu Zhang, Hexin Liu, Wenhan Chao, Leibny Paola Garcia

Viaarxiv icon

Enhancing Code-switching Speech Recognition with Interactive Language Biases

Sep 29, 2023
Hexin Liu, Leibny Paola Garcia, Xiangyu Zhang, Andy W. H. Khong, Sanjeev Khudanpur

Figure 1 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 2 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 3 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Figure 4 for Enhancing Code-switching Speech Recognition with Interactive Language Biases
Viaarxiv icon

Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex

Sep 26, 2023
Ruixing Liang, Xiangyu Zhang, Qiong Li, Lai Wei, Hexin Liu, Avisha Kumar, Kelley M. Kempski Leadingham, Joshua Punnoose, Leibny Paola Garcia, Amir Manbachi

Figure 1 for Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex
Figure 2 for Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex
Figure 3 for Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex
Figure 4 for Unidirectional brain-computer interface: Artificial neural network encoding natural images to fMRI response in the visual cortex
Viaarxiv icon

Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts

Jun 01, 2023
Dongji Gao, Matthew Wiesner, Hainan Xu, Leibny Paola Garcia, Daniel Povey, Sanjeev Khudanpur

Figure 1 for Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts
Figure 2 for Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts
Figure 3 for Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts
Figure 4 for Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts
Viaarxiv icon

EURO: ESPnet Unsupervised ASR Open-source Toolkit

Dec 01, 2022
Dongji Gao, Jiatong Shi, Shun-Po Chuang, Leibny Paola Garcia, Hung-yi Lee, Shinji Watanabe, Sanjeev Khudanpur

Figure 1 for EURO: ESPnet Unsupervised ASR Open-source Toolkit
Figure 2 for EURO: ESPnet Unsupervised ASR Open-source Toolkit
Figure 3 for EURO: ESPnet Unsupervised ASR Open-source Toolkit
Figure 4 for EURO: ESPnet Unsupervised ASR Open-source Toolkit
Viaarxiv icon

Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization

Oct 26, 2022
Hexin Liu, Haihua Xu, Leibny Paola Garcia, Andy W. H. Khong, Yi He, Sanjeev Khudanpur

Figure 1 for Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Figure 2 for Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Figure 3 for Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Figure 4 for Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Viaarxiv icon

Q-LSTM Language Model -- Decentralized Quantum Multilingual Pre-Trained Language Model for Privacy Protection

Oct 06, 2022
Shuyue Stella Li, Xiangyu Zhang, Shu Zhou, Hongchao Shu, Ruixing Liang, Hexin Liu, Leibny Paola Garcia

Figure 1 for Q-LSTM Language Model -- Decentralized Quantum Multilingual Pre-Trained Language Model for Privacy Protection
Figure 2 for Q-LSTM Language Model -- Decentralized Quantum Multilingual Pre-Trained Language Model for Privacy Protection
Figure 3 for Q-LSTM Language Model -- Decentralized Quantum Multilingual Pre-Trained Language Model for Privacy Protection
Figure 4 for Q-LSTM Language Model -- Decentralized Quantum Multilingual Pre-Trained Language Model for Privacy Protection
Viaarxiv icon

Investigating self-supervised learning for lyrics recognition

Sep 28, 2022
Xiangyu Zhang, Zhanhong He, Shuyue Stella Li, Roberto Togneri, Leibny Paola Garcia

Figure 1 for Investigating self-supervised learning for lyrics recognition
Figure 2 for Investigating self-supervised learning for lyrics recognition
Figure 3 for Investigating self-supervised learning for lyrics recognition
Figure 4 for Investigating self-supervised learning for lyrics recognition
Viaarxiv icon