Alert button
Picture for Haihua Xu

Haihua Xu

Alert button

Internal Language Model Estimation based Adaptive Language Model Fusion for Domain Adaptation

Add code
Bookmark button
Alert button
Nov 02, 2022
Rao Ma, Xiaobo Wu, Jin Qiu, Yanan Qin, Haihua Xu, Peihao Wu, Zejun Ma

Figure 1 for Internal Language Model Estimation based Adaptive Language Model Fusion for Domain Adaptation
Figure 2 for Internal Language Model Estimation based Adaptive Language Model Fusion for Domain Adaptation
Figure 3 for Internal Language Model Estimation based Adaptive Language Model Fusion for Domain Adaptation
Figure 4 for Internal Language Model Estimation based Adaptive Language Model Fusion for Domain Adaptation
Viaarxiv icon

Speech-text based multi-modal training with bidirectional attention for improved speech recognition

Add code
Bookmark button
Alert button
Nov 01, 2022
Yuhang Yang, Haihua Xu, Hao Huang, Eng Siong Chng, Sheng Li

Figure 1 for Speech-text based multi-modal training with bidirectional attention for improved speech recognition
Figure 2 for Speech-text based multi-modal training with bidirectional attention for improved speech recognition
Figure 3 for Speech-text based multi-modal training with bidirectional attention for improved speech recognition
Figure 4 for Speech-text based multi-modal training with bidirectional attention for improved speech recognition
Viaarxiv icon

Improving short-video speech recognition using random utterance concatenation

Add code
Bookmark button
Alert button
Oct 28, 2022
Haihua Xu, Van Tung Pham, Yerbolat Khassanov, Yist Lin, Tao Han, Tze Yuan Chong, Yi He, Zejun Ma

Figure 1 for Improving short-video speech recognition using random utterance concatenation
Figure 2 for Improving short-video speech recognition using random utterance concatenation
Figure 3 for Improving short-video speech recognition using random utterance concatenation
Figure 4 for Improving short-video speech recognition using random utterance concatenation
Viaarxiv icon

Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization

Add code
Bookmark button
Alert button
Oct 26, 2022
Hexin Liu, Haihua Xu, Leibny Paola Garcia, Andy W. H. Khong, Yi He, Sanjeev Khudanpur

Figure 1 for Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Figure 2 for Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Figure 3 for Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Figure 4 for Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization
Viaarxiv icon

Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder

Add code
Bookmark button
Alert button
Jul 09, 2022
Jicheng Zhang, Yizhou Peng, Haihua Xu, Yi He, Eng Siong Chng, Hao Huang

Figure 1 for Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder
Figure 2 for Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder
Figure 3 for Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder
Figure 4 for Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder
Viaarxiv icon

Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition

Add code
Bookmark button
Alert button
Jul 09, 2022
Yizhou Peng, Yufei Liu, Jicheng Zhang, Haihua Xu, Yi He, Hao Huang, Eng Siong Chng

Figure 1 for Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition
Figure 2 for Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition
Figure 3 for Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition
Figure 4 for Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition
Viaarxiv icon

Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR

Add code
Bookmark button
Alert button
Jan 26, 2022
Yufei Liu, Rao Ma, Haihua Xu, Yi He, Zejun Ma, Weibin Zhang

Figure 1 for Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR
Figure 2 for Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR
Figure 3 for Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR
Figure 4 for Internal language model estimation through explicit context vector learning for attention-based encoder-decoder ASR
Viaarxiv icon

Minimum word error training for non-autoregressive Transformer-based code-switching ASR

Add code
Bookmark button
Alert button
Oct 07, 2021
Yizhou Peng, Jicheng Zhang, Haihua Xu, Hao Huang, Eng Siong Chng

Figure 1 for Minimum word error training for non-autoregressive Transformer-based code-switching ASR
Figure 2 for Minimum word error training for non-autoregressive Transformer-based code-switching ASR
Figure 3 for Minimum word error training for non-autoregressive Transformer-based code-switching ASR
Figure 4 for Minimum word error training for non-autoregressive Transformer-based code-switching ASR
Viaarxiv icon

Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech

Add code
Bookmark button
Alert button
Jul 22, 2021
Duo Ma, Nana Hou, Van Tung Pham, Haihua Xu, Eng Siong Chng

Figure 1 for Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech
Figure 2 for Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech
Figure 3 for Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech
Figure 4 for Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech
Viaarxiv icon

E2E-based Multi-task Learning Approach to Joint Speech and Accent Recognition

Add code
Bookmark button
Alert button
Jun 15, 2021
Jicheng Zhang, Yizhou Peng, Pham Van Tung, Haihua Xu, Hao Huang, Eng Siong Chng

Figure 1 for E2E-based Multi-task Learning Approach to Joint Speech and Accent Recognition
Figure 2 for E2E-based Multi-task Learning Approach to Joint Speech and Accent Recognition
Figure 3 for E2E-based Multi-task Learning Approach to Joint Speech and Accent Recognition
Figure 4 for E2E-based Multi-task Learning Approach to Joint Speech and Accent Recognition
Viaarxiv icon