
Hao Huang


A Policy-based Approach to the SpecAugment Method for Low Resource E2E ASR

Oct 16, 2022
Rui Li, Guodong Ma, Dexin Zhao, Ranran Zeng, Xiaoyu Li, Hao Huang

Analysis of the Power Imbalance in Power-Domain NOMA on Correlated Rayleigh Fading Channels

Sep 09, 2022
Shaokai Hu, Hao Huang, Guan Gui, Hikmet Sari

Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder

Jul 09, 2022
Jicheng Zhang, Yizhou Peng, Haihua Xu, Yi He, Eng Siong Chng, Hao Huang

Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition

Jul 09, 2022
Yizhou Peng, Yufei Liu, Jicheng Zhang, Haihua Xu, Yi He, Hao Huang, Eng Siong Chng

A Graph Isomorphism Network with Weighted Multiple Aggregators for Speech Emotion Recognition

Jul 03, 2022
Ying Hu, Yuwu Tang, Hao Huang, Liang He

A Multi-grained based Attention Network for Semi-supervised Sound Event Detection

Jun 21, 2022
Ying Hu, Xiujuan Zhu, Yunlong Li, Hao Huang, Liang He

An Analysis of the Power Imbalance on the Uplink of Power-Domain NOMA

May 06, 2022
Shaokai Hu, Hao Huang, Guan Gui, Hikmet Sari

Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition

Apr 08, 2022
Qianying Liu, Yuhang Yang, Zhuo Gong, Sheng Li, Chenchen Ding, Nobuaki Minematsu, Hao Huang, Fei Cheng, Sadao Kurohashi

Fine-Grained Predicates Learning for Scene Graph Generation

Apr 08, 2022
Xinyu Lyu, Lianli Gao, Yuyu Guo, Zhou Zhao, Hao Huang, Heng Tao Shen, Jingkuan Song

Leveraging Phone Mask Training for Phonetic-Reduction-Robust E2E Uyghur Speech Recognition

Apr 02, 2022
Guodong Ma, Pengfei Hu, Jian Kang, Shen Huang, Hao Huang
