Alert button
Picture for Haizhou Li

Haizhou Li

Alert button

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

Add code
Bookmark button
Alert button
Mar 29, 2022
Rui Wang, Qibing Bai, Junyi Ao, Long Zhou, Zhixiang Xiong, Zhihua Wei, Yu Zhang, Tom Ko, Haizhou Li

Figure 1 for LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Figure 2 for LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Figure 3 for LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Figure 4 for LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
Viaarxiv icon

Just Rank: Rethinking Evaluation with Word and Sentence Similarities

Add code
Bookmark button
Alert button
Mar 21, 2022
Bin Wang, C. -C. Jay Kuo, Haizhou Li

Figure 1 for Just Rank: Rethinking Evaluation with Word and Sentence Similarities
Figure 2 for Just Rank: Rethinking Evaluation with Word and Sentence Similarities
Figure 3 for Just Rank: Rethinking Evaluation with Word and Sentence Similarities
Figure 4 for Just Rank: Rethinking Evaluation with Word and Sentence Similarities
Viaarxiv icon

ADD 2022: the First Audio Deep Synthesis Detection Challenge

Add code
Bookmark button
Alert button
Feb 26, 2022
Jiangyan Yi, Ruibo Fu, Jianhua Tao, Shuai Nie, Haoxin Ma, Chenglong Wang, Tao Wang, Zhengkun Tian, Ye Bai, Cunhang Fan, Shan Liang, Shiming Wang, Shuai Zhang, Xinrui Yan, Le Xu, Zhengqi Wen, Haizhou Li, Zheng Lian, Bin Liu

Figure 1 for ADD 2022: the First Audio Deep Synthesis Detection Challenge
Figure 2 for ADD 2022: the First Audio Deep Synthesis Detection Challenge
Figure 3 for ADD 2022: the First Audio Deep Synthesis Detection Challenge
Figure 4 for ADD 2022: the First Audio Deep Synthesis Detection Challenge
Viaarxiv icon

L-SpEx: Localized Target Speaker Extraction

Add code
Bookmark button
Alert button
Feb 21, 2022
Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li

Figure 1 for L-SpEx: Localized Target Speaker Extraction
Figure 2 for L-SpEx: Localized Target Speaker Extraction
Figure 3 for L-SpEx: Localized Target Speaker Extraction
Figure 4 for L-SpEx: Localized Target Speaker Extraction
Viaarxiv icon

MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances

Add code
Bookmark button
Alert button
Feb 15, 2022
Tianchi Liu, Rohan Kumar Das, Kong Aik Lee, Haizhou Li

Figure 1 for MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances
Figure 2 for MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances
Figure 3 for MFA: TDNN with Multi-scale Frequency-channel Attention for Text-independent Speaker Verification with Short Utterances
Viaarxiv icon

Noise-robust voice conversion with domain adversarial training

Add code
Bookmark button
Alert button
Jan 26, 2022
Hongqiang Du, Lei Xie, Haizhou Li

Figure 1 for Noise-robust voice conversion with domain adversarial training
Figure 2 for Noise-robust voice conversion with domain adversarial training
Figure 3 for Noise-robust voice conversion with domain adversarial training
Figure 4 for Noise-robust voice conversion with domain adversarial training
Viaarxiv icon

Emotion Intensity and its Control for Emotional Voice Conversion

Add code
Bookmark button
Alert button
Jan 10, 2022
Kun Zhou, Berrak Sisman, Rajib Rana, Björn W. Schuller, Haizhou Li

Figure 1 for Emotion Intensity and its Control for Emotional Voice Conversion
Figure 2 for Emotion Intensity and its Control for Emotional Voice Conversion
Figure 3 for Emotion Intensity and its Control for Emotional Voice Conversion
Figure 4 for Emotion Intensity and its Control for Emotional Voice Conversion
Viaarxiv icon