Haizhou Li

xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark

Oct 13, 2023
Chen Zhang, Luis Fernando D'Haro, Chengguang Tang, Ke Shi, Guohua Tang, Haizhou Li

Disentangling Voice and Content with Self-Supervision for Speaker Recognition

Oct 02, 2023
Tianchi Liu, Kong Aik Lee, Qiongqiong Wang, Haizhou Li

Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition

Sep 27, 2023
Shuai Wang, Qibing Bai, Qi Liu, Jianwei Yu, Zhengyang Chen, Bing Han, Yanmin Qian, Haizhou Li

AceGPT, Localizing Large Language Models in Arabic

Sep 22, 2023
Huang Huang, Fei Yu, Jianqing Zhu, Xuening Sun, Hao Cheng, Dingjie Song, Zhihong Chen, Abdulmohsen Alharthi, Bang An, Ziche Liu, Zhiyi Zhang, Junying Chen, Jianquan Li, Benyou Wang, Lian Zhang, Ruoyu Sun, Xiang Wan, Haizhou Li, Jinchao Xu

FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency

Sep 22, 2023
Rui Liu, Jiatian Xi, Ziyue Jiang, Haizhou Li

Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech

Sep 21, 2023
Rui Liu, Bin Liu, Haizhou Li

USED: Universal Speaker Extraction and Diarization

Sep 19, 2023
Junyi Ao, Mehmet Sinan Yıldırım, Meng Ge, Shuai Wang, Ruijie Tao, Yanmin Qian, Liqun Deng, Longshuai Xiao, Haizhou Li

Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks

Sep 18, 2023
Zeyang Song, Jibin Wu, Malu Zhang, Mike Zheng Shou, Haizhou Li

Audio-Visual Active Speaker Extraction for Sparsely Overlapped Multi-talker Speech

Sep 15, 2023
Junjie Li, Ruijie Tao, Zexu Pan, Meng Ge, Shuai Wang, Haizhou Li
