Haizhou Li

Voice Conversion Augmentation for Speaker Recognition on Defective Datasets

Apr 01, 2024
Ruijie Tao, Zhan Shi, Yidi Jiang, Tianchi Liu, Haizhou Li

Enhancing Real-World Active Speaker Detection with Multi-Modal Extraction Pre-Training

Apr 01, 2024
Ruijie Tao, Xinyuan Qian, Rohan Kumar Das, Xiaoxue Gao, Jiadong Wang, Haizhou Li

Target Speech Extraction with Pre-trained AV-HuBERT and Mask-And-Recover Strategy

Mar 24, 2024
Wenxuan Wu, Xueyuan Chen, Xixin Wu, Haizhou Li, Helen Meng

CrossTune: Black-Box Few-Shot Classification with Label Enhancement

Mar 19, 2024
Danqing Luo, Chen Zhang, Yan Zhang, Haizhou Li

Apollo: An Lightweight Multilingual Medical LLM towards Democratizing Medical AI to 6B People

Mar 09, 2024
Xidong Wang, Nuo Chen, Junyin Chen, Yan Hu, Yidong Wang, Xiangbo Wu, Anningzhe Gao, Xiang Wan, Haizhou Li, Benyou Wang

sVAD: A Robust, Low-Power, and Light-Weight Voice Activity Detection with Spiking Neural Networks

Mar 09, 2024
Qu Yang, Qianhui Liu, Nan Li, Meng Ge, Zeyang Song, Haizhou Li

Fine-Grained Quantitative Emotion Editing for Speech Generation

Mar 04, 2024
Sho Inoue, Kun Zhou, Shuai Wang, Haizhou Li

Event-Driven Learning for Spiking Neural Networks

Mar 01, 2024
Wenjie Wei, Malu Zhang, Jilin Zhang, Ammar Belatreche, Jibin Wu, Zijing Xu, Xuerui Qiu, Hong Chen, Yang Yang, Haizhou Li

Text-guided HuBERT: Self-Supervised Speech Pre-training via Generative Adversarial Networks

Feb 28, 2024
Duo Ma, Xianghu Yue, Junyi Ao, Xiaoxue Gao, Haizhou Li
