Picture for Haizhou Li

Haizhou Li

LocSelect: Target Speaker Localization with an Auditory Selective Hearing Mechanism

Add code
Oct 17, 2023
Figure 1 for LocSelect: Target Speaker Localization with an Auditory Selective Hearing Mechanism
Figure 2 for LocSelect: Target Speaker Localization with an Auditory Selective Hearing Mechanism
Figure 3 for LocSelect: Target Speaker Localization with an Auditory Selective Hearing Mechanism
Figure 4 for LocSelect: Target Speaker Localization with an Auditory Selective Hearing Mechanism
Viaarxiv icon

UNO-DST: Leveraging Unlabelled Data in Zero-Shot Dialogue State Tracking

Add code
Oct 16, 2023
Viaarxiv icon

xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark

Add code
Oct 13, 2023
Figure 1 for xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark
Figure 2 for xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark
Figure 3 for xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark
Figure 4 for xDial-Eval: A Multilingual Open-Domain Dialogue Evaluation Benchmark
Viaarxiv icon

Disentangling Voice and Content with Self-Supervision for Speaker Recognition

Add code
Oct 02, 2023
Viaarxiv icon

Leveraging In-the-Wild Data for Effective Self-Supervised Pretraining in Speaker Recognition

Add code
Sep 27, 2023
Viaarxiv icon

AceGPT, Localizing Large Language Models in Arabic

Add code
Sep 22, 2023
Figure 1 for AceGPT, Localizing Large Language Models in Arabic
Figure 2 for AceGPT, Localizing Large Language Models in Arabic
Figure 3 for AceGPT, Localizing Large Language Models in Arabic
Figure 4 for AceGPT, Localizing Large Language Models in Arabic
Viaarxiv icon

FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency

Add code
Sep 22, 2023
Figure 1 for FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency
Figure 2 for FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency
Figure 3 for FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency
Figure 4 for FluentEditor: Text-based Speech Editing by Considering Acoustic and Prosody Consistency
Viaarxiv icon

Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech

Add code
Sep 21, 2023
Viaarxiv icon

USED: Universal Speaker Extraction and Diarization

Add code
Sep 19, 2023
Figure 1 for USED: Universal Speaker Extraction and Diarization
Figure 2 for USED: Universal Speaker Extraction and Diarization
Figure 3 for USED: Universal Speaker Extraction and Diarization
Figure 4 for USED: Universal Speaker Extraction and Diarization
Viaarxiv icon

Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks

Add code
Sep 18, 2023
Figure 1 for Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks
Figure 2 for Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks
Figure 3 for Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks
Figure 4 for Spiking-LEAF: A Learnable Auditory front-end for Spiking Neural Networks
Viaarxiv icon