Hui Bu

Full-Duplex Interaction in Spoken Dialogue Systems: A Comprehensive Study from the ICASSP 2026 HumDial Challenge

Apr 23, 2026

HumDial-EIBench: A Human-Recorded Multi-Turn Emotional Intelligence Benchmark for Audio Language Models

Apr 13, 2026

WenetSpeech-Wu: Datasets, Benchmarks, and Models for a Unified Chinese Wu Dialect Speech Processing Ecosystem

Jan 16, 2026

The CCF AATC 2025: Speech Restoration Challenge

Sep 16, 2025

AISHELL-5: The First Open-Source In-Car Multi-Channel Multi-Speaker Speech Dataset for Automatic Speech Diarization and Recognition

May 29, 2025

MusicEval: A Generative Music Corpus with Expert Ratings for Automatic Text-to-Music Evaluation

Jan 18, 2025

Exploring Differences between Human Perception and Model Inference in Audio Event Recognition

Sep 10, 2024

Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge

Sep 09, 2024

RealMAN: A Real-Recorded and Annotated Microphone Array Dataset for Dynamic Speech Enhancement and Localization

Jun 28, 2024

Enhancing Voice Wake-Up for Dysarthria: Mandarin Dysarthria Speech Corpus Release and Customized System Design

Jun 14, 2024