Haizhou Li

SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement

Jun 09, 2025
Via arXiv

Exploring Length Generalization For Transformer-based Speech Enhancement

Jun 07, 2025
Via arXiv

From Word to World: Evaluate and Mitigate Culture Bias via Word Association Test

May 24, 2025
Via arXiv

Towards Emotionally Consistent Text-Based Speech Editing: Introducing EmoCorrector and The ECD-TSE Dataset

May 24, 2025
Via arXiv

PersonaTAB: Predicting Personality Traits using Textual, Acoustic, and Behavioral Cues in Fully-Duplex Speech Dialogs

May 20, 2025
Via arXiv

Chain-Talker: Chain Understanding and Rendering for Empathetic Conversational Speech Synthesis

May 19, 2025
Via arXiv

What Is Next for LLMs? Next-Generation AI Computing Hardware Using Photonic Chips

May 09, 2025
Via arXiv

Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-spoofing

Apr 08, 2025
Via arXiv

Causal Self-supervised Pretrained Frontend with Predictive Code for Speech Separation

Apr 03, 2025
Via arXiv

$C^2$AV-TSE: Context and Confidence-aware Audio Visual Target Speaker Extraction

Apr 01, 2025
Via arXiv