Haizhou Li

SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement

Jun 09, 2025
Via arXiv

Exploring Length Generalization For Transformer-based Speech Enhancement

Jun 07, 2025
Via arXiv

Towards Emotionally Consistent Text-Based Speech Editing: Introducing EmoCorrector and The ECD-TSE Dataset

May 24, 2025
Via arXiv

From Word to World: Evaluate and Mitigate Culture Bias via Word Association Test

May 24, 2025
Via arXiv

PersonaTAB: Predicting Personality Traits using Textual, Acoustic, and Behavioral Cues in Fully-Duplex Speech Dialogs

May 20, 2025
Via arXiv

Chain-Talker: Chain Understanding and Rendering for Empathetic Conversational Speech Synthesis

May 19, 2025
Via arXiv

What Is Next for LLMs? Next-Generation AI Computing Hardware Using Photonic Chips

May 09, 2025
Via arXiv

Nes2Net: A Lightweight Nested Architecture for Foundation Model Driven Speech Anti-spoofing

Apr 08, 2025
Via arXiv

Causal Self-supervised Pretrained Frontend with Predictive Code for Speech Separation

Apr 03, 2025
Via arXiv

$C^2$AV-TSE: Context and Confidence-aware Audio Visual Target Speaker Extraction

Apr 01, 2025
Via arXiv