Changhao Pan

TMD-Bench: A Multi-Level Evaluation Paradigm for Music-Dance Co-Generation

May 03, 2026, via arXiv

Diffusion Model as a Generalist Segmentation Learner

Apr 27, 2026, via arXiv

ImVideoEdit: Image-learning Video Editing via 2D Spatial Difference Attention Blocks

Apr 09, 2026, via arXiv

Modeling and Benchmarking Spoken Dialogue Rewards with Modality and Colloquialness

Mar 16, 2026, via arXiv

Synthetic Singers: A Review of Deep-Learning-based Singing Voice Synthesis Approaches

Jan 20, 2026, via arXiv

STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation

Jul 09, 2025, via arXiv

Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis

Jul 08, 2025, via arXiv

TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis

May 20, 2025, via arXiv

Versatile Framework for Song Generation with Prompt-based Control

Apr 29, 2025, via arXiv

ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting

Apr 29, 2025, via arXiv