Picture for Wenxiang Guo

Wenxiang Guo

STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation

Add code
Jul 09, 2025
Viaarxiv icon

Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis

Add code
Jul 08, 2025
Viaarxiv icon

TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis

Add code
May 20, 2025
Viaarxiv icon

ISDrama: Immersive Spatial Drama Generation through Multimodal Prompting

Add code
Apr 29, 2025
Viaarxiv icon

Versatile Framework for Song Generation with Prompt-based Control

Add code
Apr 29, 2025
Viaarxiv icon

GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks

Add code
Sep 26, 2024
Figure 1 for GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks
Figure 2 for GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks
Figure 3 for GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks
Figure 4 for GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks
Viaarxiv icon

Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching

Add code
Jun 01, 2024
Figure 1 for Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching
Figure 2 for Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching
Figure 3 for Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching
Figure 4 for Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching
Viaarxiv icon