Picture for Wenwu Wang

Wenwu Wang

AudioTurbo: Fast Text-to-Audio Generation with Rectified Diffusion

Add code
May 28, 2025
Viaarxiv icon

EnvSDD: Benchmarking Environmental Sound Deepfake Detection

Add code
May 25, 2025
Viaarxiv icon

From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems

Add code
Apr 30, 2025
Viaarxiv icon

Exploring the User Experience of AI-Assisted Sound Searching Systems for Creative Workflows

Add code
Apr 22, 2025
Viaarxiv icon

Audio-Visual Class-Incremental Learning for Fish Feeding intensity Assessment in Aquaculture

Add code
Apr 21, 2025
Viaarxiv icon

DMAGaze: Gaze Estimation Based on Feature Disentanglement and Multi-Scale Attention

Add code
Apr 15, 2025
Viaarxiv icon

DGFM: Full Body Dance Generation Driven by Music Foundation Models

Add code
Feb 27, 2025
Viaarxiv icon

GCDance: Genre-Controlled 3D Full Body Dance Generation Driven By Music

Add code
Feb 25, 2025
Viaarxiv icon

The ICME 2025 Audio Encoder Capability Challenge

Add code
Jan 25, 2025
Viaarxiv icon

Graph-Enhanced Dual-Stream Feature Fusion with Pre-Trained Model for Acoustic Traffic Monitoring

Add code
Dec 26, 2024
Figure 1 for Graph-Enhanced Dual-Stream Feature Fusion with Pre-Trained Model for Acoustic Traffic Monitoring
Figure 2 for Graph-Enhanced Dual-Stream Feature Fusion with Pre-Trained Model for Acoustic Traffic Monitoring
Figure 3 for Graph-Enhanced Dual-Stream Feature Fusion with Pre-Trained Model for Acoustic Traffic Monitoring
Figure 4 for Graph-Enhanced Dual-Stream Feature Fusion with Pre-Trained Model for Acoustic Traffic Monitoring
Viaarxiv icon