Picture for Wenwu Wang

Wenwu Wang

Teacher-Guided Pseudo Supervision and Cross-Modal Alignment for Audio-Visual Video Parsing

Add code
Sep 17, 2025
Viaarxiv icon

RFM-Editing: Rectified Flow Matching for Text-guided Audio Editing

Add code
Sep 17, 2025
Viaarxiv icon

Region-Specific Audio Tagging for Spatial Sound

Add code
Sep 11, 2025
Viaarxiv icon

TEn-CATS: Text-Enriched Audio-Visual Video Parsing with Multi-Scale Category-Aware Temporal Graph

Add code
Sep 04, 2025
Viaarxiv icon

AudioTurbo: Fast Text-to-Audio Generation with Rectified Diffusion

Add code
May 28, 2025
Viaarxiv icon

EnvSDD: Benchmarking Environmental Sound Deepfake Detection

Add code
May 25, 2025
Viaarxiv icon

From Aesthetics to Human Preferences: Comparative Perspectives of Evaluating Text-to-Music Systems

Add code
Apr 30, 2025
Viaarxiv icon

Exploring the User Experience of AI-Assisted Sound Searching Systems for Creative Workflows

Add code
Apr 22, 2025
Viaarxiv icon

Audio-Visual Class-Incremental Learning for Fish Feeding intensity Assessment in Aquaculture

Add code
Apr 21, 2025
Figure 1 for Audio-Visual Class-Incremental Learning for Fish Feeding intensity Assessment in Aquaculture
Figure 2 for Audio-Visual Class-Incremental Learning for Fish Feeding intensity Assessment in Aquaculture
Figure 3 for Audio-Visual Class-Incremental Learning for Fish Feeding intensity Assessment in Aquaculture
Figure 4 for Audio-Visual Class-Incremental Learning for Fish Feeding intensity Assessment in Aquaculture
Viaarxiv icon

DMAGaze: Gaze Estimation Based on Feature Disentanglement and Multi-Scale Attention

Add code
Apr 15, 2025
Viaarxiv icon