Picture for Junxian Wu

Junxian Wu

MOON2.0: Dynamic Modality-balanced Multimodal Representation Learning for E-commerce Product Understanding

Add code
Nov 16, 2025
Viaarxiv icon

Learning to Hear by Seeing: It's Time for Vision Language Models to Understand Artistic Emotion from Sight and Sound

Add code
Nov 15, 2025
Viaarxiv icon

Controllable Video-to-Music Generation with Multiple Time-Varying Conditions

Add code
Jul 28, 2025
Viaarxiv icon

GVMGen: A General Video-to-Music Generation Model with Hierarchical Attentions

Add code
Jan 17, 2025
Viaarxiv icon

SpineCLUE: Automatic Vertebrae Identification Using Contrastive Learning and Uncertainty Estimation

Add code
Jan 14, 2024
Viaarxiv icon