Picture for Yu-Chiang Frank Wang

Yu-Chiang Frank Wang

UWAV: Uncertainty-weighted Weakly-supervised Audio-Visual Video Parsing

Add code
May 14, 2025
Viaarxiv icon

VideoMage: Multi-Subject and Motion Customization of Text-to-Video Diffusion Models

Add code
Mar 27, 2025
Viaarxiv icon

Segment Anything, Even Occluded

Add code
Mar 08, 2025
Viaarxiv icon

Plan2Align: Predictive Planning Based Test-Time Preference Alignment in Paragraph-Level Machine Translation

Add code
Feb 28, 2025
Viaarxiv icon

Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration

Add code
Feb 23, 2025
Viaarxiv icon

MotionMatcher: Motion Customization of Text-to-Video Diffusion Models via Motion Feature Matching

Add code
Feb 18, 2025
Viaarxiv icon

3D Gaussian Inpainting with Depth-Guided Cross-View Consistency

Add code
Feb 17, 2025
Viaarxiv icon

Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation

Add code
Feb 04, 2025
Viaarxiv icon

Towards Affordance-Aware Articulation Synthesis for Rigged Objects

Add code
Jan 21, 2025
Viaarxiv icon

Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks

Add code
Jan 14, 2025
Viaarxiv icon