Yu-Xiong Wang

Dexplore: Scalable Neural Control for Dexterous Manipulation from Reference-Scoped Exploration

Sep 11, 2025

InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation

Sep 11, 2025

Virtual Fitting Room: Generating Arbitrarily Long Videos of Virtual Try-On from a Single Image -- Technical Preview

Sep 04, 2025

Dress&Dance: Dress up and Dance as You Like It - Technical Preview

Aug 28, 2025

Towards Formal Verification of LLM-Generated Code from Natural Language Prompts

Jul 17, 2025

Refer to Anything with Vision-Language Prompts

Jun 05, 2025

Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought

May 29, 2025

MR. Video: "MapReduce" is the Principle for Long Video Understanding

Apr 22, 2025

Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception

Apr 15, 2025

AgMMU: A Comprehensive Agricultural Multimodal Understanding and Reasoning Benchmark

Apr 14, 2025