Picture for Xiangyu Yue

Xiangyu Yue

Learning to Integrate Diffusion ODEs by Averaging the Derivatives

Add code
May 20, 2025
Viaarxiv icon

CMT: A Cascade MAR with Topology Predictor for Multimodal Conditional CAD Generation

Add code
Apr 29, 2025
Viaarxiv icon

Multimodal Long Video Modeling Based on Temporal Dynamic Context

Add code
Apr 14, 2025
Viaarxiv icon

Video-R1: Reinforcing Video Reasoning in MLLMs

Add code
Mar 27, 2025
Viaarxiv icon

UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines

Add code
Mar 26, 2025
Viaarxiv icon

Unleashing Vecset Diffusion Model for Fast Shape Generation

Add code
Mar 20, 2025
Viaarxiv icon

SemGeoMo: Dynamic Contextual Human Motion Generation with Semantic and Geometric Guidance

Add code
Mar 03, 2025
Viaarxiv icon

Unposed Sparse Views Room Layout Reconstruction in the Age of Pretrain Model

Add code
Feb 24, 2025
Viaarxiv icon

Reflective Planning: Vision-Language Models for Multi-Stage Long-Horizon Robotic Manipulation

Add code
Feb 23, 2025
Viaarxiv icon

HiddenDetect: Detecting Jailbreak Attacks against Large Vision-Language Models via Monitoring Hidden States

Add code
Feb 21, 2025
Viaarxiv icon