Picture for Xiangyang Xue

Xiangyang Xue

Fudan University

DynamicVGGT: Learning Dynamic Point Maps for 4D Scene Reconstruction in Autonomous Driving

Add code
Mar 09, 2026
Viaarxiv icon

Vision-Language Feature Alignment for Road Anomaly Segmentation

Add code
Mar 01, 2026
Viaarxiv icon

Universal Pose Pretraining for Generalizable Vision-Language-Action Policies

Add code
Feb 23, 2026
Viaarxiv icon

EgoSound: Benchmarking Sound Understanding in Egocentric Videos

Add code
Feb 15, 2026
Viaarxiv icon

Aligned Stable Inpainting: Mitigating Unwanted Object Insertion and Preserving Color Consistency

Add code
Jan 21, 2026
Viaarxiv icon

ActiveVLA: Injecting Active Perception into Vision-Language-Action Models for Precise 3D Robotic Manipulation

Add code
Jan 13, 2026
Viaarxiv icon

CME-CAD: Heterogeneous Collaborative Multi-Expert Reinforcement Learning for CAD Code Generation

Add code
Dec 29, 2025
Viaarxiv icon

VidSplice: Towards Coherent Video Inpainting via Explicit Spaced Frame Guidance

Add code
Oct 24, 2025
Viaarxiv icon

Learning Global Representation from Queries for Vectorized HD Map Construction

Add code
Oct 08, 2025
Viaarxiv icon

Training-Free Pyramid Token Pruning for Efficient Large Vision-Language Models via Region, Token, and Instruction-Guided Importance

Add code
Sep 19, 2025
Viaarxiv icon