Picture for Xiangyang Xue

Xiangyang Xue

Fudan University

DINO-VO: Learning Where to Focus for Enhanced State Estimation

Add code
Apr 05, 2026
Viaarxiv icon

ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models

Add code
Mar 22, 2026
Viaarxiv icon

OCRA: Object-Centric Learning with 3D and Tactile Priors for Human-to-Robot Action Transfer

Add code
Mar 15, 2026
Viaarxiv icon

DynamicVGGT: Learning Dynamic Point Maps for 4D Scene Reconstruction in Autonomous Driving

Add code
Mar 09, 2026
Viaarxiv icon

Vision-Language Feature Alignment for Road Anomaly Segmentation

Add code
Mar 01, 2026
Viaarxiv icon

Universal Pose Pretraining for Generalizable Vision-Language-Action Policies

Add code
Feb 23, 2026
Viaarxiv icon

EgoSound: Benchmarking Sound Understanding in Egocentric Videos

Add code
Feb 15, 2026
Viaarxiv icon

Aligned Stable Inpainting: Mitigating Unwanted Object Insertion and Preserving Color Consistency

Add code
Jan 21, 2026
Viaarxiv icon

ActiveVLA: Injecting Active Perception into Vision-Language-Action Models for Precise 3D Robotic Manipulation

Add code
Jan 13, 2026
Viaarxiv icon

CME-CAD: Heterogeneous Collaborative Multi-Expert Reinforcement Learning for CAD Code Generation

Add code
Dec 29, 2025
Viaarxiv icon