Picture for Xuesong Li

Xuesong Li

EvoScene-VLA: Evolving Scene Beliefs Inside the Action Decoder for Chunked Robot Control

Add code
May 21, 2026
Viaarxiv icon

Structural Energy Guidance for View-Consistent Text-to-3D Generation

Add code
May 19, 2026
Viaarxiv icon

Break the Brake, Not the Wheel: Untargeted Jailbreak via Entropy Maximization

Add code
May 11, 2026
Viaarxiv icon

From Diffusion to Rectified Flow: Rethinking Text-Based Segmentation

Add code
May 06, 2026
Viaarxiv icon

EgoSelf: From Memory to Personalized Egocentric Assistant

Add code
Apr 22, 2026
Viaarxiv icon

ReMAP-DP: Reprojected Multi-view Aligned PointMaps for Diffusion Policy

Add code
Mar 16, 2026
Viaarxiv icon

MVHOI: Bridge Multi-view Condition to Complex Human-Object Interaction Video Reenactment via 3D Foundation Model

Add code
Mar 16, 2026
Viaarxiv icon

RnG: A Unified Transformer for Complete 3D Modeling from Partial Observations

Add code
Mar 01, 2026
Viaarxiv icon

Probing and Bridging Geometry-Interaction Cues for Affordance Reasoning in Vision Foundation Models

Add code
Feb 24, 2026
Viaarxiv icon

A Kung Fu Athlete Bot That Can Do It All Day: Highly Dynamic, Balance-Challenging Motion Dataset and Autonomous Fall-Resilient Tracking

Add code
Feb 14, 2026
Viaarxiv icon