Picture for Xiu Li

Xiu Li

PhysEditWorld: A Large-Scale Dataset Toward Physics-Editable World Models

Add code
Jun 25, 2026
Viaarxiv icon

Decoupling Semantics and Geometric Grounding: Spatial Visual Prompts for Language-Conditioned Imitation Learning

Add code
Jun 24, 2026
Viaarxiv icon

Learning Visual Spatial Planning from Symbolic State via Modality-Gap-Aware Self-Distillation

Add code
Jun 04, 2026
Viaarxiv icon

ELAN4D: Embodiment-Centric 4D Supervision for Vision-Language-Action Models via Plug-and-Play Adaptation

Add code
May 28, 2026
Viaarxiv icon

KVPO: ODE-Native GRPO for Autoregressive Video Alignment via KV Semantic Exploration

Add code
May 14, 2026
Viaarxiv icon

RoTE: Coarse-to-Fine Multi-Level Rotary Time Embedding for Sequential Recommendation

Add code
Apr 15, 2026
Viaarxiv icon

PRISM: Rethinking Scattered Atmosphere Reconstruction as a Unified Understanding and Generation Model for Real-world Dehazing

Add code
Apr 08, 2026
Viaarxiv icon

SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing

Add code
Apr 06, 2026
Viaarxiv icon

ViGoR-Bench: How Far Are Visual Generative Models From Zero-Shot Visual Reasoners?

Add code
Mar 26, 2026
Viaarxiv icon

Photon: Speedup Volume Understanding with Efficient Multimodal Large Language Models

Add code
Mar 26, 2026
Viaarxiv icon