Picture for Xiaofeng Wang

Xiaofeng Wang

Spatial-Aware Reduction Framework: Towards Efficient and Faithful Visual State Space Models

Add code
Jun 18, 2026
Viaarxiv icon

R2RDreamer: 3D-aware Data Augmentation for Spatially-generalized 2D Manipulation Policies

Add code
Jun 15, 2026
Viaarxiv icon

ScoutVLA: UAV-Centric Active Perception via a Dual-Expert VLA Model for Open-World Embodied Question Answering

Add code
Jun 09, 2026
Viaarxiv icon

iMaC: Translating Actions into Motion and Contact Images for Embodied World Models

Add code
Jun 08, 2026
Viaarxiv icon

WAM-Nav: Asymmetric Latent World-Action Modeling for Unified Visual Navigation

Add code
Jun 03, 2026
Viaarxiv icon

SKIP: Sparse Keyframe Interpolation Paradigm for Efficient Embodied World Models

Add code
May 30, 2026
Viaarxiv icon

StableIDM: Stabilizing Inverse Dynamics Model against Manipulator Truncation via Spatio-Temporal Refinement

Add code
Apr 20, 2026
Viaarxiv icon

VAG: Dual-Stream Video-Action Generation for Embodied Data Synthesis

Add code
Apr 10, 2026
Viaarxiv icon

ViVa: A Video-Generative Value Model for Robot Reinforcement Learning

Add code
Apr 09, 2026
Viaarxiv icon

ReconPhys: Reconstruct Appearance and Physical Attributes from Single Video

Add code
Apr 09, 2026
Viaarxiv icon