Picture for Yilun Chen

Yilun Chen

VistaBot: View-Robust Robot Manipulation via Spatiotemporal-Aware View Synthesis

Add code
Apr 23, 2026
Viaarxiv icon

PokeVLA: Empowering Pocket-Sized Vision-Language-Action Model with Comprehensive World Knowledge Guidance

Add code
Apr 22, 2026
Viaarxiv icon

Unveiling the Surprising Efficacy of Navigation Understanding in End-to-End Autonomous Driving

Add code
Apr 14, 2026
Viaarxiv icon

StarVLA-$α$: Reducing Complexity in Vision-Language-Action Systems

Add code
Apr 13, 2026
Viaarxiv icon

Chat-Scene++: Exploiting Context-Rich Object Identification for 3D LLM

Add code
Mar 29, 2026
Viaarxiv icon

OmniVTA: Visuo-Tactile World Modeling for Contact-Rich Robotic Manipulation

Add code
Mar 19, 2026
Viaarxiv icon

FutureVLA: Joint Visuomotor Prediction for Vision-Language-Action Model

Add code
Mar 11, 2026
Viaarxiv icon

Rhythm: Learning Interactive Whole-Body Control for Dual Humanoids

Add code
Mar 03, 2026
Viaarxiv icon

RoboInter: A Holistic Intermediate Representation Suite Towards Robotic Manipulation

Add code
Feb 10, 2026
Viaarxiv icon

ST4VLA: Spatially Guided Training for Vision-Language-Action Models

Add code
Feb 10, 2026
Viaarxiv icon