Picture for Yilun Chen

Yilun Chen

InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation

Add code
Jul 23, 2025
Viaarxiv icon

Rethinking the Embodied Gap in Vision-and-Language Navigation: A Holistic Study of Physical and Visual Disparities

Add code
Jul 17, 2025
Viaarxiv icon

CronusVLA: Transferring Latent Motion Across Time for Multi-Frame Prediction in Manipulation

Add code
Jun 24, 2025
Viaarxiv icon

GENMANIP: LLM-driven Simulation for Generalizable Instruction-Following Manipulation

Add code
Jun 12, 2025
Viaarxiv icon

LiloDriver: A Lifelong Learning Framework for Closed-loop Motion Planning in Long-tail Autonomous Driving Scenarios

Add code
May 22, 2025
Viaarxiv icon

MoMoE: Mixture of Moderation Experts Framework for AI-Assisted Online Governance

Add code
May 20, 2025
Viaarxiv icon

NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance

Add code
May 13, 2025
Viaarxiv icon

RoboGround: Robotic Manipulation with Grounded Vision-Language Priors

Add code
Apr 30, 2025
Viaarxiv icon

A Data-Centric Revisit of Pre-Trained Vision Models for Robot Learning

Add code
Mar 10, 2025
Viaarxiv icon

A characterization of sample adaptivity in UCB data

Add code
Mar 06, 2025
Viaarxiv icon