Picture for Kefan Gu

Kefan Gu

IntentionVLA: Generalizable and Efficient Embodied Intention Reasoning for Human-Robot Interaction

Add code
Oct 09, 2025
Viaarxiv icon

LLaDA-VLA: Vision Language Diffusion Action Models

Add code
Sep 10, 2025
Viaarxiv icon

ROSA: Harnessing Robot States for Vision-Language and Action Alignment

Add code
Jun 16, 2025
Viaarxiv icon