Hongrong Wang

SVLL: Staged Vision-Language Learning for Physically Grounded Embodied Task Planning

Mar 12, 2026
Via arXiv