Picture for Junkun Hong

Junkun Hong

SVLL: Staged Vision-Language Learning for Physically Grounded Embodied Task Planning

Add code
Mar 12, 2026
Viaarxiv icon