Picture for Xiaomin Lie

Xiaomin Lie

Think Hierarchically, Act Dynamically: Hierarchical Multi-modal Fusion and Reasoning for Vision-and-Language Navigation

Add code
Apr 23, 2025
Viaarxiv icon