Yanjia Huang

VISTA: Generative Visual Imagination for Vision-and-Language Navigation

May 17, 2025
Via arXiv

Can Large Vision Language Models Read Maps Like a Human?

Mar 18, 2025
Via arXiv

PANDORA: Diffusion Policy Learning for Dexterous Robotic Piano Playing

Mar 17, 2025
Via arXiv

Zero-shot Object Navigation with Vision-Language Models Reasoning

Oct 24, 2024
Via arXiv