Picture for Gengze Zhou

Gengze Zhou

VLNVerse: A Benchmark for Vision-Language Navigation with Versatile, Embodied, Realistic Simulation and Evaluation

Add code
Dec 22, 2025
Viaarxiv icon

MMGR: Multi-Modal Generative Reasoning

Add code
Dec 17, 2025
Viaarxiv icon

Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments

Add code
Feb 26, 2025
Figure 1 for Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments
Figure 2 for Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments
Figure 3 for Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments
Figure 4 for Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments
Viaarxiv icon

SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts

Add code
Dec 07, 2024
Viaarxiv icon

NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models

Add code
Jul 17, 2024
Figure 1 for NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
Figure 2 for NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
Figure 3 for NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
Figure 4 for NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models
Viaarxiv icon

NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation

Add code
Mar 01, 2024
Figure 1 for NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation
Figure 2 for NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation
Figure 3 for NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation
Figure 4 for NaVid: Video-based VLM Plans the Next Step for Vision-and-Language Navigation
Viaarxiv icon

WebVLN: Vision-and-Language Navigation on Websites

Add code
Dec 25, 2023
Viaarxiv icon

NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models

Add code
May 29, 2023
Figure 1 for NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
Figure 2 for NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
Figure 3 for NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
Figure 4 for NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models
Viaarxiv icon