Rxr


Observation-Graph Interaction and Key-Detail Guidance for Vision and Language Navigation

Add code
Mar 14, 2025
Viaarxiv icon

NAVCON: A Cognitively Inspired and Linguistically Grounded Corpus for Vision and Language Navigation

Add code
Dec 17, 2024
Viaarxiv icon

Constraint-Aware Zero-Shot Vision-Language Navigation in Continuous Environments

Add code
Dec 13, 2024
Viaarxiv icon

InstruGen: Automatic Instruction Generation for Vision-and-Language Navigation Via Large Multimodal Models

Add code
Nov 18, 2024
Figure 1 for InstruGen: Automatic Instruction Generation for Vision-and-Language Navigation Via Large Multimodal Models
Figure 2 for InstruGen: Automatic Instruction Generation for Vision-and-Language Navigation Via Large Multimodal Models
Figure 3 for InstruGen: Automatic Instruction Generation for Vision-and-Language Navigation Via Large Multimodal Models
Figure 4 for InstruGen: Automatic Instruction Generation for Vision-and-Language Navigation Via Large Multimodal Models
Viaarxiv icon

Vision-Language Navigation with Energy-Based Policy

Add code
Oct 18, 2024
Figure 1 for Vision-Language Navigation with Energy-Based Policy
Figure 2 for Vision-Language Navigation with Energy-Based Policy
Figure 3 for Vision-Language Navigation with Energy-Based Policy
Figure 4 for Vision-Language Navigation with Energy-Based Policy
Viaarxiv icon

PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation

Add code
Jul 16, 2024
Figure 1 for PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation
Figure 2 for PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation
Figure 3 for PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation
Figure 4 for PRET: Planning with Directed Fidelity Trajectory for Vision and Language Navigation
Viaarxiv icon

Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation

Add code
Jun 14, 2024
Figure 1 for Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation
Figure 2 for Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation
Figure 3 for Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation
Figure 4 for Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation
Viaarxiv icon

Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts

Add code
Jun 04, 2024
Figure 1 for Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts
Figure 2 for Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts
Figure 3 for Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts
Figure 4 for Why Only Text: Empowering Vision-and-Language Navigation with Multi-modal Prompts
Viaarxiv icon

Correctable Landmark Discovery via Large Models for Vision-Language Navigation

Add code
May 29, 2024
Figure 1 for Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Figure 2 for Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Figure 3 for Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Figure 4 for Correctable Landmark Discovery via Large Models for Vision-Language Navigation
Viaarxiv icon

Vision-and-Language Navigation via Causal Learning

Add code
Apr 16, 2024
Viaarxiv icon