Peter Anderson

On the Evaluation of Vision-and-Language Navigation Instructions

Jan 26, 2021

Where Are You? Localization from Embodied Dialog

Nov 16, 2020

Sim-to-Real Transfer for Vision-and-Language Navigation

Nov 07, 2020

Room-Across-Room: Multilingual Vision-and-Language Navigation with Dense Spatiotemporal Grounding

Oct 15, 2020

Spatially Aware Multimodal Transformers for TextVQA

Jul 23, 2020

Improving Vision-and-Language Navigation with Image-Text Pairs from the Web

May 01, 2020

Chasing Ghosts: Instruction Following as Bayesian State Tracking

Jul 03, 2019

RERERE: Remote Embodied Referring Expressions in Real indoor Environments

Apr 23, 2019

Audio-Visual Scene-Aware Dialog

Jan 25, 2019

nocaps: novel object captioning at scale

Dec 20, 2018