Picture for Dhruv Batra

Dhruv Batra

Improving Vision-and-Language Navigation with Image-Text Pairs from the Web

Add code
May 01, 2020
Figure 1 for Improving Vision-and-Language Navigation with Image-Text Pairs from the Web
Figure 2 for Improving Vision-and-Language Navigation with Image-Text Pairs from the Web
Figure 3 for Improving Vision-and-Language Navigation with Image-Text Pairs from the Web
Figure 4 for Improving Vision-and-Language Navigation with Image-Text Pairs from the Web
Viaarxiv icon

Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments

Add code
Apr 06, 2020
Figure 1 for Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments
Figure 2 for Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments
Figure 3 for Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments
Figure 4 for Beyond the Nav-Graph: Vision-and-Language Navigation in Continuous Environments
Viaarxiv icon

Analyzing Visual Representations in Embodied Navigation Tasks

Add code
Mar 12, 2020
Figure 1 for Analyzing Visual Representations in Embodied Navigation Tasks
Figure 2 for Analyzing Visual Representations in Embodied Navigation Tasks
Figure 3 for Analyzing Visual Representations in Embodied Navigation Tasks
Figure 4 for Analyzing Visual Representations in Embodied Navigation Tasks
Viaarxiv icon

Are We Making Real Progress in Simulated Environments? Measuring the Sim2Real Gap in Embodied Visual Navigation

Add code
Dec 13, 2019
Figure 1 for Are We Making Real Progress in Simulated Environments? Measuring the Sim2Real Gap in Embodied Visual Navigation
Figure 2 for Are We Making Real Progress in Simulated Environments? Measuring the Sim2Real Gap in Embodied Visual Navigation
Figure 3 for Are We Making Real Progress in Simulated Environments? Measuring the Sim2Real Gap in Embodied Visual Navigation
Figure 4 for Are We Making Real Progress in Simulated Environments? Measuring the Sim2Real Gap in Embodied Visual Navigation
Viaarxiv icon

Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline

Add code
Dec 05, 2019
Figure 1 for Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline
Figure 2 for Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline
Figure 3 for Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline
Figure 4 for Large-scale Pretraining for Visual Dialog: A Simple State-of-the-Art Baseline
Viaarxiv icon

Decentralized Distributed PPO: Solving PointGoal Navigation

Add code
Nov 01, 2019
Figure 1 for Decentralized Distributed PPO: Solving PointGoal Navigation
Figure 2 for Decentralized Distributed PPO: Solving PointGoal Navigation
Figure 3 for Decentralized Distributed PPO: Solving PointGoal Navigation
Figure 4 for Decentralized Distributed PPO: Solving PointGoal Navigation
Viaarxiv icon

Improving Generative Visual Dialog by Answering Diverse Questions

Add code
Oct 03, 2019
Figure 1 for Improving Generative Visual Dialog by Answering Diverse Questions
Figure 2 for Improving Generative Visual Dialog by Answering Diverse Questions
Figure 3 for Improving Generative Visual Dialog by Answering Diverse Questions
Figure 4 for Improving Generative Visual Dialog by Answering Diverse Questions
Viaarxiv icon

Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning

Add code
Aug 22, 2019
Figure 1 for Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
Figure 2 for Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
Figure 3 for Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
Figure 4 for Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
Viaarxiv icon

Unsupervised Discovery of Decision States for Transfer in Reinforcement Learning

Add code
Aug 15, 2019
Figure 1 for Unsupervised Discovery of Decision States for Transfer in Reinforcement Learning
Figure 2 for Unsupervised Discovery of Decision States for Transfer in Reinforcement Learning
Figure 3 for Unsupervised Discovery of Decision States for Transfer in Reinforcement Learning
Figure 4 for Unsupervised Discovery of Decision States for Transfer in Reinforcement Learning
Viaarxiv icon

ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks

Add code
Aug 06, 2019
Figure 1 for ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Figure 2 for ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Figure 3 for ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Figure 4 for ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Viaarxiv icon