Alert button
Picture for Harsh Agrawal

Harsh Agrawal

Alert button

Large Language Models as Generalizable Policies for Embodied Tasks

Add code
Bookmark button
Alert button
Oct 26, 2023
Andrew Szot, Max Schwarzer, Harsh Agrawal, Bogdan Mazoure, Walter Talbott, Katherine Metcalf, Natalie Mackraz, Devon Hjelm, Alexander Toshev

Viaarxiv icon

Housekeep: Tidying Virtual Households using Commonsense Reasoning

Add code
Bookmark button
Alert button
May 22, 2022
Yash Kant, Arun Ramachandran, Sriram Yenamandra, Igor Gilitschenski, Dhruv Batra, Andrew Szot, Harsh Agrawal

Figure 1 for Housekeep: Tidying Virtual Households using Commonsense Reasoning
Figure 2 for Housekeep: Tidying Virtual Households using Commonsense Reasoning
Figure 3 for Housekeep: Tidying Virtual Households using Commonsense Reasoning
Figure 4 for Housekeep: Tidying Virtual Households using Commonsense Reasoning
Viaarxiv icon

Simple and Effective Synthesis of Indoor 3D Scenes

Add code
Bookmark button
Alert button
Apr 06, 2022
Jing Yu Koh, Harsh Agrawal, Dhruv Batra, Richard Tucker, Austin Waters, Honglak Lee, Yinfei Yang, Jason Baldridge, Peter Anderson

Figure 1 for Simple and Effective Synthesis of Indoor 3D Scenes
Figure 2 for Simple and Effective Synthesis of Indoor 3D Scenes
Figure 3 for Simple and Effective Synthesis of Indoor 3D Scenes
Figure 4 for Simple and Effective Synthesis of Indoor 3D Scenes
Viaarxiv icon

SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation

Add code
Bookmark button
Alert button
Oct 27, 2021
Abhinav Moudgil, Arjun Majumdar, Harsh Agrawal, Stefan Lee, Dhruv Batra

Figure 1 for SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation
Figure 2 for SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation
Figure 3 for SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation
Figure 4 for SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation
Viaarxiv icon

The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation

Add code
Bookmark button
Alert button
Aug 26, 2021
Xiaoming Zhao, Harsh Agrawal, Dhruv Batra, Alexander Schwing

Figure 1 for The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation
Figure 2 for The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation
Figure 3 for The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation
Figure 4 for The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation
Viaarxiv icon

Contrast and Classify: Alternate Training for Robust VQA

Add code
Bookmark button
Alert button
Oct 13, 2020
Yash Kant, Abhinav Moudgil, Dhruv Batra, Devi Parikh, Harsh Agrawal

Figure 1 for Contrast and Classify: Alternate Training for Robust VQA
Figure 2 for Contrast and Classify: Alternate Training for Robust VQA
Figure 3 for Contrast and Classify: Alternate Training for Robust VQA
Figure 4 for Contrast and Classify: Alternate Training for Robust VQA
Viaarxiv icon

Spatially Aware Multimodal Transformers for TextVQA

Add code
Bookmark button
Alert button
Jul 23, 2020
Yash Kant, Dhruv Batra, Peter Anderson, Alex Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal

Figure 1 for Spatially Aware Multimodal Transformers for TextVQA
Figure 2 for Spatially Aware Multimodal Transformers for TextVQA
Figure 3 for Spatially Aware Multimodal Transformers for TextVQA
Figure 4 for Spatially Aware Multimodal Transformers for TextVQA
Viaarxiv icon

Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning

Add code
Bookmark button
Alert button
Aug 22, 2019
Jyoti Aneja, Harsh Agrawal, Dhruv Batra, Alexander Schwing

Figure 1 for Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
Figure 2 for Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
Figure 3 for Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
Figure 4 for Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
Viaarxiv icon

EvalAI: Towards Better Evaluation Systems for AI Agents

Add code
Bookmark button
Alert button
Feb 10, 2019
Deshraj Yadav, Rishabh Jain, Harsh Agrawal, Prithvijit Chattopadhyay, Taranjeet Singh, Akash Jain, Shiv Baran Singh, Stefan Lee, Dhruv Batra

Figure 1 for EvalAI: Towards Better Evaluation Systems for AI Agents
Figure 2 for EvalAI: Towards Better Evaluation Systems for AI Agents
Figure 3 for EvalAI: Towards Better Evaluation Systems for AI Agents
Figure 4 for EvalAI: Towards Better Evaluation Systems for AI Agents
Viaarxiv icon

nocaps: novel object captioning at scale

Add code
Bookmark button
Alert button
Dec 20, 2018
Harsh Agrawal, Karan Desai, Xinlei Chen, Rishabh Jain, Dhruv Batra, Devi Parikh, Stefan Lee, Peter Anderson

Figure 1 for nocaps: novel object captioning at scale
Figure 2 for nocaps: novel object captioning at scale
Figure 3 for nocaps: novel object captioning at scale
Figure 4 for nocaps: novel object captioning at scale
Viaarxiv icon