Alert button
Picture for Dhruv Batra

Dhruv Batra

Alert button

ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings

Jun 24, 2022
Arjun Majumdar, Gunjan Aggarwal, Bhavika Devnani, Judy Hoffman, Dhruv Batra

Figure 1 for ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings
Figure 2 for ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings
Figure 3 for ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings
Figure 4 for ZSON: Zero-Shot Object-Goal Navigation using Multimodal Goal Embeddings
Viaarxiv icon

SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning

Jun 16, 2022
Changan Chen, Carl Schissler, Sanchit Garg, Philip Kobernik, Alexander Clegg, Paul Calamia, Dhruv Batra, Philip W Robinson, Kristen Grauman

Figure 1 for SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning
Figure 2 for SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning
Figure 3 for SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning
Figure 4 for SoundSpaces 2.0: A Simulation Platform for Visual-Acoustic Learning
Viaarxiv icon

Is Mapping Necessary for Realistic PointGoal Navigation?

Jun 07, 2022
Ruslan Partsey, Erik Wijmans, Naoki Yokoyama, Oles Dobosevych, Dhruv Batra, Oleksandr Maksymets

Figure 1 for Is Mapping Necessary for Realistic PointGoal Navigation?
Figure 2 for Is Mapping Necessary for Realistic PointGoal Navigation?
Figure 3 for Is Mapping Necessary for Realistic PointGoal Navigation?
Figure 4 for Is Mapping Necessary for Realistic PointGoal Navigation?
Viaarxiv icon

Housekeep: Tidying Virtual Households using Commonsense Reasoning

May 22, 2022
Yash Kant, Arun Ramachandran, Sriram Yenamandra, Igor Gilitschenski, Dhruv Batra, Andrew Szot, Harsh Agrawal

Figure 1 for Housekeep: Tidying Virtual Households using Commonsense Reasoning
Figure 2 for Housekeep: Tidying Virtual Households using Commonsense Reasoning
Figure 3 for Housekeep: Tidying Virtual Households using Commonsense Reasoning
Figure 4 for Housekeep: Tidying Virtual Households using Commonsense Reasoning
Viaarxiv icon

Episodic Memory Question Answering

May 03, 2022
Samyak Datta, Sameer Dharur, Vincent Cartillier, Ruta Desai, Mukul Khanna, Dhruv Batra, Devi Parikh

Figure 1 for Episodic Memory Question Answering
Figure 2 for Episodic Memory Question Answering
Figure 3 for Episodic Memory Question Answering
Figure 4 for Episodic Memory Question Answering
Viaarxiv icon

Offline Visual Representation Learning for Embodied Navigation

Apr 27, 2022
Karmesh Yadav, Ram Ramrakhya, Arjun Majumdar, Vincent-Pierre Berges, Sachit Kuhar, Dhruv Batra, Alexei Baevski, Oleksandr Maksymets

Figure 1 for Offline Visual Representation Learning for Embodied Navigation
Figure 2 for Offline Visual Representation Learning for Embodied Navigation
Figure 3 for Offline Visual Representation Learning for Embodied Navigation
Figure 4 for Offline Visual Representation Learning for Embodied Navigation
Viaarxiv icon

Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale

Apr 08, 2022
Ram Ramrakhya, Eric Undersander, Dhruv Batra, Abhishek Das

Figure 1 for Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale
Figure 2 for Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale
Figure 3 for Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale
Figure 4 for Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale
Viaarxiv icon

Simple and Effective Synthesis of Indoor 3D Scenes

Apr 06, 2022
Jing Yu Koh, Harsh Agrawal, Dhruv Batra, Richard Tucker, Austin Waters, Honglak Lee, Yinfei Yang, Jason Baldridge, Peter Anderson

Figure 1 for Simple and Effective Synthesis of Indoor 3D Scenes
Figure 2 for Simple and Effective Synthesis of Indoor 3D Scenes
Figure 3 for Simple and Effective Synthesis of Indoor 3D Scenes
Figure 4 for Simple and Effective Synthesis of Indoor 3D Scenes
Viaarxiv icon

SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation

Oct 27, 2021
Abhinav Moudgil, Arjun Majumdar, Harsh Agrawal, Stefan Lee, Dhruv Batra

Figure 1 for SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation
Figure 2 for SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation
Figure 3 for SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation
Figure 4 for SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation
Viaarxiv icon

Ego4D: Around the World in 3,000 Hours of Egocentric Video

Oct 13, 2021
Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Christian Fuegen, Abrham Gebreselasie, Cristina Gonzalez, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jachym Kolar, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Yunyi Zhu, Pablo Arbelaez, David Crandall, Dima Damen, Giovanni Maria Farinella, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik

Figure 1 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 2 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 3 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Figure 4 for Ego4D: Around the World in 3,000 Hours of Egocentric Video
Viaarxiv icon