Devi Parikh

Human-Adversarial Visual Question Answering

Jun 04, 2021
Sasha Sheng, Amanpreet Singh, Vedanuj Goswami, Jose Alberto Lopez Magana, Wojciech Galuba, Devi Parikh, Douwe Kiela

VISITRON: Visual Semantics-Aligned Interactively Trained Object-Navigator

May 25, 2021
Ayush Shrivastava, Karthik Gopalakrishnan, Yang Liu, Robinson Piramuthu, Gokhan Tür, Devi Parikh, Dilek Hakkani-Tür

ForceNet: A Graph Neural Network for Large-Scale Quantum Calculations

Mar 02, 2021
Weihua Hu, Muhammed Shuaibi, Abhishek Das, Siddharth Goyal, Anuroop Sriram, Jure Leskovec, Devi Parikh, C. Lawrence Zitnick

VX2TEXT: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs

Jan 29, 2021
Xudong Lin, Gedas Bertasius, Jue Wang, Shih-Fu Chang, Devi Parikh, Lorenzo Torresani

Vx2Text: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs

Jan 28, 2021
Xudong Lin, Gedas Bertasius, Jue Wang, Shih-Fu Chang, Devi Parikh, Lorenzo Torresani

Object-Centric Diagnosis of Visual Reasoning

Dec 21, 2020
Jianwei Yang, Jiayuan Mao, Jiajun Wu, Devi Parikh, David D. Cox, Joshua B. Tenenbaum, Chuang Gan

KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA

Dec 20, 2020
Kenneth Marino, Xinlei Chen, Devi Parikh, Abhinav Gupta, Marcus Rohrbach

Creative Sketch Generation

Nov 19, 2020
Songwei Ge, Vedanuj Goswami, C. Lawrence Zitnick, Devi Parikh

Where Are You? Localization from Embodied Dialog

Nov 16, 2020
Meera Hahn, Jacob Krantz, Dhruv Batra, Devi Parikh, James M. Rehg, Stefan Lee, Peter Anderson

Sim-to-Real Transfer for Vision-and-Language Navigation

Nov 07, 2020
Peter Anderson, Ayush Shrivastava, Joanne Truong, Arjun Majumdar, Devi Parikh, Dhruv Batra, Stefan Lee
