Picture for Anna Rohrbach

Anna Rohrbach

Video Object Segmentation with Language Referring Expressions

Add code
May 09, 2018
Figure 1 for Video Object Segmentation with Language Referring Expressions
Figure 2 for Video Object Segmentation with Language Referring Expressions
Figure 3 for Video Object Segmentation with Language Referring Expressions
Figure 4 for Video Object Segmentation with Language Referring Expressions
Viaarxiv icon

Fooling Vision and Language Models Despite Localization and Attention Mechanism

Add code
Apr 06, 2018
Figure 1 for Fooling Vision and Language Models Despite Localization and Attention Mechanism
Viaarxiv icon

Multimodal Explanations: Justifying Decisions and Pointing to the Evidence

Add code
Feb 15, 2018
Figure 1 for Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Figure 2 for Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Figure 3 for Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Figure 4 for Multimodal Explanations: Justifying Decisions and Pointing to the Evidence
Viaarxiv icon

Attentive Explanations: Justifying Decisions and Pointing to the Evidence (Extended Abstract)

Add code
Nov 17, 2017
Figure 1 for Attentive Explanations: Justifying Decisions and Pointing to the Evidence (Extended Abstract)
Figure 2 for Attentive Explanations: Justifying Decisions and Pointing to the Evidence (Extended Abstract)
Figure 3 for Attentive Explanations: Justifying Decisions and Pointing to the Evidence (Extended Abstract)
Figure 4 for Attentive Explanations: Justifying Decisions and Pointing to the Evidence (Extended Abstract)
Viaarxiv icon

Gradient-free Policy Architecture Search and Adaptation

Add code
Oct 16, 2017
Figure 1 for Gradient-free Policy Architecture Search and Adaptation
Figure 2 for Gradient-free Policy Architecture Search and Adaptation
Figure 3 for Gradient-free Policy Architecture Search and Adaptation
Figure 4 for Gradient-free Policy Architecture Search and Adaptation
Viaarxiv icon

Generating Descriptions with Grounded and Co-Referenced People

Add code
Apr 05, 2017
Figure 1 for Generating Descriptions with Grounded and Co-Referenced People
Figure 2 for Generating Descriptions with Grounded and Co-Referenced People
Figure 3 for Generating Descriptions with Grounded and Co-Referenced People
Figure 4 for Generating Descriptions with Grounded and Co-Referenced People
Viaarxiv icon

Grounding of Textual Phrases in Images by Reconstruction

Add code
Feb 17, 2017
Viaarxiv icon

A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering

Add code
Feb 05, 2017
Figure 1 for A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering
Figure 2 for A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering
Figure 3 for A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering
Figure 4 for A dataset and exploration of models for understanding video data through fill-in-the-blank question-answering
Viaarxiv icon

Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding

Add code
Sep 24, 2016
Figure 1 for Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Figure 2 for Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Figure 3 for Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Figure 4 for Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Viaarxiv icon

Movie Description

Add code
May 12, 2016
Figure 1 for Movie Description
Figure 2 for Movie Description
Figure 3 for Movie Description
Figure 4 for Movie Description
Viaarxiv icon