Picture for Marcus Rohrbach

Marcus Rohrbach

Neural Module Networks

Add code
Jul 24, 2017
Figure 1 for Neural Module Networks
Figure 2 for Neural Module Networks
Figure 3 for Neural Module Networks
Figure 4 for Neural Module Networks
Viaarxiv icon

Captioning Images with Diverse Objects

Add code
Jul 20, 2017
Figure 1 for Captioning Images with Diverse Objects
Figure 2 for Captioning Images with Diverse Objects
Figure 3 for Captioning Images with Diverse Objects
Figure 4 for Captioning Images with Diverse Objects
Viaarxiv icon

Generating Descriptions with Grounded and Co-Referenced People

Add code
Apr 05, 2017
Figure 1 for Generating Descriptions with Grounded and Co-Referenced People
Figure 2 for Generating Descriptions with Grounded and Co-Referenced People
Figure 3 for Generating Descriptions with Grounded and Co-Referenced People
Figure 4 for Generating Descriptions with Grounded and Co-Referenced People
Viaarxiv icon

Grounding of Textual Phrases in Images by Reconstruction

Add code
Feb 17, 2017
Viaarxiv icon

Modeling Relationships in Referential Expressions with Compositional Modular Networks

Add code
Nov 30, 2016
Figure 1 for Modeling Relationships in Referential Expressions with Compositional Modular Networks
Figure 2 for Modeling Relationships in Referential Expressions with Compositional Modular Networks
Figure 3 for Modeling Relationships in Referential Expressions with Compositional Modular Networks
Figure 4 for Modeling Relationships in Referential Expressions with Compositional Modular Networks
Viaarxiv icon

Ask Your Neurons: A Deep Learning Approach to Visual Question Answering

Add code
Nov 24, 2016
Figure 1 for Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Figure 2 for Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Figure 3 for Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Figure 4 for Ask Your Neurons: A Deep Learning Approach to Visual Question Answering
Viaarxiv icon

Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding

Add code
Sep 24, 2016
Figure 1 for Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Figure 2 for Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Figure 3 for Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Figure 4 for Multimodal Compact Bilinear Pooling for Visual Question Answering and Visual Grounding
Viaarxiv icon

Utilizing Large Scale Vision and Text Datasets for Image Segmentation from Referring Expressions

Add code
Aug 30, 2016
Figure 1 for Utilizing Large Scale Vision and Text Datasets for Image Segmentation from Referring Expressions
Figure 2 for Utilizing Large Scale Vision and Text Datasets for Image Segmentation from Referring Expressions
Figure 3 for Utilizing Large Scale Vision and Text Datasets for Image Segmentation from Referring Expressions
Figure 4 for Utilizing Large Scale Vision and Text Datasets for Image Segmentation from Referring Expressions
Viaarxiv icon

Learning to Compose Neural Networks for Question Answering

Add code
Jun 07, 2016
Figure 1 for Learning to Compose Neural Networks for Question Answering
Figure 2 for Learning to Compose Neural Networks for Question Answering
Figure 3 for Learning to Compose Neural Networks for Question Answering
Figure 4 for Learning to Compose Neural Networks for Question Answering
Viaarxiv icon

Long-term Recurrent Convolutional Networks for Visual Recognition and Description

Add code
May 31, 2016
Figure 1 for Long-term Recurrent Convolutional Networks for Visual Recognition and Description
Figure 2 for Long-term Recurrent Convolutional Networks for Visual Recognition and Description
Figure 3 for Long-term Recurrent Convolutional Networks for Visual Recognition and Description
Figure 4 for Long-term Recurrent Convolutional Networks for Visual Recognition and Description
Viaarxiv icon