
Marcus Rohrbach

TextCaps: a Dataset for Image Captioning with Reading Comprehension

Mar 24, 2020

Adversarial Continual Learning

Mar 21, 2020

In Defense of Grid Features for Visual Question Answering

Jan 10, 2020

Iterative Answer Prediction with Pointer-Augmented Multimodal Transformers for TextVQA

Dec 05, 2019

12-in-1: Multi-Task Vision and Language Representation Learning

Dec 05, 2019

Decoupling Representation and Classifier for Long-Tailed Recognition

Oct 21, 2019

Uncertainty-guided Continual Learning with Bayesian Neural Networks

Jun 06, 2019

Learning to Generate Grounded Image Captions without Localization Supervision

Jun 01, 2019

Towards VQA Models That Can Read

May 13, 2019

Drop an Octave: Reducing Spatial Redundancy in Convolutional Neural Networks with Octave Convolution

Apr 30, 2019