Picture for Ernest Valveny

Ernest Valveny

Hierarchical multimodal transformers for Multi-Page DocVQA

Add code
Dec 07, 2022
Figure 1 for Hierarchical multimodal transformers for Multi-Page DocVQA
Figure 2 for Hierarchical multimodal transformers for Multi-Page DocVQA
Figure 3 for Hierarchical multimodal transformers for Multi-Page DocVQA
Figure 4 for Hierarchical multimodal transformers for Multi-Page DocVQA
Viaarxiv icon

OCR-IDL: OCR Annotations for Industry Document Library Dataset

Add code
Feb 25, 2022
Figure 1 for OCR-IDL: OCR Annotations for Industry Document Library Dataset
Figure 2 for OCR-IDL: OCR Annotations for Industry Document Library Dataset
Figure 3 for OCR-IDL: OCR Annotations for Industry Document Library Dataset
Figure 4 for OCR-IDL: OCR Annotations for Industry Document Library Dataset
Viaarxiv icon

ICDAR 2021 Competition on Document VisualQuestion Answering

Add code
Nov 10, 2021
Figure 1 for ICDAR 2021 Competition on Document VisualQuestion Answering
Figure 2 for ICDAR 2021 Competition on Document VisualQuestion Answering
Figure 3 for ICDAR 2021 Competition on Document VisualQuestion Answering
Figure 4 for ICDAR 2021 Competition on Document VisualQuestion Answering
Viaarxiv icon

External Knowledge Augmented Text Visual Question Answering

Add code
Aug 22, 2021
Figure 1 for External Knowledge Augmented Text Visual Question Answering
Figure 2 for External Knowledge Augmented Text Visual Question Answering
Figure 3 for External Knowledge Augmented Text Visual Question Answering
Figure 4 for External Knowledge Augmented Text Visual Question Answering
Viaarxiv icon

Document Collection Visual Question Answering

Add code
Apr 27, 2021
Figure 1 for Document Collection Visual Question Answering
Figure 2 for Document Collection Visual Question Answering
Figure 3 for Document Collection Visual Question Answering
Figure 4 for Document Collection Visual Question Answering
Viaarxiv icon

InfographicVQA

Add code
Apr 26, 2021
Figure 1 for InfographicVQA
Figure 2 for InfographicVQA
Figure 3 for InfographicVQA
Figure 4 for InfographicVQA
Viaarxiv icon

Multimodal grid features and cell pointers for Scene Text Visual Question Answering

Add code
Jun 25, 2020
Figure 1 for Multimodal grid features and cell pointers for Scene Text Visual Question Answering
Figure 2 for Multimodal grid features and cell pointers for Scene Text Visual Question Answering
Figure 3 for Multimodal grid features and cell pointers for Scene Text Visual Question Answering
Figure 4 for Multimodal grid features and cell pointers for Scene Text Visual Question Answering
Viaarxiv icon

ICDAR 2019 Competition on Scene Text Visual Question Answering

Add code
Jun 30, 2019
Figure 1 for ICDAR 2019 Competition on Scene Text Visual Question Answering
Figure 2 for ICDAR 2019 Competition on Scene Text Visual Question Answering
Figure 3 for ICDAR 2019 Competition on Scene Text Visual Question Answering
Figure 4 for ICDAR 2019 Competition on Scene Text Visual Question Answering
Viaarxiv icon

Scene Text Visual Question Answering

Add code
May 31, 2019
Figure 1 for Scene Text Visual Question Answering
Figure 2 for Scene Text Visual Question Answering
Figure 3 for Scene Text Visual Question Answering
Figure 4 for Scene Text Visual Question Answering
Viaarxiv icon

Beyond Visual Semantics: Exploring the Role of Scene Text in Image Understanding

Add code
May 25, 2019
Figure 1 for Beyond Visual Semantics: Exploring the Role of Scene Text in Image Understanding
Figure 2 for Beyond Visual Semantics: Exploring the Role of Scene Text in Image Understanding
Figure 3 for Beyond Visual Semantics: Exploring the Role of Scene Text in Image Understanding
Figure 4 for Beyond Visual Semantics: Exploring the Role of Scene Text in Image Understanding
Viaarxiv icon