Picture for Dimosthenis Karatzas

Dimosthenis Karatzas

InfographicVQA

Add code
Apr 26, 2021
Figure 1 for InfographicVQA
Figure 2 for InfographicVQA
Figure 3 for InfographicVQA
Figure 4 for InfographicVQA
Viaarxiv icon

ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction

Add code
Mar 18, 2021
Figure 1 for ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction
Figure 2 for ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction
Figure 3 for ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction
Figure 4 for ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction
Viaarxiv icon

StacMR: Scene-Text Aware Cross-Modal Retrieval

Add code
Dec 08, 2020
Figure 1 for StacMR: Scene-Text Aware Cross-Modal Retrieval
Figure 2 for StacMR: Scene-Text Aware Cross-Modal Retrieval
Figure 3 for StacMR: Scene-Text Aware Cross-Modal Retrieval
Figure 4 for StacMR: Scene-Text Aware Cross-Modal Retrieval
Viaarxiv icon

Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval

Add code
Sep 21, 2020
Figure 1 for Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
Figure 2 for Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
Figure 3 for Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
Figure 4 for Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
Viaarxiv icon

Document Visual Question Answering Challenge 2020

Add code
Aug 20, 2020
Figure 1 for Document Visual Question Answering Challenge 2020
Figure 2 for Document Visual Question Answering Challenge 2020
Figure 3 for Document Visual Question Answering Challenge 2020
Viaarxiv icon

Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation

Add code
Aug 11, 2020
Figure 1 for Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
Figure 2 for Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
Figure 3 for Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
Figure 4 for Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
Viaarxiv icon

Text Recognition -- Real World Data and Where to Find Them

Add code
Jul 17, 2020
Figure 1 for Text Recognition -- Real World Data and Where to Find Them
Figure 2 for Text Recognition -- Real World Data and Where to Find Them
Figure 3 for Text Recognition -- Real World Data and Where to Find Them
Figure 4 for Text Recognition -- Real World Data and Where to Find Them
Viaarxiv icon

Location Sensitive Image Retrieval and Tagging

Add code
Jul 07, 2020
Figure 1 for Location Sensitive Image Retrieval and Tagging
Figure 2 for Location Sensitive Image Retrieval and Tagging
Figure 3 for Location Sensitive Image Retrieval and Tagging
Figure 4 for Location Sensitive Image Retrieval and Tagging
Viaarxiv icon

DocVQA: A Dataset for VQA on Document Images

Add code
Jul 01, 2020
Figure 1 for DocVQA: A Dataset for VQA on Document Images
Figure 2 for DocVQA: A Dataset for VQA on Document Images
Figure 3 for DocVQA: A Dataset for VQA on Document Images
Figure 4 for DocVQA: A Dataset for VQA on Document Images
Viaarxiv icon

Multimodal grid features and cell pointers for Scene Text Visual Question Answering

Add code
Jun 25, 2020
Figure 1 for Multimodal grid features and cell pointers for Scene Text Visual Question Answering
Figure 2 for Multimodal grid features and cell pointers for Scene Text Visual Question Answering
Figure 3 for Multimodal grid features and cell pointers for Scene Text Visual Question Answering
Figure 4 for Multimodal grid features and cell pointers for Scene Text Visual Question Answering
Viaarxiv icon