Picture for Dimosthenis Karatzas

Dimosthenis Karatzas

Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval

Add code
Sep 21, 2020
Figure 1 for Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
Figure 2 for Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
Figure 3 for Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
Figure 4 for Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
Viaarxiv icon

Document Visual Question Answering Challenge 2020

Add code
Aug 20, 2020
Figure 1 for Document Visual Question Answering Challenge 2020
Figure 2 for Document Visual Question Answering Challenge 2020
Figure 3 for Document Visual Question Answering Challenge 2020
Viaarxiv icon

Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation

Add code
Aug 11, 2020
Figure 1 for Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
Figure 2 for Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
Figure 3 for Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
Figure 4 for Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
Viaarxiv icon

Text Recognition -- Real World Data and Where to Find Them

Add code
Jul 17, 2020
Figure 1 for Text Recognition -- Real World Data and Where to Find Them
Figure 2 for Text Recognition -- Real World Data and Where to Find Them
Figure 3 for Text Recognition -- Real World Data and Where to Find Them
Figure 4 for Text Recognition -- Real World Data and Where to Find Them
Viaarxiv icon

Location Sensitive Image Retrieval and Tagging

Add code
Jul 07, 2020
Figure 1 for Location Sensitive Image Retrieval and Tagging
Figure 2 for Location Sensitive Image Retrieval and Tagging
Figure 3 for Location Sensitive Image Retrieval and Tagging
Figure 4 for Location Sensitive Image Retrieval and Tagging
Viaarxiv icon

DocVQA: A Dataset for VQA on Document Images

Add code
Jul 01, 2020
Figure 1 for DocVQA: A Dataset for VQA on Document Images
Figure 2 for DocVQA: A Dataset for VQA on Document Images
Figure 3 for DocVQA: A Dataset for VQA on Document Images
Figure 4 for DocVQA: A Dataset for VQA on Document Images
Viaarxiv icon

Multimodal grid features and cell pointers for Scene Text Visual Question Answering

Add code
Jun 25, 2020
Figure 1 for Multimodal grid features and cell pointers for Scene Text Visual Question Answering
Figure 2 for Multimodal grid features and cell pointers for Scene Text Visual Question Answering
Figure 3 for Multimodal grid features and cell pointers for Scene Text Visual Question Answering
Figure 4 for Multimodal grid features and cell pointers for Scene Text Visual Question Answering
Viaarxiv icon

Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features

Add code
Jan 14, 2020
Figure 1 for Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
Figure 2 for Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
Figure 3 for Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
Figure 4 for Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
Viaarxiv icon

ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard

Add code
Dec 20, 2019
Figure 1 for ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard
Figure 2 for ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard
Figure 3 for ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard
Figure 4 for ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard
Viaarxiv icon

Exploring Hate Speech Detection in Multimodal Publications

Add code
Oct 09, 2019
Figure 1 for Exploring Hate Speech Detection in Multimodal Publications
Figure 2 for Exploring Hate Speech Detection in Multimodal Publications
Figure 3 for Exploring Hate Speech Detection in Multimodal Publications
Figure 4 for Exploring Hate Speech Detection in Multimodal Publications
Viaarxiv icon