Alert button
Picture for Dimosthenis Karatzas

Dimosthenis Karatzas

Alert button

Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning

Oct 04, 2021
Ali Furkan Biten, Lluis Gomez, Dimosthenis Karatzas

Figure 1 for Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning
Figure 2 for Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning
Figure 3 for Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning
Figure 4 for Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning
Viaarxiv icon

Asking questions on handwritten document collections

Oct 02, 2021
Minesh Mathew, Lluis Gomez, Dimosthenis Karatzas, CV Jawahar

Figure 1 for Asking questions on handwritten document collections
Figure 2 for Asking questions on handwritten document collections
Figure 3 for Asking questions on handwritten document collections
Figure 4 for Asking questions on handwritten document collections
Viaarxiv icon

One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition

May 11, 2021
Mohamed Ali Souibgui, Ali Furkan Biten, Sounak Dey, Alicia Fornés, Yousri Kessentini, Lluis Gomez, Dimosthenis Karatzas, Josep Lladós

Figure 1 for One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition
Figure 2 for One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition
Figure 3 for One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition
Figure 4 for One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition
Viaarxiv icon

Document Collection Visual Question Answering

Apr 27, 2021
Rubèn Tito, Dimosthenis Karatzas, Ernest Valveny

Figure 1 for Document Collection Visual Question Answering
Figure 2 for Document Collection Visual Question Answering
Figure 3 for Document Collection Visual Question Answering
Figure 4 for Document Collection Visual Question Answering
Viaarxiv icon

InfographicVQA

Apr 26, 2021
Minesh Mathew, Viraj Bagal, Rubèn Pérez Tito, Dimosthenis Karatzas, Ernest Valveny, C. V Jawahar

Figure 1 for InfographicVQA
Figure 2 for InfographicVQA
Figure 3 for InfographicVQA
Figure 4 for InfographicVQA
Viaarxiv icon

ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction

Mar 18, 2021
Zheng Huang, Kai Chen, Jianhua He, Xiang Bai, Dimosthenis Karatzas, Shjian Lu, C. V. Jawahar

Figure 1 for ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction
Figure 2 for ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction
Figure 3 for ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction
Figure 4 for ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction
Viaarxiv icon

StacMR: Scene-Text Aware Cross-Modal Retrieval

Dec 08, 2020
Andrés Mafla, Rafael Sampaio de Rezende, Lluís Gómez, Diane Larlus, Dimosthenis Karatzas

Figure 1 for StacMR: Scene-Text Aware Cross-Modal Retrieval
Figure 2 for StacMR: Scene-Text Aware Cross-Modal Retrieval
Figure 3 for StacMR: Scene-Text Aware Cross-Modal Retrieval
Figure 4 for StacMR: Scene-Text Aware Cross-Modal Retrieval
Viaarxiv icon

Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval

Sep 21, 2020
Andres Mafla, Sounak Dey, Ali Furkan Biten, Lluis Gomez, Dimosthenis Karatzas

Figure 1 for Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
Figure 2 for Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
Figure 3 for Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
Figure 4 for Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
Viaarxiv icon

Document Visual Question Answering Challenge 2020

Aug 20, 2020
Minesh Mathew, Ruben Tito, Dimosthenis Karatzas, R. Manmatha, C. V. Jawahar

Figure 1 for Document Visual Question Answering Challenge 2020
Figure 2 for Document Visual Question Answering Challenge 2020
Figure 3 for Document Visual Question Answering Challenge 2020
Viaarxiv icon

Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation

Aug 11, 2020
Raul Gomez, Yahui Liu, Marco De Nadai, Dimosthenis Karatzas, Bruno Lepri, Nicu Sebe

Figure 1 for Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
Figure 2 for Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
Figure 3 for Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
Figure 4 for Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
Viaarxiv icon