Dimosthenis Karatzas

Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia

Sep 21, 2022

MUST-VQA: MUltilingual Scene-text VQA

Sep 14, 2022

Out-of-Vocabulary Challenge Report

Sep 14, 2022

Text-DIAE: Degradation Invariant Autoencoders for Text Recognition and Document Enhancement

Mar 16, 2022

OCR-IDL: OCR Annotations for Industry Document Library Dataset

Feb 25, 2022

ICDAR 2021 Competition on Document Visual Question Answering

Nov 10, 2021

Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching

Oct 06, 2021

Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning

Oct 04, 2021

Asking questions on handwritten document collections

Oct 02, 2021

One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition

May 11, 2021