Dimosthenis Karatzas

Hierarchical multimodal transformers for Multi-Page DocVQA

Dec 07, 2022
Rubèn Tito, Dimosthenis Karatzas, Ernest Valveny

Watching the News: Towards VideoQA Models that can Read

Nov 10, 2022
Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar

Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia

Sep 21, 2022
Khanh Nguyen, Ali Furkan Biten, Andres Mafla, Lluis Gomez, Dimosthenis Karatzas

MUST-VQA: MUltilingual Scene-text VQA

Sep 14, 2022
Emanuele Vivoli, Ali Furkan Biten, Andres Mafla, Dimosthenis Karatzas, Lluis Gomez

Out-of-Vocabulary Challenge Report

Sep 14, 2022
Sergi Garcia-Bordils, Andrés Mafla, Ali Furkan Biten, Oren Nuriel, Aviad Aberdam, Shai Mazor, Ron Litman, Dimosthenis Karatzas

Text-DIAE: Degradation Invariant Autoencoders for Text Recognition and Document Enhancement

Mar 16, 2022
Mohamed Ali Souibgui, Sanket Biswas, Andres Mafla, Ali Furkan Biten, Alicia Fornés, Yousri Kessentini, Josep Lladós, Lluis Gomez, Dimosthenis Karatzas

OCR-IDL: OCR Annotations for Industry Document Library Dataset

Feb 25, 2022
Ali Furkan Biten, Rubèn Tito, Lluis Gomez, Ernest Valveny, Dimosthenis Karatzas

ICDAR 2021 Competition on Document Visual Question Answering

Nov 10, 2021
Rubèn Tito, Minesh Mathew, C. V. Jawahar, Ernest Valveny, Dimosthenis Karatzas

Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching

Oct 06, 2021
Ali Furkan Biten, Andres Mafla, Lluis Gomez, Dimosthenis Karatzas
