Picture for Dimosthenis Karatzas

Dimosthenis Karatzas

CoMix: A Comprehensive Benchmark for Multi-Task Comic Understanding

Add code
Jul 04, 2024
Viaarxiv icon

Comics Datasets Framework: Mix of Comics datasets for detection benchmarking

Add code
Jul 03, 2024
Viaarxiv icon

Federated Document Visual Question Answering: A Pilot Study

Add code
May 10, 2024
Figure 1 for Federated Document Visual Question Answering: A Pilot Study
Figure 2 for Federated Document Visual Question Answering: A Pilot Study
Figure 3 for Federated Document Visual Question Answering: A Pilot Study
Figure 4 for Federated Document Visual Question Answering: A Pilot Study
Viaarxiv icon

Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism

Add code
Apr 29, 2024
Figure 1 for Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism
Figure 2 for Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism
Figure 3 for Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism
Figure 4 for Multi-Page Document Visual Question Answering using Self-Attention Scoring Mechanism
Viaarxiv icon

Machine Unlearning for Document Classification

Add code
Apr 29, 2024
Viaarxiv icon

Multimodal Transformer for Comics Text-Cloze

Add code
Mar 06, 2024
Figure 1 for Multimodal Transformer for Comics Text-Cloze
Figure 2 for Multimodal Transformer for Comics Text-Cloze
Figure 3 for Multimodal Transformer for Comics Text-Cloze
Figure 4 for Multimodal Transformer for Comics Text-Cloze
Viaarxiv icon

Privacy-Aware Document Visual Question Answering

Add code
Dec 15, 2023
Viaarxiv icon

Understanding Video Scenes through Text: Insights from Text-based Video Question Answering

Add code
Sep 11, 2023
Figure 1 for Understanding Video Scenes through Text: Insights from Text-based Video Question Answering
Figure 2 for Understanding Video Scenes through Text: Insights from Text-based Video Question Answering
Figure 3 for Understanding Video Scenes through Text: Insights from Text-based Video Question Answering
Figure 4 for Understanding Video Scenes through Text: Insights from Text-based Video Question Answering
Viaarxiv icon

STEP -- Towards Structured Scene-Text Spotting

Add code
Sep 05, 2023
Figure 1 for STEP -- Towards Structured Scene-Text Spotting
Figure 2 for STEP -- Towards Structured Scene-Text Spotting
Figure 3 for STEP -- Towards Structured Scene-Text Spotting
Figure 4 for STEP -- Towards Structured Scene-Text Spotting
Viaarxiv icon

Reading Between the Lanes: Text VideoQA on the Road

Add code
Jul 08, 2023
Figure 1 for Reading Between the Lanes: Text VideoQA on the Road
Figure 2 for Reading Between the Lanes: Text VideoQA on the Road
Figure 3 for Reading Between the Lanes: Text VideoQA on the Road
Figure 4 for Reading Between the Lanes: Text VideoQA on the Road
Viaarxiv icon