Alert button
Picture for Dimosthenis Karatzas

Dimosthenis Karatzas

Alert button

Privacy-Aware Document Visual Question Answering

Dec 15, 2023
Rubèn Tito, Khanh Nguyen, Marlon Tobaben, Raouf Kerkouche, Mohamed Ali Souibgui, Kangsoo Jung, Lei Kang, Ernest Valveny, Antti Honkela, Mario Fritz, Dimosthenis Karatzas

Viaarxiv icon

Understanding Video Scenes through Text: Insights from Text-based Video Question Answering

Sep 11, 2023
Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar

Figure 1 for Understanding Video Scenes through Text: Insights from Text-based Video Question Answering
Figure 2 for Understanding Video Scenes through Text: Insights from Text-based Video Question Answering
Figure 3 for Understanding Video Scenes through Text: Insights from Text-based Video Question Answering
Figure 4 for Understanding Video Scenes through Text: Insights from Text-based Video Question Answering
Viaarxiv icon

STEP -- Towards Structured Scene-Text Spotting

Sep 05, 2023
Sergi Garcia-Bordils, Dimosthenis Karatzas, Marçal Rusiñol

Figure 1 for STEP -- Towards Structured Scene-Text Spotting
Figure 2 for STEP -- Towards Structured Scene-Text Spotting
Figure 3 for STEP -- Towards Structured Scene-Text Spotting
Figure 4 for STEP -- Towards Structured Scene-Text Spotting
Viaarxiv icon

Reading Between the Lanes: Text VideoQA on the Road

Jul 08, 2023
George Tom, Minesh Mathew, Sergi Garcia, Dimosthenis Karatzas, C. V. Jawahar

Figure 1 for Reading Between the Lanes: Text VideoQA on the Road
Figure 2 for Reading Between the Lanes: Text VideoQA on the Road
Figure 3 for Reading Between the Lanes: Text VideoQA on the Road
Figure 4 for Reading Between the Lanes: Text VideoQA on the Road
Viaarxiv icon

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images

Jun 05, 2023
Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, Mingyu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang, Xiang Bai

Figure 1 for ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images
Figure 2 for ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images
Figure 3 for ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images
Figure 4 for ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images
Viaarxiv icon

ICDAR 2023 Competition on Reading the Seal Title

Apr 24, 2023
Wenwen Yu, Mingyu Liu, Mingrui Chen, Ning Lu, Yinlong Wen, Yuliang Liu, Dimosthenis Karatzas, Xiang Bai

Figure 1 for ICDAR 2023 Competition on Reading the Seal Title
Figure 2 for ICDAR 2023 Competition on Reading the Seal Title
Figure 3 for ICDAR 2023 Competition on Reading the Seal Title
Figure 4 for ICDAR 2023 Competition on Reading the Seal Title
Viaarxiv icon

ICDAR 2023 Video Text Reading Competition for Dense and Small Text

Apr 10, 2023
Weijia Wu, Yuzhong Zhao, Zhuang Li, Jiahong Li, Mike Zheng Shou, Umapada Pal, Dimosthenis Karatzas, Xiang Bai

Figure 1 for ICDAR 2023 Video Text Reading Competition for Dense and Small Text
Figure 2 for ICDAR 2023 Video Text Reading Competition for Dense and Small Text
Figure 3 for ICDAR 2023 Video Text Reading Competition for Dense and Small Text
Figure 4 for ICDAR 2023 Video Text Reading Competition for Dense and Small Text
Viaarxiv icon

DocILE Benchmark for Document Information Localization and Extraction

Feb 11, 2023
Štěpán Šimsa, Milan Šulc, Michal Uřičář, Yash Patel, Ahmed Hamdi, Matěj Kocián, Matyáš Skalický, Jiří Matas, Antoine Doucet, Mickaël Coustaty, Dimosthenis Karatzas

Figure 1 for DocILE Benchmark for Document Information Localization and Extraction
Figure 2 for DocILE Benchmark for Document Information Localization and Extraction
Figure 3 for DocILE Benchmark for Document Information Localization and Extraction
Figure 4 for DocILE Benchmark for Document Information Localization and Extraction
Viaarxiv icon

Hierarchical multimodal transformers for Multi-Page DocVQA

Dec 07, 2022
Rubèn Tito, Dimosthenis Karatzas, Ernest Valveny

Figure 1 for Hierarchical multimodal transformers for Multi-Page DocVQA
Figure 2 for Hierarchical multimodal transformers for Multi-Page DocVQA
Figure 3 for Hierarchical multimodal transformers for Multi-Page DocVQA
Figure 4 for Hierarchical multimodal transformers for Multi-Page DocVQA
Viaarxiv icon