Dimosthenis Karatzas

Multimodal Transformer for Comics Text-Cloze

Mar 06, 2024
Emanuele Vivoli, Joan Lafuente Baeza, Ernest Valveny Llobet, Dimosthenis Karatzas

Privacy-Aware Document Visual Question Answering

Dec 15, 2023
Rubèn Tito, Khanh Nguyen, Marlon Tobaben, Raouf Kerkouche, Mohamed Ali Souibgui, Kangsoo Jung, Lei Kang, Ernest Valveny, Antti Honkela, Mario Fritz, Dimosthenis Karatzas

Understanding Video Scenes through Text: Insights from Text-based Video Question Answering

Sep 11, 2023
Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas, C. V. Jawahar

STEP -- Towards Structured Scene-Text Spotting

Sep 05, 2023
Sergi Garcia-Bordils, Dimosthenis Karatzas, Marçal Rusiñol

Reading Between the Lanes: Text VideoQA on the Road

Jul 08, 2023
George Tom, Minesh Mathew, Sergi Garcia, Dimosthenis Karatzas, C. V. Jawahar

ICDAR 2023 Competition on Structured Text Extraction from Visually-Rich Document Images

Jun 05, 2023
Wenwen Yu, Chengquan Zhang, Haoyu Cao, Wei Hua, Bohan Li, Huang Chen, Mingyu Liu, Mingrui Chen, Jianfeng Kuang, Mengjun Cheng, Yuning Du, Shikun Feng, Xiaoguang Hu, Pengyuan Lyu, Kun Yao, Yuechen Yu, Yuliang Liu, Wanxiang Che, Errui Ding, Cheng-Lin Liu, Jiebo Luo, Shuicheng Yan, Min Zhang, Dimosthenis Karatzas, Xing Sun, Jingdong Wang, Xiang Bai

ICDAR 2023 Competition on Reading the Seal Title

Apr 24, 2023
Wenwen Yu, Mingyu Liu, Mingrui Chen, Ning Lu, Yinlong Wen, Yuliang Liu, Dimosthenis Karatzas, Xiang Bai

ICDAR 2023 Video Text Reading Competition for Dense and Small Text

Apr 10, 2023
Weijia Wu, Yuzhong Zhao, Zhuang Li, Jiahong Li, Mike Zheng Shou, Umapada Pal, Dimosthenis Karatzas, Xiang Bai

DocILE Benchmark for Document Information Localization and Extraction

Feb 11, 2023
Štěpán Šimsa, Milan Šulc, Michal Uřičář, Yash Patel, Ahmed Hamdi, Matěj Kocián, Matyáš Skalický, Jiří Matas, Antoine Doucet, Mickaël Coustaty, Dimosthenis Karatzas
