Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Dimosthenis Karatzas

Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching


Oct 06, 2021
Ali Furkan Biten, Andres Mafla, Lluis Gomez, Dimosthenis Karatzas

* Accepted WACV 2022 

  Access Paper or Ask Questions

Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning


Oct 04, 2021
Ali Furkan Biten, Lluis Gomez, Dimosthenis Karatzas

* Accepted to WACV 2022 

  Access Paper or Ask Questions

Asking questions on handwritten document collections


Oct 02, 2021
Minesh Mathew, Lluis Gomez, Dimosthenis Karatzas, CV Jawahar

* journal = {Int. J. Document Anal. Recognit.}, volume = {24}, number = {3}, pages = {235--249}, year = {2021} 
* pre-print version 

  Access Paper or Ask Questions

One-shot Compositional Data Generation for Low Resource Handwritten Text Recognition


May 11, 2021
Mohamed Ali Souibgui, Ali Furkan Biten, Sounak Dey, Alicia Fornés, Yousri Kessentini, Lluis Gomez, Dimosthenis Karatzas, Josep Lladós


  Access Paper or Ask Questions

Document Collection Visual Question Answering


Apr 27, 2021
Rubèn Tito, Dimosthenis Karatzas, Ernest Valveny


  Access Paper or Ask Questions

InfographicVQA


Apr 26, 2021
Minesh Mathew, Viraj Bagal, Rubèn Pérez Tito, Dimosthenis Karatzas, Ernest Valveny, C. V Jawahar


  Access Paper or Ask Questions

ICDAR2019 Competition on Scanned Receipt OCR and Information Extraction


Mar 18, 2021
Zheng Huang, Kai Chen, Jianhua He, Xiang Bai, Dimosthenis Karatzas, Shjian Lu, C. V. Jawahar


  Access Paper or Ask Questions

StacMR: Scene-Text Aware Cross-Modal Retrieval


Dec 08, 2020
Andrés Mafla, Rafael Sampaio de Rezende, Lluís Gómez, Diane Larlus, Dimosthenis Karatzas


  Access Paper or Ask Questions

Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval


Sep 21, 2020
Andres Mafla, Sounak Dey, Ali Furkan Biten, Lluis Gomez, Dimosthenis Karatzas


  Access Paper or Ask Questions

Document Visual Question Answering Challenge 2020


Aug 20, 2020
Minesh Mathew, Ruben Tito, Dimosthenis Karatzas, R. Manmatha, C. V. Jawahar

* to be published as a short paper in DAS 2020 

  Access Paper or Ask Questions

Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation


Aug 11, 2020
Raul Gomez, Yahui Liu, Marco De Nadai, Dimosthenis Karatzas, Bruno Lepri, Nicu Sebe

* Submitted to ACM MM '20, October 12-16, 2020, Seattle, WA, USA 

  Access Paper or Ask Questions

Text Recognition -- Real World Data and Where to Find Them


Jul 17, 2020
Klára Janoušková, Jiri Matas, Lluis Gomez, Dimosthenis Karatzas

* 10 pages 

  Access Paper or Ask Questions

Location Sensitive Image Retrieval and Tagging


Jul 07, 2020
Raul Gomez, Jaume Gibert, Lluis Gomez, Dimosthenis Karatzas

* ECCV 2020 

  Access Paper or Ask Questions

DocVQA: A Dataset for VQA on Document Images


Jul 01, 2020
Minesh Mathew, Dimosthenis Karatzas, R. Manmatha, C. V. Jawahar


  Access Paper or Ask Questions

Multimodal grid features and cell pointers for Scene Text Visual Question Answering


Jun 25, 2020
Lluís Gómez, Ali Furkan Biten, Rubèn Tito, Andrés Mafla, Marçal Rusiñol, Ernest Valveny, Dimosthenis Karatzas

* This paper is under consideration at Pattern Recognition Letters 

  Access Paper or Ask Questions

Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features


Jan 14, 2020
Andres Mafla, Sounak Dey, Ali Furkan Biten, Lluis Gomez, Dimosthenis Karatzas

* Winter Conference on Applications of Computer Vision (WACV 2020) Accepted paper 

  Access Paper or Ask Questions

ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard


Dec 20, 2019
Xi Liu, Rui Zhang, Yongsheng Zhou, Qianyi Jiang, Qi Song, Nan Li, Kai Zhou, Lei Wang, Dong Wang, Minghui Liao, Mingkun Yang, Xiang Bai, Baoguang Shi, Dimosthenis Karatzas, Shijian Lu, C. V. Jawahar

* International Conference on Document Analysis and Recognition, 2019 

  Access Paper or Ask Questions

Exploring Hate Speech Detection in Multimodal Publications


Oct 09, 2019
Raul Gomez, Jaume Gibert, Lluis Gomez, Dimosthenis Karatzas


  Access Paper or Ask Questions

ICDAR 2019 Competition on Large-scale Street View Text with Partial Labeling -- RRC-LSVT


Sep 17, 2019
Yipeng Sun, Zihan Ni, Chee-Kheng Chng, Yuliang Liu, Canjie Luo, Chun Chet Ng, Junyu Han, Errui Ding, Jingtuo Liu, Dimosthenis Karatzas, Chee Seng Chan, Lianwen Jin

* ICDAR 2019 Robust Reading Challenge in IAPR International Conference on Document Analysis and Recognition (ICDAR) 

  Access Paper or Ask Questions

ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT)


Sep 16, 2019
Chee-Kheng Chng, Yuliang Liu, Yipeng Sun, Chun Chet Ng, Canjie Luo, Zihan Ni, ChuanMing Fang, Shuaitao Zhang, Junyu Han, Errui Ding, Jingtuo Liu, Dimosthenis Karatzas, Chee Seng Chan, Lianwen Jin

* Technical report of ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT) Competition 

  Access Paper or Ask Questions

ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition -- RRC-MLT-2019


Jul 01, 2019
Nibal Nayef, Yash Patel, Michal Busta, Pinaki Nath Chowdhury, Dimosthenis Karatzas, Wafa Khlif, Jiri Matas, Umapada Pal, Jean-Christophe Burie, Cheng-lin Liu, Jean-Marc Ogier

* ICDAR'19 camera-ready version. Competition available at https://rrc.cvc.uab.es/?ch=15. The first two authors contributed equally 

  Access Paper or Ask Questions

ICDAR 2019 Competition on Scene Text Visual Question Answering


Jun 30, 2019
Ali Furkan Biten, Rubèn Tito, Andres Mafla, Lluis Gomez, Marçal Rusiñol, Minesh Mathew, C. V. Jawahar, Ernest Valveny, Dimosthenis Karatzas

* 15th International Conference on Document Analysis and Recognition (ICDAR 2019) 

  Access Paper or Ask Questions

Selective Style Transfer for Text


Jun 04, 2019
Raul Gomez, Ali Furkan Biten, Lluis Gomez, Jaume Gibert, Marçal Rusiñol, Dimosthenis Karatzas

* Accepted in ICDAR 2019 

  Access Paper or Ask Questions

Scene Text Visual Question Answering


May 31, 2019
Ali Furkan Biten, Ruben Tito, Andres Mafla, Lluis Gomez, Marçal Rusiñol, Ernest Valveny, C. V. Jawahar, Dimosthenis Karatzas


  Access Paper or Ask Questions

Good News, Everyone! Context driven entity-aware captioning for news images


Apr 02, 2019
Ali Furkan Biten, Lluis Gomez, Marçal Rusiñol, Dimosthenis Karatzas

* IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2019) 

  Access Paper or Ask Questions

Self-Supervised Visual Representations for Cross-Modal Retrieval


Jan 31, 2019
Yash Patel, Lluis Gomez, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar

* arXiv admin note: text overlap with arXiv:1807.02110 

  Access Paper or Ask Questions

Self-Supervised Learning from Web Data for Multimodal Retrieval


Jan 07, 2019
Raul Gomez, Lluis Gomez, Jaume Gibert, Dimosthenis Karatzas

* Submitted to Multi-Modal Scene Understanding. arXiv admin note: substantial text overlap with arXiv:1808.06368 

  Access Paper or Ask Questions

Soft-PHOC Descriptor for End-to-End Word Spotting in Egocentric Scene Images


Sep 04, 2018
Dena Bazazian, Dimosthenis Karatzas, Andrew D. Bagdanov

* 4 pages, 3 figures, The Third International Workshop on Egocentric Perception, Interaction and Computing (EPIC) at ECCV2018 

  Access Paper or Ask Questions