Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval

Sep 21, 2020
Andres Mafla, Sounak Dey, Ali Furkan Biten, Lluis Gomez, Dimosthenis Karatzas


  Access Paper or Ask Questions

Document Visual Question Answering Challenge 2020

Aug 20, 2020
Minesh Mathew, Ruben Tito, Dimosthenis Karatzas, R. Manmatha, C. V. Jawahar

* to be published as a short paper in DAS 2020 

  Access Paper or Ask Questions

Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation

Aug 11, 2020
Raul Gomez, Yahui Liu, Marco De Nadai, Dimosthenis Karatzas, Bruno Lepri, Nicu Sebe

* Submitted to ACM MM '20, October 12-16, 2020, Seattle, WA, USA 

  Access Paper or Ask Questions

Text Recognition -- Real World Data and Where to Find Them

Jul 17, 2020
Klára Janoušková, Jiri Matas, Lluis Gomez, Dimosthenis Karatzas

* 10 pages 

  Access Paper or Ask Questions

Location Sensitive Image Retrieval and Tagging

Jul 07, 2020
Raul Gomez, Jaume Gibert, Lluis Gomez, Dimosthenis Karatzas

* ECCV 2020 

  Access Paper or Ask Questions

DocVQA: A Dataset for VQA on Document Images

Jul 01, 2020
Minesh Mathew, Dimosthenis Karatzas, R. Manmatha, C. V. Jawahar


  Access Paper or Ask Questions

Multimodal grid features and cell pointers for Scene Text Visual Question Answering

Jun 25, 2020
Lluís Gómez, Ali Furkan Biten, Rubèn Tito, Andrés Mafla, Marçal Rusiñol, Ernest Valveny, Dimosthenis Karatzas

* This paper is under consideration at Pattern Recognition Letters 

  Access Paper or Ask Questions

Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features

Jan 14, 2020
Andres Mafla, Sounak Dey, Ali Furkan Biten, Lluis Gomez, Dimosthenis Karatzas

* Winter Conference on Applications of Computer Vision (WACV 2020) Accepted paper 

  Access Paper or Ask Questions

ICDAR 2019 Robust Reading Challenge on Reading Chinese Text on Signboard

Dec 20, 2019
Xi Liu, Rui Zhang, Yongsheng Zhou, Qianyi Jiang, Qi Song, Nan Li, Kai Zhou, Lei Wang, Dong Wang, Minghui Liao, Mingkun Yang, Xiang Bai, Baoguang Shi, Dimosthenis Karatzas, Shijian Lu, C. V. Jawahar

* International Conference on Document Analysis and Recognition, 2019 

  Access Paper or Ask Questions

Exploring Hate Speech Detection in Multimodal Publications

Oct 09, 2019
Raul Gomez, Jaume Gibert, Lluis Gomez, Dimosthenis Karatzas


  Access Paper or Ask Questions

ICDAR 2019 Competition on Large-scale Street View Text with Partial Labeling -- RRC-LSVT

Sep 17, 2019
Yipeng Sun, Zihan Ni, Chee-Kheng Chng, Yuliang Liu, Canjie Luo, Chun Chet Ng, Junyu Han, Errui Ding, Jingtuo Liu, Dimosthenis Karatzas, Chee Seng Chan, Lianwen Jin

* ICDAR 2019 Robust Reading Challenge in IAPR International Conference on Document Analysis and Recognition (ICDAR) 

  Access Paper or Ask Questions

ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT)

Sep 16, 2019
Chee-Kheng Chng, Yuliang Liu, Yipeng Sun, Chun Chet Ng, Canjie Luo, Zihan Ni, ChuanMing Fang, Shuaitao Zhang, Junyu Han, Errui Ding, Jingtuo Liu, Dimosthenis Karatzas, Chee Seng Chan, Lianwen Jin

* Technical report of ICDAR2019 Robust Reading Challenge on Arbitrary-Shaped Text (RRC-ArT) Competition 

  Access Paper or Ask Questions

ICDAR2019 Robust Reading Challenge on Multi-lingual Scene Text Detection and Recognition -- RRC-MLT-2019

Jul 01, 2019
Nibal Nayef, Yash Patel, Michal Busta, Pinaki Nath Chowdhury, Dimosthenis Karatzas, Wafa Khlif, Jiri Matas, Umapada Pal, Jean-Christophe Burie, Cheng-lin Liu, Jean-Marc Ogier

* ICDAR'19 camera-ready version. Competition available at https://rrc.cvc.uab.es/?ch=15. The first two authors contributed equally 

  Access Paper or Ask Questions

ICDAR 2019 Competition on Scene Text Visual Question Answering

Jun 30, 2019
Ali Furkan Biten, Rubèn Tito, Andres Mafla, Lluis Gomez, Marçal Rusiñol, Minesh Mathew, C. V. Jawahar, Ernest Valveny, Dimosthenis Karatzas

* 15th International Conference on Document Analysis and Recognition (ICDAR 2019) 

  Access Paper or Ask Questions

Selective Style Transfer for Text

Jun 04, 2019
Raul Gomez, Ali Furkan Biten, Lluis Gomez, Jaume Gibert, Marçal Rusiñol, Dimosthenis Karatzas

* Accepted in ICDAR 2019 

  Access Paper or Ask Questions

Scene Text Visual Question Answering

May 31, 2019
Ali Furkan Biten, Ruben Tito, Andres Mafla, Lluis Gomez, Marçal Rusiñol, Ernest Valveny, C. V. Jawahar, Dimosthenis Karatzas


  Access Paper or Ask Questions

Good News, Everyone! Context driven entity-aware captioning for news images

Apr 02, 2019
Ali Furkan Biten, Lluis Gomez, Marçal Rusiñol, Dimosthenis Karatzas

* IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2019) 

  Access Paper or Ask Questions

Self-Supervised Visual Representations for Cross-Modal Retrieval

Jan 31, 2019
Yash Patel, Lluis Gomez, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar

* arXiv admin note: text overlap with arXiv:1807.02110 

  Access Paper or Ask Questions

Self-Supervised Learning from Web Data for Multimodal Retrieval

Jan 07, 2019
Raul Gomez, Lluis Gomez, Jaume Gibert, Dimosthenis Karatzas

* Submitted to Multi-Modal Scene Understanding. arXiv admin note: substantial text overlap with arXiv:1808.06368 

  Access Paper or Ask Questions

Soft-PHOC Descriptor for End-to-End Word Spotting in Egocentric Scene Images

Sep 04, 2018
Dena Bazazian, Dimosthenis Karatzas, Andrew D. Bagdanov

* 4 pages, 3 figures, The Third International Workshop on Egocentric Perception, Interaction and Computing (EPIC) at ECCV2018 

  Access Paper or Ask Questions

Single Shot Scene Text Retrieval

Aug 27, 2018
Lluís Gómez, Andrés Mafla, Marçal Rusiñol, Dimosthenis Karatzas

* ECCV 2018 

  Access Paper or Ask Questions

Learning from #Barcelona Instagram data what Locals and Tourists post about its Neighbourhoods

Aug 20, 2018
Raul Gomez, Lluis Gomez, Jaume Gibert, Dimosthenis Karatzas

* ECCV MULA Workshop 2018 

  Access Paper or Ask Questions

Learning to Learn from Web Data through Deep Semantic Embeddings

Aug 20, 2018
Raul Gomez, Lluis Gomez, Jaume Gibert, Dimosthenis Karatzas

* ECCV MULA Workshop 2018 

  Access Paper or Ask Questions

TextTopicNet - Self-Supervised Learning of Visual Features Through Embedding Images on Semantic Text Spaces

Jul 04, 2018
Yash Patel, Lluis Gomez, Raul Gomez, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar

* arXiv admin note: text overlap with arXiv:1705.08631 

  Access Paper or Ask Questions

Non-deterministic Behavior of Ranking-based Metrics when Evaluating Embeddings

Jun 19, 2018
Anguelos Nicolaou, Sounak Dey, Vincent Christlein, Andreas Maier, Dimosthenis Karatzas


  Access Paper or Ask Questions

The Robust Reading Competition Annotation and Evaluation Platform

May 21, 2018
Dimosthenis Karatzas, Lluis Gómez, Anguelos Nicolaou, Marçal Rusiñol

* Proc. of the 13th IAPR Int. W. on Document Analysis Systems (DAS 2018), IEEE CPS, pp. 61-66, 2018 
* 6 pages, accepted to DAS 2018 

  Access Paper or Ask Questions

Self-supervised learning of visual features through embedding images into text topic spaces

May 24, 2017
Lluis Gomez, Yash Patel, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar

* Accepted CVPR 2017 paper 

  Access Paper or Ask Questions

Improving Text Proposals for Scene Images with Fully Convolutional Networks

Feb 16, 2017
Dena Bazazian, Raul Gomez, Anguelos Nicolaou, Lluis Gomez, Dimosthenis Karatzas, Andrew D. Bagdanov

* 6 pages, 8 figures, International Conference on Pattern Recognition (ICPR) - DLPR (Deep Learning for Pattern Recognition) workshop 

  Access Paper or Ask Questions