Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:CSTR: A Classification Perspective on Scene Text Recognition

Feb 22, 2021

Hongxiang Cai, Jun Sun, Yichao Xiong

Figure 1 for CSTR: A Classification Perspective on Scene Text Recognition

Figure 2 for CSTR: A Classification Perspective on Scene Text Recognition

Figure 3 for CSTR: A Classification Perspective on Scene Text Recognition

Figure 4 for CSTR: A Classification Perspective on Scene Text Recognition

Share this with someone who'll enjoy it:

Abstract:The prevalent perspectives of scene text recognition are from sequence to sequence (seq2seq) and segmentation. In this paper, we propose a new perspective on scene text recognition, in which we model the scene text recognition as an image classification problem. Based on the image classification perspective, a scene text recognition model is proposed, which is named as CSTR. The CSTR model consists of a series of convolutional layers and a global average pooling layer at the end, followed by independent multi-class classification heads, each of which predicts the corresponding character of the word sequence in input image. The CSTR model is easy to train using parallel cross entropy losses. CSTR is as simple as image classification models like ResNet \cite{he2016deep} which makes it easy to implement, and the fully convolutional neural network architecture makes it efficient to train and deploy. We demonstrate the effectiveness of the classification perspective on scene text recognition with thorough experiments. Futhermore, CSTR achieves nearly state-of-the-art performance on six public benchmarks including regular text, irregular text. The code will be available at https://github.com/Media-Smart/vedastr.

* 10 pages, 4 figures

View paper on

Share this with someone who'll enjoy it:

Title:CSTR: A Classification Perspective on Scene Text Recognition

Paper and Code