Optical Character Recognition


Optical Character Recognition or Optical Character Reader (OCR) is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene photo (for example the text on signs and billboards in a landscape photo, license plates in cars...) or from subtitle text superimposed on an image.

UniVL: Unified Vision-Language Embedding for Spatially Grounded Contextual Image Generation

Add code
May 20, 2026
Viaarxiv icon

CC-OCR V2: Benchmarking Large Multimodal Models for Literacy in Real-world Document Processing

Add code
May 05, 2026
Viaarxiv icon

ATLAS: Article Tracking, Linking, and Analysis of Swedish Encyclopedias

Add code
May 04, 2026
Viaarxiv icon

Benchmarking OCR Pipelines with Adaptive Enhancement for Multi-Domain Retail Bill Digitization

Add code
Apr 28, 2026
Viaarxiv icon

Reading in the Dark: Low-light Scene Text Recognition

Add code
Apr 26, 2026
Viaarxiv icon

GlotOCR Bench: OCR Models Still Struggle Beyond a Handful of Unicode Scripts

Add code
Apr 14, 2026
Viaarxiv icon

DocRevive: A Unified Pipeline for Document Text Restoration

Add code
Apr 11, 2026
Viaarxiv icon

AtlasOCR: Building the First Open-Source Darija OCR Model with Vision Language Models

Add code
Apr 09, 2026
Viaarxiv icon

Multi-Head Attention based interaction-aware architecture for Bangla Handwritten Character Recognition: Introducing a Primary Dataset

Add code
Apr 08, 2026
Viaarxiv icon

LLM-based Schema-Guided Extraction and Validation of Missing-Person Intelligence from Heterogeneous Data Sources

Add code
Apr 08, 2026
Viaarxiv icon