Scene Text Recognition


Scene text recognition is the process of identifying and transcribing text in natural scenes using computer vision techniques.

Visual and textual prompts for enhancing emotion recognition in video

Add code
Apr 24, 2025
Viaarxiv icon

Analysing the Robustness of Vision-Language-Models to Common Corruptions

Add code
Apr 21, 2025
Viaarxiv icon

SemiETS: Integrating Spatial and Content Consistencies for Semi-Supervised End-to-end Text Spotting

Add code
Apr 14, 2025
Viaarxiv icon

Efficient and Accurate Scene Text Recognition with Cascaded-Transformers

Add code
Mar 24, 2025
Viaarxiv icon

Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition

Add code
Mar 24, 2025
Viaarxiv icon

Accurate Scene Text Recognition with Efficient Model Scaling and Cloze Self-Distillation

Add code
Mar 20, 2025
Viaarxiv icon

Edge Approximation Text Detector

Add code
Apr 05, 2025
Viaarxiv icon

Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery

Add code
Apr 02, 2025
Viaarxiv icon

A Context-Driven Training-Free Network for Lightweight Scene Text Segmentation and Recognition

Add code
Mar 19, 2025
Viaarxiv icon

NCAP: Scene Text Image Super-Resolution with Non-CAtegorical Prior

Add code
Apr 01, 2025
Viaarxiv icon