Scene Text Recognition


Scene text recognition is the process of identifying and transcribing text in natural scenes using computer vision techniques.

Adaptive Prototype Model for Attribute-based Multi-label Few-shot Action Recognition

Add code
Feb 18, 2025
Viaarxiv icon

3D-Grounded Vision-Language Framework for Robotic Task Planning: Automated Prompt Synthesis and Supervised Reasoning

Add code
Feb 13, 2025
Viaarxiv icon

3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds

Add code
Feb 27, 2025
Viaarxiv icon

TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition

Add code
Dec 02, 2024
Figure 1 for TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition
Figure 2 for TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition
Figure 3 for TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition
Figure 4 for TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition
Viaarxiv icon

SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild

Add code
Jan 07, 2025
Figure 1 for SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Figure 2 for SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Figure 3 for SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Figure 4 for SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild
Viaarxiv icon

Arbitrary Reading Order Scene Text Spotter with Local Semantics Guidance

Add code
Dec 13, 2024
Viaarxiv icon

SVTRv2: CTC Beats Encoder-Decoder Models in Scene Text Recognition

Add code
Nov 24, 2024
Viaarxiv icon

Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing

Add code
Nov 23, 2024
Figure 1 for Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing
Figure 2 for Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing
Figure 3 for Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing
Figure 4 for Boosting Semi-Supervised Scene Text Recognition via Viewing and Summarizing
Viaarxiv icon

Relational Contrastive Learning and Masked Image Modeling for Scene Text Recognition

Add code
Nov 19, 2024
Viaarxiv icon

Dynamic Scene Understanding from Vision-Language Representations

Add code
Jan 20, 2025
Figure 1 for Dynamic Scene Understanding from Vision-Language Representations
Figure 2 for Dynamic Scene Understanding from Vision-Language Representations
Figure 3 for Dynamic Scene Understanding from Vision-Language Representations
Figure 4 for Dynamic Scene Understanding from Vision-Language Representations
Viaarxiv icon