Text Spotting


Text spotting is the combination of Scene Text Detection and Scene Text Recognition in an end-to-end manner. It is the ability to read natural text in the wild.

TiCLS : Tightly Coupled Language Text Spotter

Add code
Feb 03, 2026
Viaarxiv icon

ReCALL: Recalibrating Capability Degradation for MLLM-based Composed Image Retrieval

Add code
Feb 02, 2026
Viaarxiv icon

PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing

Add code
Jan 29, 2026
Viaarxiv icon

MATE: Matryoshka Audio-Text Embeddings for Open-Vocabulary Keyword Spotting

Add code
Jan 20, 2026
Viaarxiv icon

Unified Diffusion Transformer for High-fidelity Text-Aware Image Restoration

Add code
Dec 09, 2025
Viaarxiv icon

Joint Multimodal Contrastive Learning for Robust Spoken Term Detection and Keyword Spotting

Add code
Dec 16, 2025
Figure 1 for Joint Multimodal Contrastive Learning for Robust Spoken Term Detection and Keyword Spotting
Figure 2 for Joint Multimodal Contrastive Learning for Robust Spoken Term Detection and Keyword Spotting
Figure 3 for Joint Multimodal Contrastive Learning for Robust Spoken Term Detection and Keyword Spotting
Figure 4 for Joint Multimodal Contrastive Learning for Robust Spoken Term Detection and Keyword Spotting
Viaarxiv icon

Prime and Reach: Synthesising Body Motion for Gaze-Primed Object Reach

Add code
Dec 18, 2025
Figure 1 for Prime and Reach: Synthesising Body Motion for Gaze-Primed Object Reach
Figure 2 for Prime and Reach: Synthesising Body Motion for Gaze-Primed Object Reach
Figure 3 for Prime and Reach: Synthesising Body Motion for Gaze-Primed Object Reach
Figure 4 for Prime and Reach: Synthesising Body Motion for Gaze-Primed Object Reach
Viaarxiv icon

Pose-Based Sign Language Spotting via an End-to-End Encoder Architecture

Add code
Dec 09, 2025
Viaarxiv icon

Script Gap: Evaluating LLM Triage on Indian Languages in Native vs Roman Scripts in a Real World Setting

Add code
Dec 11, 2025
Figure 1 for Script Gap: Evaluating LLM Triage on Indian Languages in Native vs Roman Scripts in a Real World Setting
Figure 2 for Script Gap: Evaluating LLM Triage on Indian Languages in Native vs Roman Scripts in a Real World Setting
Figure 3 for Script Gap: Evaluating LLM Triage on Indian Languages in Native vs Roman Scripts in a Real World Setting
Figure 4 for Script Gap: Evaluating LLM Triage on Indian Languages in Native vs Roman Scripts in a Real World Setting
Viaarxiv icon

LRANet++: Low-Rank Approximation Network for Accurate and Efficient Text Spotting

Add code
Nov 08, 2025
Viaarxiv icon