Text Detection


Detecting the text in the image and localizing it using a bounding box. The text can be in any shape and size. We need to localize all such instances of text in the entire image along with bounding boxes for each word.

HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Real-World Hallucination Detection

Add code
May 01, 2025
Viaarxiv icon

Visual Text Processing: A Comprehensive Review and Unified Evaluation

Add code
Apr 30, 2025
Viaarxiv icon

Traceback of Poisoning Attacks to Retrieval-Augmented Generation

Add code
Apr 30, 2025
Viaarxiv icon

Dynamic Attention Analysis for Backdoor Detection in Text-to-Image Diffusion Models

Add code
Apr 29, 2025
Viaarxiv icon

MemeBLIP2: A novel lightweight multimodal system to detect harmful memes

Add code
Apr 29, 2025
Viaarxiv icon

Detecting and Mitigating Hateful Content in Multimodal Memes with Vision-Language Models

Add code
Apr 30, 2025
Viaarxiv icon

GLIP-OOD: Zero-Shot Graph OOD Detection with Foundation Model

Add code
Apr 29, 2025
Viaarxiv icon

Unified Multi-Task Learning & Model Fusion for Efficient Language Model Guardrailing

Add code
Apr 29, 2025
Viaarxiv icon

Clustering Internet Memes Through Template Matching and Multi-Dimensional Similarity

Add code
Apr 30, 2025
Viaarxiv icon

Physics-Informed Diffusion Models for SAR Ship Wake Generation from Text Prompts

Add code
Apr 28, 2025
Viaarxiv icon