Text Detection


Detecting the text in the image and localizing it using a bounding box. The text can be in any shape and size. We need to localize all such instances of text in the entire image along with bounding boxes for each word.

PromptSplit: Revealing Prompt-Level Disagreement in Generative Models

Add code
Feb 03, 2026
Viaarxiv icon

Beyond Open Vocabulary: Multimodal Prompting for Object Detection in Remote Sensing Images

Add code
Feb 02, 2026
Viaarxiv icon

Trust The Typical

Add code
Feb 04, 2026
Viaarxiv icon

Beyond Content: Behavioral Policies Reveal Actors in Information Operations

Add code
Feb 02, 2026
Viaarxiv icon

Reliable and Responsible Foundation Models: A Comprehensive Survey

Add code
Feb 04, 2026
Viaarxiv icon

Fact or Fake? Assessing the Role of Deepfake Detectors in Multimodal Misinformation Detection

Add code
Feb 02, 2026
Viaarxiv icon

LALM-as-a-Judge: Benchmarking Large Audio-Language Models for Safety Evaluation in Multi-Turn Spoken Dialogues

Add code
Feb 04, 2026
Viaarxiv icon

RegionReasoner: Region-Grounded Multi-Round Visual Reasoning

Add code
Feb 03, 2026
Viaarxiv icon

HALT: Hallucination Assessment via Log-probs as Time series

Add code
Feb 02, 2026
Viaarxiv icon

Semantic-level Backdoor Attack against Text-to-Image Diffusion Models

Add code
Feb 03, 2026
Viaarxiv icon