Object Detection


Object detection is a computer vision task in which the goal is to detect and locate objects of interest in an image or video. The task involves identifying the position and boundaries of objects in an image, and classifying the objects into different categories. It forms a crucial part of vision recognition, alongside image classification and retrieval.

AnyDepth-DETR/-YOLO: Any-depth object detection with a single network

Add code
May 10, 2026
Viaarxiv icon

What-Where Transformer: A Slot-Centric Visual Backbone for Concurrent Representation and Localization

Add code
May 12, 2026
Viaarxiv icon

Prediction of Rectal Cancer Regrowth from Longitudinal Endoscopy

Add code
May 13, 2026
Viaarxiv icon

Beyond AI as Assistants: Toward Autonomous Discovery in Cosmology

Add code
May 14, 2026
Viaarxiv icon

Grounded or Guessing? LVLM Confidence Estimation via Blind-Image Contrastive Ranking

Add code
May 11, 2026
Viaarxiv icon

Distilling 3D Spatial Reasoning into a Lightweight Vision-Language Model with CoT

Add code
May 10, 2026
Viaarxiv icon

CrossVL: Complexity-Aware Feature Routing and Paired Curriculum for Cross-View Vision-Language Detection

Add code
May 10, 2026
Viaarxiv icon

MMVIAD: Multi-view Multi-task Video Understanding for Industrial Anomaly Detection

Add code
May 11, 2026
Viaarxiv icon

Moltbook Moderation: Uncovering Hidden Intent Through Multi-Turn Dialogue

Add code
May 14, 2026
Viaarxiv icon

MVB-Grasp: Minimum-Volume-Box Filtering of Diffusion-based Grasps for Frontal Manipulation

Add code
May 10, 2026
Viaarxiv icon