Object Detection


Object detection is a computer vision task in which the goal is to detect and locate objects of interest in an image or video. The task involves identifying the position and boundaries of objects in an image, and classifying the objects into different categories. It forms a crucial part of vision recognition, alongside image classification and retrieval.

URoPE: Universal Relative Position Embedding across Geometric Spaces

Add code
Apr 20, 2026
Viaarxiv icon

Attend what matters: Leveraging vision foundational models for breast cancer classification using mammograms

Add code
Apr 21, 2026
Viaarxiv icon

Toward Multimodal Conversational AI for Age-Related Macular Degeneration

Add code
Apr 28, 2026
Viaarxiv icon

Text-Guided Multimodal Unified Industrial Anomaly Detection

Add code
Apr 24, 2026
Viaarxiv icon

ArchSym: Detecting 3D-Grounded Architectural Symmetries in the Wild

Add code
Apr 24, 2026
Viaarxiv icon

R-CoV: Region-Aware Chain-of-Verification for Alleviating Object Hallucinations in LVLMs

Add code
Apr 22, 2026
Viaarxiv icon

Advancing Vision Transformer with Enhanced Spatial Priors

Add code
Apr 20, 2026
Viaarxiv icon

Paradigm Shift from Statistical Channel Modeling to Digital Twin Prediction: An Environment-Generalizable ChannelLM for 6G AI-enabled Air Interface

Add code
Apr 20, 2026
Viaarxiv icon

OneDrive: Unified Multi-Paradigm Driving with Vision-Language-Action Models

Add code
Apr 20, 2026
Viaarxiv icon

Autonomous Unmanned Aircraft Systems for Enhanced Search and Rescue of Drowning Swimmers: Image-Based Localization and Mission Simulation

Add code
Apr 20, 2026
Viaarxiv icon