Object Detection


Object detection is a computer vision task in which the goal is to detect and locate objects of interest in an image or video. The task involves identifying the position and boundaries of objects in an image, and classifying the objects into different categories. It forms a crucial part of vision recognition, alongside image classification and retrieval.

ALIGN: Advanced Query Initialization with LiDAR-Image Guidance for Occlusion-Robust 3D Object Detection

Add code
Dec 20, 2025
Figure 1 for ALIGN: Advanced Query Initialization with LiDAR-Image Guidance for Occlusion-Robust 3D Object Detection
Figure 2 for ALIGN: Advanced Query Initialization with LiDAR-Image Guidance for Occlusion-Robust 3D Object Detection
Figure 3 for ALIGN: Advanced Query Initialization with LiDAR-Image Guidance for Occlusion-Robust 3D Object Detection
Figure 4 for ALIGN: Advanced Query Initialization with LiDAR-Image Guidance for Occlusion-Robust 3D Object Detection
Viaarxiv icon

${D}^{3}${ETOR}: ${D}$ebate-Enhanced Pseudo Labeling and Frequency-Aware Progressive ${D}$ebiasing for Weakly-Supervised Camouflaged Object ${D}$etection with Scribble Annotations

Add code
Dec 23, 2025
Viaarxiv icon

LiteFusion: Taming 3D Object Detectors from Vision-Based to Multi-Modal with Minimal Adaptation

Add code
Dec 23, 2025
Viaarxiv icon

Bridging Modalities and Transferring Knowledge: Enhanced Multimodal Understanding and Recognition

Add code
Dec 23, 2025
Viaarxiv icon

Spectral Discrepancy and Cross-modal Semantic Consistency Learning for Object Detection in Hyperspectral Image

Add code
Dec 20, 2025
Viaarxiv icon

Auto-Vocabulary 3D Object Detection

Add code
Dec 18, 2025
Figure 1 for Auto-Vocabulary 3D Object Detection
Figure 2 for Auto-Vocabulary 3D Object Detection
Figure 3 for Auto-Vocabulary 3D Object Detection
Figure 4 for Auto-Vocabulary 3D Object Detection
Viaarxiv icon

Building UI/UX Dataset for Dark Pattern Detection and YOLOv12x-based Real-Time Object Recognition Detection System

Add code
Dec 20, 2025
Viaarxiv icon

Decoupling Constraint from Two Direction in Evolutionary Constrained Multi-objective Optimization

Add code
Dec 30, 2025
Viaarxiv icon

Medical Image Classification on Imbalanced Data Using ProGAN and SMA-Optimized ResNet: Application to COVID-19

Add code
Dec 30, 2025
Viaarxiv icon

Pyramidal Adaptive Cross-Gating for Multimodal Detection

Add code
Dec 20, 2025
Viaarxiv icon