Object Detection


Object detection is a computer vision task in which the goal is to detect and locate objects of interest in an image or video. The task involves identifying the position and boundaries of objects in an image, and classifying the objects into different categories. It forms a crucial part of vision recognition, alongside image classification and retrieval.

Building UI/UX Dataset for Dark Pattern Detection and YOLOv12x-based Real-Time Object Recognition Detection System

Add code
Dec 20, 2025
Viaarxiv icon

Physics-Inspired Modeling and Content Adaptive Routing in an Infrared Gas Leak Detection Network

Add code
Dec 29, 2025
Viaarxiv icon

Optical Flow-Guided 6DoF Object Pose Tracking with an Event Camera

Add code
Dec 24, 2025
Viaarxiv icon

Pyramidal Adaptive Cross-Gating for Multimodal Detection

Add code
Dec 20, 2025
Viaarxiv icon

Auto-Vocabulary 3D Object Detection

Add code
Dec 18, 2025
Figure 1 for Auto-Vocabulary 3D Object Detection
Figure 2 for Auto-Vocabulary 3D Object Detection
Figure 3 for Auto-Vocabulary 3D Object Detection
Figure 4 for Auto-Vocabulary 3D Object Detection
Viaarxiv icon

Application of deep learning approaches for medieval historical documents transcription

Add code
Dec 21, 2025
Viaarxiv icon

SSCATeR: Sparse Scatter-Based Convolution Algorithm with Temporal Data Recycling for Real-Time 3D Object Detection in LiDAR Point Clouds

Add code
Dec 19, 2025
Viaarxiv icon

StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection

Add code
Dec 19, 2025
Figure 1 for StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection
Figure 2 for StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection
Figure 3 for StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection
Figure 4 for StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection
Viaarxiv icon

Neural Probe-Based Hallucination Detection for Large Language Models

Add code
Dec 24, 2025
Viaarxiv icon

Toward Intelligent Scene Augmentation for Context-Aware Object Placement and Sponsor-Logo Integration

Add code
Dec 25, 2025
Viaarxiv icon