Object Detection


Object detection is a computer vision task in which the goal is to detect and locate objects of interest in an image or video. The task involves identifying the position and boundaries of objects in an image, and classifying the objects into different categories. It forms a crucial part of vision recognition, alongside image classification and retrieval.

Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting

Add code
Oct 02, 2025
Viaarxiv icon

Modeling the Multivariate Relationship with Contextualized Representations for Effective Human-Object Interaction Detection

Add code
Sep 16, 2025
Viaarxiv icon

Dark-ISP: Enhancing RAW Image Processing for Low-Light Object Detection

Add code
Sep 11, 2025
Figure 1 for Dark-ISP: Enhancing RAW Image Processing for Low-Light Object Detection
Figure 2 for Dark-ISP: Enhancing RAW Image Processing for Low-Light Object Detection
Figure 3 for Dark-ISP: Enhancing RAW Image Processing for Low-Light Object Detection
Figure 4 for Dark-ISP: Enhancing RAW Image Processing for Low-Light Object Detection
Viaarxiv icon

Data Augmentation via Latent Diffusion Models for Detecting Smell-Related Objects in Historical Artworks

Add code
Sep 18, 2025
Viaarxiv icon

Explicit Multimodal Graph Modeling for Human-Object Interaction Detection

Add code
Sep 16, 2025
Viaarxiv icon

Joint Optimization of Speaker and Spoof Detectors for Spoofing-Robust Automatic Speaker Verification

Add code
Oct 02, 2025
Viaarxiv icon

A Structured Review of Underwater Object Detection Challenges and Solutions: From Traditional to Large Vision Language Models

Add code
Sep 10, 2025
Viaarxiv icon

Addressing Pitfalls in the Evaluation of Uncertainty Estimation Methods for Natural Language Generation

Add code
Oct 02, 2025
Figure 1 for Addressing Pitfalls in the Evaluation of Uncertainty Estimation Methods for Natural Language Generation
Figure 2 for Addressing Pitfalls in the Evaluation of Uncertainty Estimation Methods for Natural Language Generation
Figure 3 for Addressing Pitfalls in the Evaluation of Uncertainty Estimation Methods for Natural Language Generation
Figure 4 for Addressing Pitfalls in the Evaluation of Uncertainty Estimation Methods for Natural Language Generation
Viaarxiv icon

UNIV: Unified Foundation Model for Infrared and Visible Modalities

Add code
Sep 19, 2025
Viaarxiv icon

An Exploratory Study on Abstract Images and Visual Representations Learned from Them

Add code
Sep 17, 2025
Viaarxiv icon