Object Detection


Object detection is a computer vision task in which the goal is to detect and locate objects of interest in an image or video. The task involves identifying the position and boundaries of objects in an image, and classifying the objects into different categories. It forms a crucial part of vision recognition, alongside image classification and retrieval.

Commercial Vehicle Braking Optimization: A Robust SIFT-Trajectory Approach

Add code
Dec 21, 2025
Figure 1 for Commercial Vehicle Braking Optimization: A Robust SIFT-Trajectory Approach
Figure 2 for Commercial Vehicle Braking Optimization: A Robust SIFT-Trajectory Approach
Figure 3 for Commercial Vehicle Braking Optimization: A Robust SIFT-Trajectory Approach
Figure 4 for Commercial Vehicle Braking Optimization: A Robust SIFT-Trajectory Approach
Viaarxiv icon

PILAR: Personalizing Augmented Reality Interactions with LLM-based Human-Centric and Trustworthy Explanations for Daily Use Cases

Add code
Dec 19, 2025
Figure 1 for PILAR: Personalizing Augmented Reality Interactions with LLM-based Human-Centric and Trustworthy Explanations for Daily Use Cases
Figure 2 for PILAR: Personalizing Augmented Reality Interactions with LLM-based Human-Centric and Trustworthy Explanations for Daily Use Cases
Figure 3 for PILAR: Personalizing Augmented Reality Interactions with LLM-based Human-Centric and Trustworthy Explanations for Daily Use Cases
Figure 4 for PILAR: Personalizing Augmented Reality Interactions with LLM-based Human-Centric and Trustworthy Explanations for Daily Use Cases
Viaarxiv icon

RecurGS: Interactive Scene Modeling via Discrete-State Recurrent Gaussian Fusion

Add code
Dec 20, 2025
Viaarxiv icon

Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection

Add code
Dec 18, 2025
Figure 1 for Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection
Figure 2 for Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection
Figure 3 for Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection
Figure 4 for Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection
Viaarxiv icon

FocalComm: Hard Instance-Aware Multi-Agent Perception

Add code
Dec 20, 2025
Figure 1 for FocalComm: Hard Instance-Aware Multi-Agent Perception
Figure 2 for FocalComm: Hard Instance-Aware Multi-Agent Perception
Figure 3 for FocalComm: Hard Instance-Aware Multi-Agent Perception
Figure 4 for FocalComm: Hard Instance-Aware Multi-Agent Perception
Viaarxiv icon

CoDi -- an exemplar-conditioned diffusion model for low-shot counting

Add code
Dec 23, 2025
Viaarxiv icon

LogicLens: Visual-Logical Co-Reasoning for Text-Centric Forgery Analysis

Add code
Dec 25, 2025
Viaarxiv icon

ReasonCD: A Multimodal Reasoning Large Model for Implicit Change-of-Interest Semantic Mining

Add code
Dec 22, 2025
Figure 1 for ReasonCD: A Multimodal Reasoning Large Model for Implicit Change-of-Interest Semantic Mining
Figure 2 for ReasonCD: A Multimodal Reasoning Large Model for Implicit Change-of-Interest Semantic Mining
Figure 3 for ReasonCD: A Multimodal Reasoning Large Model for Implicit Change-of-Interest Semantic Mining
Figure 4 for ReasonCD: A Multimodal Reasoning Large Model for Implicit Change-of-Interest Semantic Mining
Viaarxiv icon

IMKD: Intensity-Aware Multi-Level Knowledge Distillation for Camera-Radar Fusion

Add code
Dec 17, 2025
Viaarxiv icon

Beyond Proximity: A Keypoint-Trajectory Framework for Classifying Affiliative and Agonistic Social Networks in Dairy Cattle

Add code
Dec 17, 2025
Viaarxiv icon