Object Detection


Object detection is a computer vision task in which the goal is to detect and locate objects of interest in an image or video. The task involves identifying the position and boundaries of objects in an image, and classifying the objects into different categories. It forms a crucial part of vision recognition, alongside image classification and retrieval.

FusionCounting: Robust visible-infrared image fusion guided by crowd counting via multi-task learning

Add code
Aug 28, 2025
Viaarxiv icon

FPI-Det: a face--phone Interaction Dataset for phone-use detection and understanding

Add code
Sep 11, 2025
Viaarxiv icon

Context-aware Sparse Spatiotemporal Learning for Event-based Vision

Add code
Aug 27, 2025
Figure 1 for Context-aware Sparse Spatiotemporal Learning for Event-based Vision
Figure 2 for Context-aware Sparse Spatiotemporal Learning for Event-based Vision
Figure 3 for Context-aware Sparse Spatiotemporal Learning for Event-based Vision
Figure 4 for Context-aware Sparse Spatiotemporal Learning for Event-based Vision
Viaarxiv icon

Maybe you don't need a U-Net: convolutional feature upsampling for materials micrograph segmentation

Add code
Aug 29, 2025
Viaarxiv icon

OVGrasp: Open-Vocabulary Grasping Assistance via Multimodal Intent Detection

Add code
Sep 04, 2025
Figure 1 for OVGrasp: Open-Vocabulary Grasping Assistance via Multimodal Intent Detection
Figure 2 for OVGrasp: Open-Vocabulary Grasping Assistance via Multimodal Intent Detection
Figure 3 for OVGrasp: Open-Vocabulary Grasping Assistance via Multimodal Intent Detection
Figure 4 for OVGrasp: Open-Vocabulary Grasping Assistance via Multimodal Intent Detection
Viaarxiv icon

How Well Do Vision--Language Models Understand Cities? A Comparative Study on Spatial Reasoning from Street-View Images

Add code
Aug 29, 2025
Viaarxiv icon

Blind Source Separation of Radar Signals in Time Domain Using Deep Learning

Add code
Sep 19, 2025
Viaarxiv icon

PointAD+: Learning Hierarchical Representations for Zero-shot 3D Anomaly Detection

Add code
Sep 03, 2025
Viaarxiv icon

MAGENTA: Magnitude and Geometry-ENhanced Training Approach for Robust Long-Tailed Sound Event Localization and Detection

Add code
Sep 19, 2025
Viaarxiv icon

E-ConvNeXt: A Lightweight and Efficient ConvNeXt Variant with Cross-Stage Partial Connections

Add code
Aug 28, 2025
Viaarxiv icon