Object Detection


Object detection is a computer vision task in which the goal is to detect and locate objects of interest in an image or video. The task involves identifying the position and boundaries of objects in an image, and classifying the objects into different categories. It forms a crucial part of vision recognition, alongside image classification and retrieval.

GenDet: Painting Colored Bounding Boxes on Images via Diffusion Model for Object Detection

Add code
Jan 12, 2026
Viaarxiv icon

From Prompts to Deployment: Auto-Curated Domain-Specific Dataset Generation via Diffusion Models

Add code
Jan 13, 2026
Viaarxiv icon

DentalX: Context-Aware Dental Disease Detection with Radiographs

Add code
Jan 13, 2026
Viaarxiv icon

Representation Learning with Semantic-aware Instance and Sparse Token Alignments

Add code
Jan 13, 2026
Viaarxiv icon

WaveFormer: Frequency-Time Decoupled Vision Modeling with Wave Equation

Add code
Jan 13, 2026
Viaarxiv icon

Edge-Optimized Multimodal Learning for UAV Video Understanding via BLIP-2

Add code
Jan 13, 2026
Viaarxiv icon

Human-inspired Global-to-Parallel Multi-scale Encoding for Lightweight Vision Models

Add code
Jan 13, 2026
Viaarxiv icon

FTDMamba: Frequency-Assisted Temporal Dilation Mamba for Unmanned Aerial Vehicle Video Anomaly Detection

Add code
Jan 16, 2026
Viaarxiv icon

Listen, Look, Drive: Coupling Audio Instructions for User-aware VLA-based Autonomous Driving

Add code
Jan 17, 2026
Viaarxiv icon

DeTracker: Motion-decoupled Vehicle Detection and Tracking in Unstabilized Satellite Videos

Add code
Jan 14, 2026
Viaarxiv icon