Alert button

"Object Detection": models, code, and papers
Alert button

Towards Real-Time Fast Unmanned Aerial Vehicle Detection Using Dynamic Vision Sensors

Mar 18, 2024
Jakub Mandula, Jonas Kühne, Luca Pascarella, Michele Magno

Viaarxiv icon

FlexCap: Generating Rich, Localized, and Flexible Captions in Images

Mar 18, 2024
Debidatta Dwibedi, Vidhi Jain, Jonathan Tompson, Andrew Zisserman, Yusuf Aytar

Viaarxiv icon

Continual Forgetting for Pre-trained Vision Models

Mar 18, 2024
Hongbo Zhao, Bolin Ni, Haochen Wang, Junsong Fan, Fei Zhu, Yuxi Wang, Yuntao Chen, Gaofeng Meng, Zhaoxiang Zhang

Viaarxiv icon

ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models

Mar 17, 2024
Siyuan Huang, Iaroslav Ponomarenko, Zhengkai Jiang, Xiaoqi Li, Xiaobin Hu, Peng Gao, Hongsheng Li, Hao Dong

Viaarxiv icon

Spiking Neural Networks for Fast-Moving Object Detection on Neuromorphic Hardware Devices Using an Event-Based Camera

Mar 15, 2024
Andreas Ziegler, Karl Vetter, Thomas Gossard, Jonas Tebbe, Andreas Zell

Viaarxiv icon

Reframe Anything: LLM Agent for Open World Video Reframing

Mar 10, 2024
Jiawang Cao, Yongliang Wu, Weiheng Chi, Wenbo Zhu, Ziyue Su, Jay Wu

Viaarxiv icon

PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution

Mar 16, 2024
Honghao Chen, Xiangxiang Chu, Yongjian Ren, Xin Zhao, Kaiqi Huang

Viaarxiv icon

Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring

Mar 14, 2024
Yufei Zhan, Yousong Zhu, Hongyin Zhao, Fan Yang, Ming Tang, Jinqiao Wang

Viaarxiv icon

Real-time Traffic Object Detection for Autonomous Driving

Jan 31, 2024
Abdul Hannan Khan, Syed Tahseen Raza Rizvi, Andreas Dengel

Viaarxiv icon

CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow

Mar 13, 2024
Chenbin Pan, Burhaneddin Yaman, Senem Velipasalar, Liu Ren

Viaarxiv icon