Alert button

"Object Detection": models, code, and papers
Alert button

SAMF: Small-Area-Aware Multi-focus Image Fusion for Object Detection

Jan 31, 2024
Xilai Li, Xiaosong Li, Haishu Tan, Jinyang Li

Viaarxiv icon

Edge Computing Enabled Real-Time Video Analysis via Adaptive Spatial-Temporal Semantic Filtering

Feb 29, 2024
Xiang Chen, Wenjie Zhu, Jiayuan Chen, Tong Zhang, Changyan Yi, Jun Cai

Viaarxiv icon

ACC-ViT : Atrous Convolution's Comeback in Vision Transformers

Mar 07, 2024
Nabil Ibtehaz, Ning Yan, Masood Mortazavi, Daisuke Kihara

Figure 1 for ACC-ViT : Atrous Convolution's Comeback in Vision Transformers
Figure 2 for ACC-ViT : Atrous Convolution's Comeback in Vision Transformers
Figure 3 for ACC-ViT : Atrous Convolution's Comeback in Vision Transformers
Figure 4 for ACC-ViT : Atrous Convolution's Comeback in Vision Transformers
Viaarxiv icon

Knowledge Graph Driven UAV Cognitive Semantic Communication Systems for Efficient Object Detection

Jan 25, 2024
Xi Song, Lu Yuan, Zhibo Qu, Fuhui Zhou, Qihui Wu, Tony Q. S. Quek, Rose Qingyang Hu

Viaarxiv icon

Debiased Novel Category Discovering and Localization

Feb 29, 2024
Juexiao Feng, Yuhong Yang, Yanchun Xie, Yaqian Li, Yandong Guo, Yuchen Guo, Yuwei He, Liuyu Xiang, Guiguang Ding

Viaarxiv icon

Effectiveness Assessment of Recent Large Vision-Language Models

Mar 07, 2024
Yao Jiang, Xinyu Yan, Ge-Peng Ji, Keren Fu, Meijun Sun, Huan Xiong, Deng-Ping Fan, Fahad Shahbaz Khan

Figure 1 for Effectiveness Assessment of Recent Large Vision-Language Models
Figure 2 for Effectiveness Assessment of Recent Large Vision-Language Models
Figure 3 for Effectiveness Assessment of Recent Large Vision-Language Models
Figure 4 for Effectiveness Assessment of Recent Large Vision-Language Models
Viaarxiv icon

FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird's-Eye View and Perspective View

Mar 05, 2024
Jiawei Hou, Xiaoyan Li, Wenhao Guan, Gang Zhang, Di Feng, Yuheng Du, Xiangyang Xue, Jian Pu

Figure 1 for FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird's-Eye View and Perspective View
Figure 2 for FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird's-Eye View and Perspective View
Figure 3 for FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird's-Eye View and Perspective View
Figure 4 for FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird's-Eye View and Perspective View
Viaarxiv icon

YOLO-World: Real-Time Open-Vocabulary Object Detection

Feb 02, 2024
Tianheng Cheng, Lin Song, Yixiao Ge, Wenyu Liu, Xinggang Wang, Ying Shan

Viaarxiv icon

Streamlined Hybrid Annotation Framework using Scalable Codestream for Bandwidth-Restricted UAV Object Detection

Feb 07, 2024
Karim El Khoury, Tiffanie Godelaine, Simon Delvaux, Sebastien Lugan, Benoit Macq

Viaarxiv icon

MCA: Moment Channel Attention Networks

Mar 04, 2024
Yangbo Jiang, Zhiwei Jiang, Le Han, Zenan Huang, Nenggan Zheng

Viaarxiv icon