Alert button

"Object Detection": models, code, and papers
Alert button

Detection of Micromobility Vehicles in Urban Traffic Videos

Add code
Bookmark button
Alert button
Feb 28, 2024
Khalil Sabri, Célia Djilali, Guillaume-Alexandre Bilodeau, Nicolas Saunier, Wassim Bouachir

Viaarxiv icon

CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow

Mar 13, 2024
Chenbin Pan, Burhaneddin Yaman, Senem Velipasalar, Liu Ren

Figure 1 for CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow
Figure 2 for CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow
Figure 3 for CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow
Figure 4 for CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow
Viaarxiv icon

Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring

Add code
Bookmark button
Alert button
Mar 14, 2024
Yufei Zhan, Yousong Zhu, Hongyin Zhao, Fan Yang, Ming Tang, Jinqiao Wang

Figure 1 for Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
Figure 2 for Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
Figure 3 for Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
Figure 4 for Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring
Viaarxiv icon

Rectify the Regression Bias in Long-Tailed Object Detection

Jan 31, 2024
Ke Zhu, Minghao Fu, Jie Shao, Tianyu Liu, Jianxin Wu

Viaarxiv icon

EfficientMFD: Towards More Efficient Multimodal Synchronous Fusion Detection

Add code
Bookmark button
Alert button
Mar 14, 2024
Jiaqing Zhang, Mingxiang Cao, Xue Yang, Weiying Xie, Jie Lei, Daixun Li, Geng Yang, Wenbo Huang, Yunsong Li

Figure 1 for EfficientMFD: Towards More Efficient Multimodal Synchronous Fusion Detection
Figure 2 for EfficientMFD: Towards More Efficient Multimodal Synchronous Fusion Detection
Figure 3 for EfficientMFD: Towards More Efficient Multimodal Synchronous Fusion Detection
Figure 4 for EfficientMFD: Towards More Efficient Multimodal Synchronous Fusion Detection
Viaarxiv icon

G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object Detection

Add code
Bookmark button
Alert button
Feb 07, 2024
Fan Wu, Jinling Gao, Lanqing Hong, Xinbing Wang, Chenghu Zhou, Nanyang Ye

Viaarxiv icon

PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution

Mar 16, 2024
Honghao Chen, Xiangxiang Chu, Yongjian Ren, Xin Zhao, Kaiqi Huang

Figure 1 for PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution
Figure 2 for PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution
Figure 3 for PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution
Figure 4 for PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution
Viaarxiv icon

A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions

Mar 12, 2024
Quoc-Vinh Lai-Dang

Figure 1 for A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions
Figure 2 for A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions
Figure 3 for A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions
Viaarxiv icon

Context-aware Multi-Model Object Detection for Diversely Heterogeneous Compute Systems

Feb 12, 2024
Justin Davis, Mehmet E. Belviranli

Viaarxiv icon

AYDIV: Adaptable Yielding 3D Object Detection via Integrated Contextual Vision Transformer

Add code
Bookmark button
Alert button
Feb 12, 2024
Tanmoy Dam, Sanjay Bhargav Dharavath, Sameer Alam, Nimrod Lilith, Supriyo Chakraborty, Mir Feroskhan

Viaarxiv icon