"Object Detection": models, code, and papers

$V_kD$: Improving Knowledge Distillation using Orthogonal Projections

Mar 10, 2024
Roy Miles, Ismail Elezi, Jiankang Deng

Knowledge Graph Driven UAV Cognitive Semantic Communication Systems for Efficient Object Detection

Jan 25, 2024
Xi Song, Lu Yuan, Zhibo Qu, Fuhui Zhou, Qihui Wu, Tony Q. S. Quek, Rose Qingyang Hu

Poly Kernel Inception Network for Remote Sensing Detection

Mar 10, 2024
Xinhao Cai, Qiuxia Lai, Yuwei Wang, Wenguan Wang, Zeren Sun, Yazhou Yao

FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything

Feb 29, 2024
Safouane El Ghazouali, Youssef Mhirit, Ali Oukhrid, Umberto Michelucci, Hichem Nouira

Frequency-Adaptive Dilated Convolution for Semantic Segmentation

Mar 12, 2024
Linwei Chen, Lin Gu, Ying Fu

PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution

Mar 12, 2024
Honghao Chen, Xiangxiang Chu, Yongjian Ren, Xin Zhao, Kaiqi Huang

Cross-domain and Cross-dimension Learning for Image-to-Graph Transformers

Mar 11, 2024
Alexander H. Berger, Laurin Lux, Suprosanna Shit, Ivan Ezhov, Georgios Kaissis, Martin J. Menten, Daniel Rueckert, Johannes C. Paetzold

Improving the Successful Robotic Grasp Detection Using Convolutional Neural Networks

Mar 08, 2024
Hamed Hosseini, Mehdi Tale Masouleh, Ahmad Kalhor

Detecting Concrete Visual Tokens for Multimodal Machine Translation

Mar 05, 2024
Braeden Bowen, Vipin Vijayan, Scott Grigsby, Timothy Anderson, Jeremy Gwinnup

YOLO-World: Real-Time Open-Vocabulary Object Detection

Feb 02, 2024
Tianheng Cheng, Lin Song, Yixiao Ge, Wenyu Liu, Xinggang Wang, Ying Shan
