Alert button

"Image": models, code, and papers
Alert button

FGFusion: Fine-Grained Lidar-Camera Fusion for 3D Object Detection

Add code
Bookmark button
Alert button
Sep 21, 2023
Zixuan Yin, Han Sun, Ningzhong Liu, Huiyu Zhou, Jiaquan Shen

Figure 1 for FGFusion: Fine-Grained Lidar-Camera Fusion for 3D Object Detection
Figure 2 for FGFusion: Fine-Grained Lidar-Camera Fusion for 3D Object Detection
Figure 3 for FGFusion: Fine-Grained Lidar-Camera Fusion for 3D Object Detection
Figure 4 for FGFusion: Fine-Grained Lidar-Camera Fusion for 3D Object Detection
Viaarxiv icon

DeepMesh: Mesh-based Cardiac Motion Tracking using Deep Learning

Add code
Bookmark button
Alert button
Sep 25, 2023
Qingjie Meng, Wenjia Bai, Declan P O'Regan, and Daniel Rueckert

Figure 1 for DeepMesh: Mesh-based Cardiac Motion Tracking using Deep Learning
Figure 2 for DeepMesh: Mesh-based Cardiac Motion Tracking using Deep Learning
Figure 3 for DeepMesh: Mesh-based Cardiac Motion Tracking using Deep Learning
Figure 4 for DeepMesh: Mesh-based Cardiac Motion Tracking using Deep Learning
Viaarxiv icon

Towards A Robust Group-level Emotion Recognition via Uncertainty-Aware Learning

Oct 06, 2023
Qing Zhu, Qirong Mao, Jialin Zhang, Xiaohua Huang, Wenming Zheng

Figure 1 for Towards A Robust Group-level Emotion Recognition via Uncertainty-Aware Learning
Figure 2 for Towards A Robust Group-level Emotion Recognition via Uncertainty-Aware Learning
Figure 3 for Towards A Robust Group-level Emotion Recognition via Uncertainty-Aware Learning
Figure 4 for Towards A Robust Group-level Emotion Recognition via Uncertainty-Aware Learning
Viaarxiv icon

BrainSCUBA: Fine-Grained Natural Language Captions of Visual Cortex Selectivity

Oct 06, 2023
Andrew F. Luo, Margaret M. Henderson, Michael J. Tarr, Leila Wehbe

Viaarxiv icon

Module-wise Adaptive Distillation for Multimodality Foundation Models

Oct 06, 2023
Chen Liang, Jiahui Yu, Ming-Hsuan Yang, Matthew Brown, Yin Cui, Tuo Zhao, Boqing Gong, Tianyi Zhou

Figure 1 for Module-wise Adaptive Distillation for Multimodality Foundation Models
Figure 2 for Module-wise Adaptive Distillation for Multimodality Foundation Models
Figure 3 for Module-wise Adaptive Distillation for Multimodality Foundation Models
Figure 4 for Module-wise Adaptive Distillation for Multimodality Foundation Models
Viaarxiv icon

SPADE: Sparsity-Guided Debugging for Deep Neural Networks

Oct 06, 2023
Arshia Soltani Moakhar, Eugenia Iofinova, Dan Alistarh

Figure 1 for SPADE: Sparsity-Guided Debugging for Deep Neural Networks
Figure 2 for SPADE: Sparsity-Guided Debugging for Deep Neural Networks
Figure 3 for SPADE: Sparsity-Guided Debugging for Deep Neural Networks
Figure 4 for SPADE: Sparsity-Guided Debugging for Deep Neural Networks
Viaarxiv icon

Rethinking Cross-Domain Pedestrian Detection: A Background-Focused Distribution Alignment Framework for Instance-Free One-Stage Detectors

Add code
Bookmark button
Alert button
Sep 15, 2023
Yancheng Cai, Bo Zhang, Baopu Li, Tao Chen, Hongliang Yan, Jingdong Zhang, Jiahao Xu

Figure 1 for Rethinking Cross-Domain Pedestrian Detection: A Background-Focused Distribution Alignment Framework for Instance-Free One-Stage Detectors
Figure 2 for Rethinking Cross-Domain Pedestrian Detection: A Background-Focused Distribution Alignment Framework for Instance-Free One-Stage Detectors
Figure 3 for Rethinking Cross-Domain Pedestrian Detection: A Background-Focused Distribution Alignment Framework for Instance-Free One-Stage Detectors
Figure 4 for Rethinking Cross-Domain Pedestrian Detection: A Background-Focused Distribution Alignment Framework for Instance-Free One-Stage Detectors
Viaarxiv icon

Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models

Add code
Bookmark button
Alert button
Oct 09, 2023
Archiki Prasad, Elias Stengel-Eskin, Mohit Bansal

Figure 1 for Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models
Figure 2 for Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models
Figure 3 for Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models
Figure 4 for Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models
Viaarxiv icon

INR-LDDMM: Fluid-based Medical Image Registration Integrating Implicit Neural Representation and Large Deformation Diffeomorphic Metric Mapping

Aug 21, 2023
Chulong Zhang, Xiaokun Liang

Figure 1 for INR-LDDMM: Fluid-based Medical Image Registration Integrating Implicit Neural Representation and Large Deformation Diffeomorphic Metric Mapping
Figure 2 for INR-LDDMM: Fluid-based Medical Image Registration Integrating Implicit Neural Representation and Large Deformation Diffeomorphic Metric Mapping
Figure 3 for INR-LDDMM: Fluid-based Medical Image Registration Integrating Implicit Neural Representation and Large Deformation Diffeomorphic Metric Mapping
Figure 4 for INR-LDDMM: Fluid-based Medical Image Registration Integrating Implicit Neural Representation and Large Deformation Diffeomorphic Metric Mapping
Viaarxiv icon

PEACE: Prompt Engineering Automation for CLIPSeg Enhancement in Aerial Robotics

Add code
Bookmark button
Alert button
Sep 29, 2023
Haechan Mark Bong, Rongge Zhang, Ricardo de Azambuja, Giovanni Beltrame

Figure 1 for PEACE: Prompt Engineering Automation for CLIPSeg Enhancement in Aerial Robotics
Figure 2 for PEACE: Prompt Engineering Automation for CLIPSeg Enhancement in Aerial Robotics
Figure 3 for PEACE: Prompt Engineering Automation for CLIPSeg Enhancement in Aerial Robotics
Figure 4 for PEACE: Prompt Engineering Automation for CLIPSeg Enhancement in Aerial Robotics
Viaarxiv icon