Alert button

"Image": models, code, and papers
Alert button

Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models

Aug 18, 2023
Navid Rajabi, Jana Kosecka

Figure 1 for Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models
Figure 2 for Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models
Figure 3 for Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models
Figure 4 for Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models
Viaarxiv icon

Automated mapping of virtual environments with visual predictive coding

Aug 20, 2023
James Gornet, Matthew Thomson

Figure 1 for Automated mapping of virtual environments with visual predictive coding
Figure 2 for Automated mapping of virtual environments with visual predictive coding
Figure 3 for Automated mapping of virtual environments with visual predictive coding
Figure 4 for Automated mapping of virtual environments with visual predictive coding
Viaarxiv icon

Improving Generalization of Synthetically Trained Sonar Image Descriptors for Underwater Place Recognition

Add code
Bookmark button
Alert button
Aug 02, 2023
Ivano Donadi, Emilio Olivastri, Daniel Fusaro, Wanmeng Li, Daniele Evangelista, Alberto Pretto

Figure 1 for Improving Generalization of Synthetically Trained Sonar Image Descriptors for Underwater Place Recognition
Figure 2 for Improving Generalization of Synthetically Trained Sonar Image Descriptors for Underwater Place Recognition
Figure 3 for Improving Generalization of Synthetically Trained Sonar Image Descriptors for Underwater Place Recognition
Figure 4 for Improving Generalization of Synthetically Trained Sonar Image Descriptors for Underwater Place Recognition
Viaarxiv icon

SAMScore: A Semantic Structural Similarity Metric for Image Translation Evaluation

Add code
Bookmark button
Alert button
May 24, 2023
Yunxiang Li, Meixu Chen, Wenxuan Yang, Kai Wang, Jun Ma, Alan C. Bovik, You Zhang

Figure 1 for SAMScore: A Semantic Structural Similarity Metric for Image Translation Evaluation
Figure 2 for SAMScore: A Semantic Structural Similarity Metric for Image Translation Evaluation
Figure 3 for SAMScore: A Semantic Structural Similarity Metric for Image Translation Evaluation
Figure 4 for SAMScore: A Semantic Structural Similarity Metric for Image Translation Evaluation
Viaarxiv icon

On the Interplay of Convolutional Padding and Adversarial Robustness

Aug 12, 2023
Paul Gavrikov, Janis Keuper

Figure 1 for On the Interplay of Convolutional Padding and Adversarial Robustness
Figure 2 for On the Interplay of Convolutional Padding and Adversarial Robustness
Figure 3 for On the Interplay of Convolutional Padding and Adversarial Robustness
Figure 4 for On the Interplay of Convolutional Padding and Adversarial Robustness
Viaarxiv icon

An Empirical Study of CLIP for Text-based Person Search

Add code
Bookmark button
Alert button
Aug 19, 2023
Min Cao, Yang Bai, Ziyin Zeng, Mang Ye, Min Zhang

Figure 1 for An Empirical Study of CLIP for Text-based Person Search
Figure 2 for An Empirical Study of CLIP for Text-based Person Search
Figure 3 for An Empirical Study of CLIP for Text-based Person Search
Figure 4 for An Empirical Study of CLIP for Text-based Person Search
Viaarxiv icon

Dual Branch Deep Learning Network for Detection and Stage Grading of Diabetic Retinopathy

Aug 19, 2023
Hossein Shakibania, Sina Raoufi, Behnam Pourafkham, Hassan Khotanlou, Muharram Mansoorizadeh

Figure 1 for Dual Branch Deep Learning Network for Detection and Stage Grading of Diabetic Retinopathy
Figure 2 for Dual Branch Deep Learning Network for Detection and Stage Grading of Diabetic Retinopathy
Figure 3 for Dual Branch Deep Learning Network for Detection and Stage Grading of Diabetic Retinopathy
Figure 4 for Dual Branch Deep Learning Network for Detection and Stage Grading of Diabetic Retinopathy
Viaarxiv icon

ControlRetriever: Harnessing the Power of Instructions for Controllable Retrieval

Aug 19, 2023
Kaihang Pan, Juncheng Li, Hongye Song, Hao Fei, Wei Ji, Shuo Zhang, Jun Lin, Xiaozhong Liu, Siliang Tang

Figure 1 for ControlRetriever: Harnessing the Power of Instructions for Controllable Retrieval
Figure 2 for ControlRetriever: Harnessing the Power of Instructions for Controllable Retrieval
Figure 3 for ControlRetriever: Harnessing the Power of Instructions for Controllable Retrieval
Figure 4 for ControlRetriever: Harnessing the Power of Instructions for Controllable Retrieval
Viaarxiv icon

Numerical Uncertainty of Convolutional Neural Networks Inference for Structural Brain MRI Analysis

Add code
Bookmark button
Alert button
Aug 03, 2023
Inés Gonzalez Pepe, Vinuyan Sivakolunthu, Hae Lang Park, Yohan Chatelain, Tristan Glatard

Viaarxiv icon

A Differential Testing Framework to Evaluate Image Recognition Model Robustness

Add code
Bookmark button
Alert button
Jun 05, 2023
Nikolaos Louloudakis, Perry Gibson, José Cano, Ajitha Rajan

Figure 1 for A Differential Testing Framework to Evaluate Image Recognition Model Robustness
Figure 2 for A Differential Testing Framework to Evaluate Image Recognition Model Robustness
Figure 3 for A Differential Testing Framework to Evaluate Image Recognition Model Robustness
Figure 4 for A Differential Testing Framework to Evaluate Image Recognition Model Robustness
Viaarxiv icon