"Image": models, code, and papers

Semantic Embedded Deep Neural Network: A Generic Approach to Boost Multi-Label Image Classification Performance

May 09, 2023
Xin Shen, Xiaonan Zhao, Rui Luo

Via arXiv

Intra- & Extra-Source Exemplar-Based Style Synthesis for Improved Domain Generalization

Jul 02, 2023
Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva

Via arXiv

UniBoost: Unsupervised Unimodal Pre-training for Boosting Zero-shot Vision-Language Tasks

Jun 07, 2023
Yanan Sun, Zihan Zhong, Qi Fan, Chi-Keung Tang, Yu-Wing Tai

Via arXiv

A Semi-supervised Object Detection Algorithm for Underwater Imagery

Jun 07, 2023
Suraj Bijjahalli, Oscar Pizarro, Stefan B. Williams

Via arXiv

Modality-Invariant Representation for Infrared and Visible Image Registration

Apr 12, 2023
Zhiying Jiang, Zengxi Zhang, Jinyuan Liu, Xin Fan, Risheng Liu

Via arXiv

ASIC: Aligning Sparse in-the-wild Image Collections

Mar 28, 2023
Kamal Gupta, Varun Jampani, Carlos Esteves, Abhinav Shrivastava, Ameesh Makadia, Noah Snavely, Abhishek Kar

Via arXiv

Investigating Prompting Techniques for Zero- and Few-Shot Visual Question Answering

Jun 16, 2023
Rabiul Awal, Le Zhang, Aishwarya Agrawal

Via arXiv

VisText: A Benchmark for Semantically Rich Chart Captioning

Jun 28, 2023
Benny J. Tang, Angie Boggust, Arvind Satyanarayan

Via arXiv

Theater Aid System for the Visually Impaired Through Transfer Learning of Spatio-Temporal Graph Convolution Networks

Jun 28, 2023
Leyla Benhamida, Slimane Larabi

Via arXiv

MobileViG: Graph-Based Sparse Attention for Mobile Vision Applications

Jul 01, 2023
Mustafa Munir, William Avery, Radu Marculescu

Via arXiv