"Image": models, code, and papers

Recent Advances in Scene Image Representation and Classification

Jun 15, 2022
Chiranjibi Sitaula, Tej Bahadur Shahi, Faezeh Marzbanrad

Test-time image-to-image translation ensembling improves out-of-distribution generalization in histopathology

Jun 30, 2022
Marin Scalbert, Maria Vakalopoulou, Florent Couzinié-Devy

SR-GNN: Spatial Relation-aware Graph Neural Network for Fine-Grained Image Categorization

Sep 05, 2022
Asish Bera, Zachary Wharton, Yonghuai Liu, Nik Bessis, Ardhendu Behera

Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval

Jul 29, 2022
Hao Wang, Guosheng Lin, Steven C. H. Hoi, Chunyan Miao

Exploring Negatives in Contrastive Learning for Unpaired Image-to-Image Translation

Apr 23, 2022
Yupei Lin, Sen Zhang, Tianshui Chen, Yongyi Lu, Guangping Li, Yukai Shi

Domain Generalization via Ensemble Stacking for Face Presentation Attack Detection

Jan 05, 2023
Usman Muhammad, Djamila Romaissa Beddiar, Mourad Oussalah

CLIP4IDC: CLIP for Image Difference Captioning

Jun 01, 2022
Zixin Guo, Tzu-Jui Julius Wang, Jorma Laaksonen

ChiQA: A Large Scale Image-based Real-World Question Answering Dataset for Multi-Modal Understanding

Aug 05, 2022
Bingning Wang, Feiyang Lv, Ting Yao, Yiming Yuan, Jin Ma, Yu Luo, Haijin Liang

Learning Transformations To Reduce the Geometric Shift in Object Detection

Jan 13, 2023
Vidit Vidit, Martin Engilberge, Mathieu Salzmann

Detecting Objects with Graph Priors and Graph Refinement

Dec 23, 2022
Aritra Bhowmik, Martin R. Oswald, Yu Wang, Nora Baka, Cees G. M. Snoek
