Alert button

"Image": models, code, and papers
Alert button

Ensemble of Anchor-Free Models for Robust Bangla Document Layout Segmentation

Aug 29, 2023
U Mong Sain Chak, Md. Asib Rahman

Figure 1 for Ensemble of Anchor-Free Models for Robust Bangla Document Layout Segmentation
Figure 2 for Ensemble of Anchor-Free Models for Robust Bangla Document Layout Segmentation
Figure 3 for Ensemble of Anchor-Free Models for Robust Bangla Document Layout Segmentation
Figure 4 for Ensemble of Anchor-Free Models for Robust Bangla Document Layout Segmentation
Viaarxiv icon

Is attention all you need in medical image analysis? A review

Jul 24, 2023
Giorgos Papanastasiou, Nikolaos Dikaios, Jiahao Huang, Chengjia Wang, Guang Yang

Figure 1 for Is attention all you need in medical image analysis? A review
Figure 2 for Is attention all you need in medical image analysis? A review
Figure 3 for Is attention all you need in medical image analysis? A review
Figure 4 for Is attention all you need in medical image analysis? A review
Viaarxiv icon

Catching Image Retrieval Generalization

Jun 23, 2023
Maksim Zhdanov, Ivan Karpukhin

Figure 1 for Catching Image Retrieval Generalization
Figure 2 for Catching Image Retrieval Generalization
Figure 3 for Catching Image Retrieval Generalization
Figure 4 for Catching Image Retrieval Generalization
Viaarxiv icon

Can Prompt Learning Benefit Radiology Report Generation?

Aug 30, 2023
Jun Wang, Lixing Zhu, Abhir Bhalerao, Yulan He

Figure 1 for Can Prompt Learning Benefit Radiology Report Generation?
Figure 2 for Can Prompt Learning Benefit Radiology Report Generation?
Figure 3 for Can Prompt Learning Benefit Radiology Report Generation?
Figure 4 for Can Prompt Learning Benefit Radiology Report Generation?
Viaarxiv icon

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation

Jul 03, 2023
Zibo Zhao, Wen Liu, Xin Chen, Xianfang Zeng, Rui Wang, Pei Cheng, Bin Fu, Tao Chen, Gang Yu, Shenghua Gao

Figure 1 for Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
Figure 2 for Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
Figure 3 for Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
Figure 4 for Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
Viaarxiv icon

ProRes: Exploring Degradation-aware Visual Prompt for Universal Image Restoration

Jun 23, 2023
Jiaqi Ma, Tianheng Cheng, Guoli Wang, Qian Zhang, Xinggang Wang, Lefei Zhang

Figure 1 for ProRes: Exploring Degradation-aware Visual Prompt for Universal Image Restoration
Figure 2 for ProRes: Exploring Degradation-aware Visual Prompt for Universal Image Restoration
Figure 3 for ProRes: Exploring Degradation-aware Visual Prompt for Universal Image Restoration
Figure 4 for ProRes: Exploring Degradation-aware Visual Prompt for Universal Image Restoration
Viaarxiv icon

Cross-modality Attention-based Multimodal Fusion for Non-small Cell Lung Cancer (NSCLC) Patient Survival Prediction

Aug 18, 2023
Ruining Deng, Nazim Shaikh, Gareth Shannon, Yao Nie

Figure 1 for Cross-modality Attention-based Multimodal Fusion for Non-small Cell Lung Cancer (NSCLC) Patient Survival Prediction
Figure 2 for Cross-modality Attention-based Multimodal Fusion for Non-small Cell Lung Cancer (NSCLC) Patient Survival Prediction
Figure 3 for Cross-modality Attention-based Multimodal Fusion for Non-small Cell Lung Cancer (NSCLC) Patient Survival Prediction
Viaarxiv icon

MILA: Memory-Based Instance-Level Adaptation for Cross-Domain Object Detection

Sep 03, 2023
Onkar Krishna, Hiroki Ohashi, Saptarshi Sinha

Figure 1 for MILA: Memory-Based Instance-Level Adaptation for Cross-Domain Object Detection
Figure 2 for MILA: Memory-Based Instance-Level Adaptation for Cross-Domain Object Detection
Figure 3 for MILA: Memory-Based Instance-Level Adaptation for Cross-Domain Object Detection
Figure 4 for MILA: Memory-Based Instance-Level Adaptation for Cross-Domain Object Detection
Viaarxiv icon

M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce

Aug 22, 2023
Tao Chen, Ze Lin, Hui Li, Jiayi Ji, Yiyi Zhou, Guanbin Li, Rongrong Ji

Figure 1 for M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce
Figure 2 for M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce
Figure 3 for M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce
Figure 4 for M3PS: End-to-End Multi-Grained Multi-Modal Attribute-Aware Product Summarization in E-commerce
Viaarxiv icon

Counting Guidance for High Fidelity Text-to-Image Synthesis

Jun 30, 2023
Wonjun Kang, Kevin Galim, Hyung Il Koo

Figure 1 for Counting Guidance for High Fidelity Text-to-Image Synthesis
Figure 2 for Counting Guidance for High Fidelity Text-to-Image Synthesis
Figure 3 for Counting Guidance for High Fidelity Text-to-Image Synthesis
Figure 4 for Counting Guidance for High Fidelity Text-to-Image Synthesis
Viaarxiv icon