Alert button

"Image": models, code, and papers
Alert button

Table and Image Generation for Investigating Knowledge of Entities in Pre-trained Vision and Language Models

Add code
Bookmark button
Alert button
Jun 03, 2023
Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

Figure 1 for Table and Image Generation for Investigating Knowledge of Entities in Pre-trained Vision and Language Models
Figure 2 for Table and Image Generation for Investigating Knowledge of Entities in Pre-trained Vision and Language Models
Figure 3 for Table and Image Generation for Investigating Knowledge of Entities in Pre-trained Vision and Language Models
Figure 4 for Table and Image Generation for Investigating Knowledge of Entities in Pre-trained Vision and Language Models
Viaarxiv icon

Exploring the Grounding Issues in Image Caption

May 24, 2023
Pin-Er Chen, Hsin-Yu Chou, Po-Ya Angela Wang, Yu-Hsiang Tseng, Shu-Kai Hsieh

Figure 1 for Exploring the Grounding Issues in Image Caption
Figure 2 for Exploring the Grounding Issues in Image Caption
Figure 3 for Exploring the Grounding Issues in Image Caption
Figure 4 for Exploring the Grounding Issues in Image Caption
Viaarxiv icon

Enhanced Masked Image Modeling for Analysis of Dental Panoramic Radiographs

Jun 18, 2023
Amani Almalki, Longin Jan Latecki

Figure 1 for Enhanced Masked Image Modeling for Analysis of Dental Panoramic Radiographs
Figure 2 for Enhanced Masked Image Modeling for Analysis of Dental Panoramic Radiographs
Figure 3 for Enhanced Masked Image Modeling for Analysis of Dental Panoramic Radiographs
Figure 4 for Enhanced Masked Image Modeling for Analysis of Dental Panoramic Radiographs
Viaarxiv icon

Efficient Contextformer: Spatio-Channel Window Attention for Fast Context Modeling in Learned Image Compression

Jun 25, 2023
A. Burakhan Koyuncu, Panqi Jia, Atanas Boev, Elena Alshina, Eckehard Steinbach

Figure 1 for Efficient Contextformer: Spatio-Channel Window Attention for Fast Context Modeling in Learned Image Compression
Figure 2 for Efficient Contextformer: Spatio-Channel Window Attention for Fast Context Modeling in Learned Image Compression
Figure 3 for Efficient Contextformer: Spatio-Channel Window Attention for Fast Context Modeling in Learned Image Compression
Figure 4 for Efficient Contextformer: Spatio-Channel Window Attention for Fast Context Modeling in Learned Image Compression
Viaarxiv icon

AC-Norm: Effective Tuning for Medical Image Analysis via Affine Collaborative Normalization

Add code
Bookmark button
Alert button
Jul 28, 2023
Chuyan Zhang, Yuncheng Yang, Hao Zheng, Yun Gu

Figure 1 for AC-Norm: Effective Tuning for Medical Image Analysis via Affine Collaborative Normalization
Figure 2 for AC-Norm: Effective Tuning for Medical Image Analysis via Affine Collaborative Normalization
Figure 3 for AC-Norm: Effective Tuning for Medical Image Analysis via Affine Collaborative Normalization
Figure 4 for AC-Norm: Effective Tuning for Medical Image Analysis via Affine Collaborative Normalization
Viaarxiv icon

Generalizable Synthetic Image Detection via Language-guided Contrastive Learning

Add code
Bookmark button
Alert button
May 23, 2023
Haiwei Wu, Jiantao Zhou, Shile Zhang

Figure 1 for Generalizable Synthetic Image Detection via Language-guided Contrastive Learning
Figure 2 for Generalizable Synthetic Image Detection via Language-guided Contrastive Learning
Figure 3 for Generalizable Synthetic Image Detection via Language-guided Contrastive Learning
Figure 4 for Generalizable Synthetic Image Detection via Language-guided Contrastive Learning
Viaarxiv icon

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation

Add code
Bookmark button
Alert button
May 31, 2023
Chi Zhang, Yiwen Chen, Yijun Fu, Zhenglin Zhou, Gang YU, Billzb Wang, Bin Fu, Tao Chen, Guosheng Lin, Chunhua Shen

Figure 1 for StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation
Figure 2 for StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation
Figure 3 for StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation
Figure 4 for StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation
Viaarxiv icon

Can Self-Supervised Representation Learning Methods Withstand Distribution Shifts and Corruptions?

Add code
Bookmark button
Alert button
Jul 31, 2023
Prakash Chandra Chhipa, Johan Rodahl Holmgren, Kanjar De, Rajkumar Saini, Marcus Liwicki

Figure 1 for Can Self-Supervised Representation Learning Methods Withstand Distribution Shifts and Corruptions?
Figure 2 for Can Self-Supervised Representation Learning Methods Withstand Distribution Shifts and Corruptions?
Figure 3 for Can Self-Supervised Representation Learning Methods Withstand Distribution Shifts and Corruptions?
Figure 4 for Can Self-Supervised Representation Learning Methods Withstand Distribution Shifts and Corruptions?
Viaarxiv icon

A Study on Quantifying Sim2Real Image Gap in Autonomous Driving Simulations Using Lane Segmentation Attention Map Similarity

Jun 18, 2023
Seongjeong Park, Jinu Pahk, Lennart Lorenz Freimuth Jahn, Yongseob Lim, Jinung An, Gyeungho Choi

Figure 1 for A Study on Quantifying Sim2Real Image Gap in Autonomous Driving Simulations Using Lane Segmentation Attention Map Similarity
Figure 2 for A Study on Quantifying Sim2Real Image Gap in Autonomous Driving Simulations Using Lane Segmentation Attention Map Similarity
Figure 3 for A Study on Quantifying Sim2Real Image Gap in Autonomous Driving Simulations Using Lane Segmentation Attention Map Similarity
Figure 4 for A Study on Quantifying Sim2Real Image Gap in Autonomous Driving Simulations Using Lane Segmentation Attention Map Similarity
Viaarxiv icon

PUGAN: Physical Model-Guided Underwater Image Enhancement Using GAN with Dual-Discriminators

Add code
Bookmark button
Alert button
Jun 15, 2023
Runmin Cong, Wenyu Yang, Wei Zhang, Chongyi Li, Chun-Le Guo, Qingming Huang, Sam Kwong

Figure 1 for PUGAN: Physical Model-Guided Underwater Image Enhancement Using GAN with Dual-Discriminators
Figure 2 for PUGAN: Physical Model-Guided Underwater Image Enhancement Using GAN with Dual-Discriminators
Figure 3 for PUGAN: Physical Model-Guided Underwater Image Enhancement Using GAN with Dual-Discriminators
Figure 4 for PUGAN: Physical Model-Guided Underwater Image Enhancement Using GAN with Dual-Discriminators
Viaarxiv icon