Alert button

"Image": models, code, and papers
Alert button

Improved EATFormer: A Vision Transformer for Medical Image Classification

Mar 19, 2024
Yulong Shisu, Susano Mingwin, Yongshuai Wanwag, Zengqiang Chenso, Sunshin Huing

Figure 1 for Improved EATFormer: A Vision Transformer for Medical Image Classification
Figure 2 for Improved EATFormer: A Vision Transformer for Medical Image Classification
Figure 3 for Improved EATFormer: A Vision Transformer for Medical Image Classification
Figure 4 for Improved EATFormer: A Vision Transformer for Medical Image Classification
Viaarxiv icon

Lost in Translation? Translation Errors and Challenges for Fair Assessment of Text-to-Image Models on Multilingual Concepts

Add code
Bookmark button
Alert button
Mar 17, 2024
Michael Saxon, Yiran Luo, Sharon Levy, Chitta Baral, Yezhou Yang, William Yang Wang

Figure 1 for Lost in Translation? Translation Errors and Challenges for Fair Assessment of Text-to-Image Models on Multilingual Concepts
Figure 2 for Lost in Translation? Translation Errors and Challenges for Fair Assessment of Text-to-Image Models on Multilingual Concepts
Figure 3 for Lost in Translation? Translation Errors and Challenges for Fair Assessment of Text-to-Image Models on Multilingual Concepts
Figure 4 for Lost in Translation? Translation Errors and Challenges for Fair Assessment of Text-to-Image Models on Multilingual Concepts
Viaarxiv icon

Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models

Mar 26, 2024
Minchan Kim, Minyeong Kim, Junik Bae, Suhwan Choi, Sungkyung Kim, Buru Chang

Viaarxiv icon

Super-Resolution of SOHO/MDI Magnetograms of Solar Active Regions Using SDO/HMI Data and an Attention-Aided Convolutional Neural Network

Mar 27, 2024
Chunhui Xu, Jason T. L. Wang, Haimin Wang, Haodi Jiang, Qin Li, Yasser Abduallah, Yan Xu

Viaarxiv icon

Towards Realistic Scene Generation with LiDAR Diffusion Models

Add code
Bookmark button
Alert button
Mar 31, 2024
Haoxi Ran, Vitor Guizilini, Yue Wang

Viaarxiv icon

Attack Deterministic Conditional Image Generative Models for Diverse and Controllable Generation

Mar 13, 2024
Tianyi Chu, Wei Xing, Jiafu Chen, Zhizhong Wang, Jiakai Sun, Lei Zhao, Haibo Chen, Huaizhong Lin

Figure 1 for Attack Deterministic Conditional Image Generative Models for Diverse and Controllable Generation
Figure 2 for Attack Deterministic Conditional Image Generative Models for Diverse and Controllable Generation
Figure 3 for Attack Deterministic Conditional Image Generative Models for Diverse and Controllable Generation
Figure 4 for Attack Deterministic Conditional Image Generative Models for Diverse and Controllable Generation
Viaarxiv icon

ViTAR: Vision Transformer with Any Resolution

Mar 28, 2024
Qihang Fan, Quanzeng You, Xiaotian Han, Yongfei Liu, Yunzhe Tao, Huaibo Huang, Ran He, Hongxia Yang

Viaarxiv icon

Plug-and-Play Grounding of Reasoning in Multimodal Large Language Models

Add code
Bookmark button
Alert button
Mar 28, 2024
Jiaxing Chen, Yuxuan Liu, Dehu Li, Xiang An, Ziyong Feng, Yongle Zhao, Yin Xie

Viaarxiv icon

Shortcut Learning in Medical Image Segmentation

Mar 11, 2024
Manxi Lin, Nina Weng, Kamil Mikolaj, Zahra Bashir, Morten Bo Søndergaard Svendsen, Martin Tolsgaard, Anders Nymark Christensen, Aasa Feragen

Figure 1 for Shortcut Learning in Medical Image Segmentation
Figure 2 for Shortcut Learning in Medical Image Segmentation
Figure 3 for Shortcut Learning in Medical Image Segmentation
Figure 4 for Shortcut Learning in Medical Image Segmentation
Viaarxiv icon

Active Generation for Image Classification

Add code
Bookmark button
Alert button
Mar 11, 2024
Tao Huang, Jiaqi Liu, Shan You, Chang Xu

Figure 1 for Active Generation for Image Classification
Figure 2 for Active Generation for Image Classification
Figure 3 for Active Generation for Image Classification
Figure 4 for Active Generation for Image Classification
Viaarxiv icon