Alert button

"Image": models, code, and papers
Alert button

Kosmos-2.5: A Multimodal Literate Model

Add code
Bookmark button
Alert button
Sep 20, 2023
Tengchao Lv, Yupan Huang, Jingye Chen, Lei Cui, Shuming Ma, Yaoyao Chang, Shaohan Huang, Wenhui Wang, Li Dong, Weiyao Luo, Shaoxiang Wu, Guoxin Wang, Cha Zhang, Furu Wei

Figure 1 for Kosmos-2.5: A Multimodal Literate Model
Figure 2 for Kosmos-2.5: A Multimodal Literate Model
Figure 3 for Kosmos-2.5: A Multimodal Literate Model
Figure 4 for Kosmos-2.5: A Multimodal Literate Model
Viaarxiv icon

Face Aging via Diffusion-based Editing

Add code
Bookmark button
Alert button
Sep 20, 2023
Xiangyi Chen, Stéphane Lathuilière

Figure 1 for Face Aging via Diffusion-based Editing
Figure 2 for Face Aging via Diffusion-based Editing
Figure 3 for Face Aging via Diffusion-based Editing
Figure 4 for Face Aging via Diffusion-based Editing
Viaarxiv icon

MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion

Add code
Bookmark button
Alert button
Aug 08, 2023
Yizhuo Lu, Changde Du, Qiongyi zhou, Dianpeng Wang, Huiguang He

Figure 1 for MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion
Figure 2 for MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion
Figure 3 for MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion
Figure 4 for MindDiffuser: Controlled Image Reconstruction from Human Brain Activity with Semantic and Structural Diffusion
Viaarxiv icon

Global-correlated 3D-decoupling Transformer for Clothed Avatar Reconstruction

Add code
Bookmark button
Alert button
Sep 26, 2023
Zechuan Zhang, Li Sun, Zongxin Yang, Ling Chen, Yi Yang

Viaarxiv icon

VPA: Fully Test-Time Visual Prompt Adaptation

Sep 26, 2023
Jiachen Sun, Mark Ibrahim, Melissa Hall, Ivan Evtimov, Z. Morley Mao, Cristian Canton Ferrer, Caner Hazirbas

Figure 1 for VPA: Fully Test-Time Visual Prompt Adaptation
Figure 2 for VPA: Fully Test-Time Visual Prompt Adaptation
Figure 3 for VPA: Fully Test-Time Visual Prompt Adaptation
Figure 4 for VPA: Fully Test-Time Visual Prompt Adaptation
Viaarxiv icon

CTP-Net: Character Texture Perception Network for Document Image Forgery Localization

Aug 04, 2023
Xin Liao, Siliang Chen, Jiaxin Chen, Tianyi Wang, Xiehua Li

Figure 1 for CTP-Net: Character Texture Perception Network for Document Image Forgery Localization
Figure 2 for CTP-Net: Character Texture Perception Network for Document Image Forgery Localization
Figure 3 for CTP-Net: Character Texture Perception Network for Document Image Forgery Localization
Figure 4 for CTP-Net: Character Texture Perception Network for Document Image Forgery Localization
Viaarxiv icon

How Does Pruning Impact Long-Tailed Multi-Label Medical Image Classifiers?

Add code
Bookmark button
Alert button
Aug 17, 2023
Gregory Holste, Ziyu Jiang, Ajay Jaiswal, Maria Hanna, Shlomo Minkowitz, Alan C. Legasto, Joanna G. Escalon, Sharon Steinberger, Mark Bittman, Thomas C. Shen, Ying Ding, Ronald M. Summers, George Shih, Yifan Peng, Zhangyang Wang

Figure 1 for How Does Pruning Impact Long-Tailed Multi-Label Medical Image Classifiers?
Figure 2 for How Does Pruning Impact Long-Tailed Multi-Label Medical Image Classifiers?
Figure 3 for How Does Pruning Impact Long-Tailed Multi-Label Medical Image Classifiers?
Figure 4 for How Does Pruning Impact Long-Tailed Multi-Label Medical Image Classifiers?
Viaarxiv icon

Text-guided Foundation Model Adaptation for Pathological Image Classification

Add code
Bookmark button
Alert button
Jul 27, 2023
Yunkun Zhang, Jin Gao, Mu Zhou, Xiaosong Wang, Yu Qiao, Shaoting Zhang, Dequan Wang

Figure 1 for Text-guided Foundation Model Adaptation for Pathological Image Classification
Figure 2 for Text-guided Foundation Model Adaptation for Pathological Image Classification
Figure 3 for Text-guided Foundation Model Adaptation for Pathological Image Classification
Figure 4 for Text-guided Foundation Model Adaptation for Pathological Image Classification
Viaarxiv icon

AG-CRC: Anatomy-Guided Colorectal Cancer Segmentation in CT with Imperfect Anatomical Knowledge

Add code
Bookmark button
Alert button
Oct 07, 2023
Rongzhao Zhang, Zhian Bai, Ruoying Yu, Wenrao Pang, Lingyun Wang, Lifeng Zhu, Xiaofan Zhang, Huan Zhang, Weiguo Hu

Viaarxiv icon

Activate and Reject: Towards Safe Domain Generalization under Category Shift

Oct 07, 2023
Chaoqi Chen, Luyao Tang, Leitian Tao, Hong-Yu Zhou, Yue Huang, Xiaoguang Han, Yizhou Yu

Figure 1 for Activate and Reject: Towards Safe Domain Generalization under Category Shift
Figure 2 for Activate and Reject: Towards Safe Domain Generalization under Category Shift
Figure 3 for Activate and Reject: Towards Safe Domain Generalization under Category Shift
Figure 4 for Activate and Reject: Towards Safe Domain Generalization under Category Shift
Viaarxiv icon