"Image": models, code, and papers

Counterfactual Image Generation for adversarially robust and interpretable Classifiers

Oct 01, 2023
Rafael Bischof, Florian Scheidegger, Michael A. Kraus, A. Cristiano I. Malossi

Machine learning refinement of in situ images acquired by low electron dose LC-TEM

Oct 31, 2023
Hiroyasu Katsuno, Yuki Kimura, Tomoya Yamazaki, Ichigaku Takigawa

Labeling Indoor Scenes with Fusion of Out-of-the-Box Perception Models

Nov 17, 2023
Yimeng Li, Navid Rajabi, Sulabh Shrestha, Md Alimoor Reza, Jana Kosecka

Multi-entity Video Transformers for Fine-Grained Video Representation Learning

Nov 17, 2023
Matthew Walmer, Rose Kanjirathinkal, Kai Sheng Tai, Keyur Muzumdar, Taipeng Tian, Abhinav Shrivastava

On Manipulating Scene Text in the Wild with Diffusion Models

Nov 03, 2023
Joshua Santoso, Christian Simon, Williem Pao

GPT-4V(ision) as a Generalist Evaluator for Vision-Language Tasks

Nov 02, 2023
Xinlu Zhang, Yujie Lu, Weizhi Wang, An Yan, Jun Yan, Lianke Qin, Heng Wang, Xifeng Yan, William Yang Wang, Linda Ruth Petzold

Zero-shot audio captioning with audio-language model guidance and audio context keywords

Nov 14, 2023
Leonard Salewski, Stefan Fauth, A. Sophia Koepke, Zeynep Akata

CogVLM: Visual Expert for Pretrained Language Models

Nov 06, 2023
Weihan Wang, Qingsong Lv, Wenmeng Yu, Wenyi Hong, Ji Qi, Yan Wang, Junhui Ji, Zhuoyi Yang, Lei Zhao, Xixuan Song, Jiazheng Xu, Bin Xu, Juanzi Li, Yuxiao Dong, Ming Ding, Jie Tang

VcT: Visual change Transformer for Remote Sensing Image Change Detection

Oct 17, 2023
Bo Jiang, Zitian Wang, Xixi Wang, Ziyan Zhang, Lan Chen, Xiao Wang, Bin Luo

A Data Perspective on Enhanced Identity Preservation for Diffusion Personalization

Nov 07, 2023
Xingzhe He, Zhiwen Cao, Nicholas Kolkin, Lantao Yu, Helge Rhodin, Ratheesh Kalarot
