"Image": models, code, and papers

Relational Concept Based Models

Aug 23, 2023
Pietro Barbiero, Francesco Giannini, Gabriele Ciravegna, Michelangelo Diligenti, Giuseppe Marra

InFusion: Inject and Attention Fusion for Multi Concept Zero-Shot Text-based Video Editing

Aug 10, 2023
Anant Khandelwal

Story Visualization by Online Text Augmentation with Context Memory

Aug 15, 2023
Daechul Ahn, Daneul Kim, Gwangmo Song, Seung Hwan Kim, Honglak Lee, Dongyeop Kang, Jonghyun Choi

GeoDTR+: Toward generic cross-view geolocalization via geometric disentanglement

Aug 18, 2023
Xiaohan Zhang, Xingyu Li, Waqas Sultani, Chen Chen, Safwan Wshah

Denoising Diffusion for 3D Hand Pose Estimation from Images

Aug 18, 2023
Maksym Ivashechkin, Oscar Mendez, Richard Bowden

Stable and Causal Inference for Discriminative Self-supervised Deep Visual Representations

Aug 16, 2023
Yuewei Yang, Hai Li, Yiran Chen

Pro-Cap: Leveraging a Frozen Vision-Language Model for Hateful Meme Detection

Aug 16, 2023
Rui Cao, Ming Shan Hee, Adriel Kuek, Wen-Haw Chong, Roy Ka-Wei Lee, Jing Jiang

Text-Only Training for Visual Storytelling

Aug 17, 2023
Yuechen Wang, Wengang Zhou, Zhenbo Lu, Houqiang Li

Introducing Language Guidance in Prompt-based Continual Learning

Aug 30, 2023
Muhammad Gul Zain Ali Khan, Muhammad Ferjad Naeem, Luc Van Gool, Didier Stricker, Federico Tombari, Muhammad Zeshan Afzal

Knowledge Detection by Relevant Question and Image Attributes in Visual Question Answering

Jun 08, 2023
Param Ahir, Dr. Hiteishi Diwanji
