"Image": models, code, and papers

From Patches to Objects: Exploiting Spatial Reasoning for Better Visual Representations

May 21, 2023
Toni Albert, Bjoern Eskofier, Dario Zanca

Via arXiv

Controlled and Conditional Text to Image Generation with Diffusion Prior

Feb 23, 2023
Pranav Aggarwal, Hareesh Ravi, Naveen Marri, Sachin Kelkar, Fengbin Chen, Vinh Khuc, Midhun Harikumar, Ritiz Tambi, Sudharshan Reddy Kakumanu, Purvak Lapsiya, Alvin Ghouas, Sarah Saber, Malavika Ramprasad, Baldo Faieta, Ajinkya Kale

Via arXiv

Learning to Scale Temperature in Masked Self-Attention for Image Inpainting

Feb 13, 2023
Xiang Zhou, Yuan Zeng, Yi Gong

Via arXiv

High-order Spatial Interactions Enhanced Lightweight Model for Optical Remote Sensing Image-based Small Ship Detection

Apr 07, 2023
Yifan Yin, Xu Cheng, Fan Shi, Xiufeng Liu, Huan Huo, Shengyong Chen

Via arXiv

Multi-source adversarial transfer learning based on similar source domains with local features

May 30, 2023
Yifu Zhang, Hongru Li, Shimeng Shi, Youqi Li, Jiansong Zhang

Via arXiv

UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild

May 18, 2023
Can Qin, Shu Zhang, Ning Yu, Yihao Feng, Xinyi Yang, Yingbo Zhou, Huan Wang, Juan Carlos Niebles, Caiming Xiong, Silvio Savarese, Stefano Ermon, Yun Fu, Ran Xu

Via arXiv

OpenShape: Scaling Up 3D Shape Representation Towards Open-World Understanding

May 18, 2023
Minghua Liu, Ruoxi Shi, Kaiming Kuang, Yinhao Zhu, Xuanlin Li, Shizhong Han, Hong Cai, Fatih Porikli, Hao Su

Via arXiv

BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models

Jan 30, 2023
Junnan Li, Dongxu Li, Silvio Savarese, Steven Hoi

Via arXiv

Fashionpedia-Taste: A Dataset towards Explaining Human Fashion Taste

May 03, 2023
Mengyun Shi, Serge Belongie, Claire Cardie

Via arXiv

Towards L-System Captioning for Tree Reconstruction

May 10, 2023
Jannes S. Magnusson, Anna Hilsmann, Peter Eisert

Via arXiv