"Image": models, code, and papers

FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded Memory

Aug 20, 2023
Anwesan Pal, Sahil Wadhwa, Ayush Jaiswal, Xu Zhang, Yue Wu, Rakesh Chada, Pradeep Natarajan, Henrik I. Christensen

Via arXiv

T$^3$Bench: Benchmarking Current Progress in Text-to-3D Generation

Oct 04, 2023
Yuze He, Yushi Bai, Matthieu Lin, Wang Zhao, Yubin Hu, Jenny Sheng, Ran Yi, Juanzi Li, Yong-Jin Liu

Via arXiv

Dual-Augmented Transformer Network for Weakly Supervised Semantic Segmentation

Sep 30, 2023
Jingliang Deng, Zonghan Li

Via arXiv

Improving Medical Image Classification in Noisy Labels Using Only Self-supervised Pretraining

Aug 08, 2023
Bidur Khanal, Binod Bhattarai, Bishesh Khanal, Cristian A. Linte

Via arXiv

PromptPaint: Steering Text-to-Image Generation Through Paint Medium-like Interactions

Aug 09, 2023
John Joon Young Chung, Eytan Adar

Via arXiv

YODA: You Only Diffuse Areas. An Area-Masked Diffusion Approach For Image Super-Resolution

Aug 15, 2023
Brian B. Moser, Stanislav Frolov, Federico Raue, Sebastian Palacio, Andreas Dengel

Via arXiv

Masked-Attention Diffusion Guidance for Spatially Controlling Text-to-Image Generation

Aug 11, 2023
Yuki Endo

Via arXiv

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation

Sep 22, 2023
Jiahao Xie, Wei Li, Xiangtai Li, Ziwei Liu, Yew Soon Ong, Chen Change Loy

Via arXiv

Zero-Shot Object Counting with Language-Vision Models

Sep 22, 2023
Jingyi Xu, Hieu Le, Dimitris Samaras

Via arXiv

Uncertainty-Aware Multi-View Visual Semantic Embedding

Sep 15, 2023
Wenzhang Wei, Zhipeng Gui, Changguang Wu, Anqi Zhao, Xingguang Wang, Huayi Wu

Via arXiv