Alert button

"Image": models, code, and papers
Alert button

CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation

Sep 22, 2023
Xiaoheng Jiang, Kaiyi Guo, Yang Lu, Feng Yan, Hao Liu, Jiale Cao, Mingliang Xu, Dacheng Tao

Figure 1 for CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation
Figure 2 for CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation
Figure 3 for CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation
Figure 4 for CINFormer: Transformer network with multi-stage CNN feature injection for surface defect segmentation
Viaarxiv icon

RHINO: Regularizing the Hash-based Implicit Neural Representation

Sep 22, 2023
Hao Zhu, Fengyi Liu, Qi Zhang, Xun Cao, Zhan Ma

Viaarxiv icon

Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images

Sep 11, 2023
Qingping Zheng, Yuanfan Guo, Jiankang Deng, Jianhua Han, Ying Li, Songcen Xu, Hang Xu

Figure 1 for Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images
Figure 2 for Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images
Figure 3 for Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images
Figure 4 for Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images
Viaarxiv icon

Attention as Annotation: Generating Images and Pseudo-masks for Weakly Supervised Semantic Segmentation with Diffusion

Sep 04, 2023
Ryota Yoshihashi, Yuya Otsuka, Kenji Doi, Tomohiro Tanaka

Figure 1 for Attention as Annotation: Generating Images and Pseudo-masks for Weakly Supervised Semantic Segmentation with Diffusion
Figure 2 for Attention as Annotation: Generating Images and Pseudo-masks for Weakly Supervised Semantic Segmentation with Diffusion
Figure 3 for Attention as Annotation: Generating Images and Pseudo-masks for Weakly Supervised Semantic Segmentation with Diffusion
Figure 4 for Attention as Annotation: Generating Images and Pseudo-masks for Weakly Supervised Semantic Segmentation with Diffusion
Viaarxiv icon

Viewpoint Textual Inversion: Unleashing Novel View Synthesis with Pretrained 2D Diffusion Models

Sep 14, 2023
James Burgess, Kuan-Chieh Wang, Serena Yeung

Viaarxiv icon

Semantic Adversarial Attacks via Diffusion Models

Sep 14, 2023
Chenan Wang, Jinhao Duan, Chaowei Xiao, Edward Kim, Matthew Stamm, Kaidi Xu

Viaarxiv icon

Efficient Post-processing of Diffusion Tensor Cardiac Magnetic Imaging Using Texture-conserving Deformable Registration

Sep 12, 2023
Fanwen Wang, Pedro F. Ferreira, Yinzhe Wu, Andrew D. Scott, Camila Munoz, Ke Wen, Yaqing Luo, Jiahao Huang, Sonia Nielles-Vallespin, Dudley J. Pennell, Guang Yang

Figure 1 for Efficient Post-processing of Diffusion Tensor Cardiac Magnetic Imaging Using Texture-conserving Deformable Registration
Figure 2 for Efficient Post-processing of Diffusion Tensor Cardiac Magnetic Imaging Using Texture-conserving Deformable Registration
Figure 3 for Efficient Post-processing of Diffusion Tensor Cardiac Magnetic Imaging Using Texture-conserving Deformable Registration
Figure 4 for Efficient Post-processing of Diffusion Tensor Cardiac Magnetic Imaging Using Texture-conserving Deformable Registration
Viaarxiv icon

Semantic and Articulated Pedestrian Sensing Onboard a Moving Vehicle

Sep 12, 2023
Maria Priisalu

Figure 1 for Semantic and Articulated Pedestrian Sensing Onboard a Moving Vehicle
Figure 2 for Semantic and Articulated Pedestrian Sensing Onboard a Moving Vehicle
Figure 3 for Semantic and Articulated Pedestrian Sensing Onboard a Moving Vehicle
Figure 4 for Semantic and Articulated Pedestrian Sensing Onboard a Moving Vehicle
Viaarxiv icon

Language Models as Black-Box Optimizers for Vision-Language Models

Sep 25, 2023
Shihong Liu, Samuel Yu, Zhiqiu Lin, Deepak Pathak, Deva Ramanan

Figure 1 for Language Models as Black-Box Optimizers for Vision-Language Models
Figure 2 for Language Models as Black-Box Optimizers for Vision-Language Models
Figure 3 for Language Models as Black-Box Optimizers for Vision-Language Models
Figure 4 for Language Models as Black-Box Optimizers for Vision-Language Models
Viaarxiv icon

A Foundation LAnguage-Image model of the Retina (FLAIR): Encoding expert knowledge in text supervision

Aug 15, 2023
Julio Silva-Rodriguez, Hadi Chakor, Riadh Kobbi, Jose Dolz, Ismail Ben Ayed

Figure 1 for A Foundation LAnguage-Image model of the Retina (FLAIR): Encoding expert knowledge in text supervision
Figure 2 for A Foundation LAnguage-Image model of the Retina (FLAIR): Encoding expert knowledge in text supervision
Figure 3 for A Foundation LAnguage-Image model of the Retina (FLAIR): Encoding expert knowledge in text supervision
Figure 4 for A Foundation LAnguage-Image model of the Retina (FLAIR): Encoding expert knowledge in text supervision
Viaarxiv icon