"Image": models, code, and papers

From CLIP to DINO: Visual Encoders Shout in Multi-modal Large Language Models

Oct 13, 2023
Dongsheng Jiang, Yuchen Liu, Songlin Liu, Xiaopeng Zhang, Jin Li, Hongkai Xiong, Qi Tian

Via arXiv

No Token Left Behind: Efficient Vision Transformer via Dynamic Token Idling

Oct 09, 2023
Xuwei Xu, Changlin Li, Yudong Chen, Xiaojun Chang, Jiajun Liu, Sen Wang

Via arXiv

SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation

Aug 04, 2023
Shikun Sun, Longhui Wei, Junliang Xing, Jia Jia, Qi Tian

Via arXiv

Research on Key Technologies of Infrastructure Digitalization based on Multimodal Spatial Data

Oct 22, 2023
Zhanyuan Tian, Tianrui Zhu, Zerui Tian, Zhen Dong

Via arXiv

ASPIRE: Language-Guided Augmentation for Robust Image Classification

Aug 19, 2023
Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, Sakshi Singh, Sanjoy Chowdhury, Dinesh Manocha

Via arXiv

Deep Cardiac MRI Reconstruction with ADMM

Oct 10, 2023
George Yiasemis, Nikita Moriakov, Jan-Jakob Sonke, Jonas Teuwen

Via arXiv

Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffusion Model

Sep 07, 2023
Sungwon Hwang, Junha Hyung, Jaegul Choo

Via arXiv

MS-UNet-v2: Adaptive Denoising Method and Training Strategy for Medical Image Segmentation with Small Training Data

Sep 07, 2023
Haoyuan Chen, Yufei Han, Pin Xu, Yanyi Li, Kuan Li, Jianping Yin

Via arXiv

CrowdRec: 3D Crowd Reconstruction from Single Color Images

Oct 10, 2023
Buzhen Huang, Jingyi Ju, Yangang Wang

Via arXiv

Geom-Erasing: Geometry-Driven Removal of Implicit Concept in Diffusion Models

Oct 10, 2023
Zhili Liu, Kai Chen, Yifan Zhang, Jianhua Han, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung, James Kwok

Via arXiv