Alert button

"Image": models, code, and papers
Alert button

Generic 3D Diffusion Adapter Using Controlled Multi-View Editing

Add code
Bookmark button
Alert button
Mar 19, 2024
Hansheng Chen, Ruoxi Shi, Yulin Liu, Bokui Shen, Jiayuan Gu, Gordon Wetzstein, Hao Su, Leonidas Guibas

Figure 1 for Generic 3D Diffusion Adapter Using Controlled Multi-View Editing
Figure 2 for Generic 3D Diffusion Adapter Using Controlled Multi-View Editing
Figure 3 for Generic 3D Diffusion Adapter Using Controlled Multi-View Editing
Figure 4 for Generic 3D Diffusion Adapter Using Controlled Multi-View Editing
Viaarxiv icon

Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation

Add code
Bookmark button
Alert button
Mar 11, 2024
Theodore Barfoot, Luis Garcia-Peraza-Herrera, Ben Glocker, Tom Vercauteren

Figure 1 for Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation
Figure 2 for Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation
Figure 3 for Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation
Figure 4 for Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation
Viaarxiv icon

Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting

Add code
Bookmark button
Alert button
Mar 22, 2024
Jun Guo, Xiaojian Ma, Yue Fan, Huaping Liu, Qing Li

Viaarxiv icon

SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models

Add code
Bookmark button
Alert button
Mar 20, 2024
Tongtian Yue, Jie Cheng, Longteng Guo, Xingyuan Dai, Zijia Zhao, Xingjian He, Gang Xiong, Yisheng Lv, Jing Liu

Figure 1 for SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models
Figure 2 for SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models
Figure 3 for SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models
Figure 4 for SC-Tune: Unleashing Self-Consistent Referential Comprehension in Large Vision Language Models
Viaarxiv icon

FFT-based Selection and Optimization of Statistics for Robust Recognition of Severely Corrupted Images

Mar 21, 2024
Elena Camuffo, Umberto Michieli, Jijoong Moon, Daehyun Kim, Mete Ozay

Figure 1 for FFT-based Selection and Optimization of Statistics for Robust Recognition of Severely Corrupted Images
Figure 2 for FFT-based Selection and Optimization of Statistics for Robust Recognition of Severely Corrupted Images
Figure 3 for FFT-based Selection and Optimization of Statistics for Robust Recognition of Severely Corrupted Images
Figure 4 for FFT-based Selection and Optimization of Statistics for Robust Recognition of Severely Corrupted Images
Viaarxiv icon

Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression

Add code
Bookmark button
Alert button
Mar 23, 2024
Hancheng Ye, Chong Yu, Peng Ye, Renqiu Xia, Yansong Tang, Jiwen Lu, Tao Chen, Bo Zhang

Viaarxiv icon

Innovative Quantitative Analysis for Disease Progression Assessment in Familial Cerebral Cavernous Malformations

Add code
Bookmark button
Alert button
Mar 23, 2024
Ruige Zong, Tao Wang, Chunwang Li, Xinlin Zhang, Yuanbin Chen, Longxuan Zhao, Qixuan Li, Qinquan Gao, Dezhi Kang, Fuxin Lin, Tong Tong

Viaarxiv icon

uniGradICON: A Foundation Model for Medical Image Registration

Add code
Bookmark button
Alert button
Mar 09, 2024
Lin Tian, Hastings Greer, Roland Kwitt, Francois-Xavier Vialard, Raul San Jose Estepar, Sylvain Bouix, Richard Rushmore, Marc Niethammer

Figure 1 for uniGradICON: A Foundation Model for Medical Image Registration
Figure 2 for uniGradICON: A Foundation Model for Medical Image Registration
Figure 3 for uniGradICON: A Foundation Model for Medical Image Registration
Figure 4 for uniGradICON: A Foundation Model for Medical Image Registration
Viaarxiv icon

Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing

Add code
Bookmark button
Alert button
Mar 06, 2024
Bingyan Liu, Chengyu Wang, Tingfeng Cao, Kui Jia, Jun Huang

Figure 1 for Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Figure 2 for Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Figure 3 for Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Figure 4 for Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing
Viaarxiv icon

Language-Based Depth Hints for Monocular Depth Estimation

Mar 22, 2024
Dylan Auty, Krystian Mikolajczyk

Viaarxiv icon