Alert button

"Image": models, code, and papers
Alert button

ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation

Feb 23, 2024
Yi Zhang, Yun Tang, Wenjie Ruan, Xiaowei Huang, Siddartha Khastgir, Paul Jennings, Xingyu Zhao

Viaarxiv icon

Physics-Informed Learning for Time-Resolved Angiographic Contrast Agent Concentration Reconstruction

Mar 04, 2024
Noah Maul, Annette Birkhold, Fabian Wagner, Mareike Thies, Maximilian Rohleder, Philipp Berg, Markus Kowarschik, Andreas Maier

Figure 1 for Physics-Informed Learning for Time-Resolved Angiographic Contrast Agent Concentration Reconstruction
Figure 2 for Physics-Informed Learning for Time-Resolved Angiographic Contrast Agent Concentration Reconstruction
Figure 3 for Physics-Informed Learning for Time-Resolved Angiographic Contrast Agent Concentration Reconstruction
Figure 4 for Physics-Informed Learning for Time-Resolved Angiographic Contrast Agent Concentration Reconstruction
Viaarxiv icon

ChatEarthNet: A Global-Scale, High-Quality Image-Text Dataset for Remote Sensing

Feb 17, 2024
Zhenghang Yuan, Zhitong Xiong, Lichao Mou, Xiao Xiang Zhu

Viaarxiv icon

Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks

Mar 01, 2024
Yuhao Liu, Fang Liu, Zhanghan Ke, Nanxuan Zhao, Rynson W. H. Lau

Figure 1 for Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks
Figure 2 for Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks
Figure 3 for Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks
Figure 4 for Diff-Plugin: Revitalizing Details for Diffusion-based Low-level Tasks
Viaarxiv icon

Invariant Test-Time Adaptation for Vision-Language Model Generalization

Mar 01, 2024
Huan Ma, Yan Zhu, Changqing Zhang, Peilin Zhao, Baoyuan Wu, Long-Kai Huang, Qinghua Hu, Bingzhe Wu

Figure 1 for Invariant Test-Time Adaptation for Vision-Language Model Generalization
Figure 2 for Invariant Test-Time Adaptation for Vision-Language Model Generalization
Figure 3 for Invariant Test-Time Adaptation for Vision-Language Model Generalization
Figure 4 for Invariant Test-Time Adaptation for Vision-Language Model Generalization
Viaarxiv icon

Neural Radiance Fields in Medical Imaging: Challenges and Next Steps

Mar 02, 2024
Xin Wang, Shu Hu, Heng Fan, Hongtu Zhu, Xin Li

Viaarxiv icon

TAMM: TriAdapter Multi-Modal Learning for 3D Shape Understanding

Feb 28, 2024
Zhihao Zhang, Shengcao Cao, Yu-Xiong Wang

Viaarxiv icon

OpenMEDLab: An Open-source Platform for Multi-modality Foundation Models in Medicine

Mar 04, 2024
Xiaosong Wang, Xiaofan Zhang, Guotai Wang, Junjun He, Zhongyu Li, Wentao Zhu, Yi Guo, Qi Dou, Xiaoxiao Li, Dequan Wang, Liang Hong, Qicheng Lao, Tong Ruan, Yukun Zhou, Yixue Li, Jie Zhao, Kang Li, Xin Sun, Lifeng Zhu, Shaoting Zhang

Viaarxiv icon

LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition

Mar 03, 2024
Lingfeng Liu, Dong Ni, Hangjie Yuan

Figure 1 for LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition
Figure 2 for LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition
Figure 3 for LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition
Figure 4 for LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition
Viaarxiv icon

MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies

Mar 03, 2024
Zhende Song, Chenchen Wang, Jiamu Sheng, Chi Zhang, Gang Yu, Jiayuan Fan, Tao Chen

Figure 1 for MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Figure 2 for MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Figure 3 for MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Figure 4 for MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Viaarxiv icon