"Image": models, code, and papers

Brush Your Text: Synthesize Any Scene Text on Images via Diffusion Model

Dec 19, 2023
Lingjun Zhang, Xinyuan Chen, Yaohui Wang, Yue Lu, Yu Qiao

Unsupervised Segmentation of Colonoscopy Images

Dec 19, 2023
Heming Yao, Jérôme Lüscher, Benjamin Gutierrez Becker, Josep Arús-Pous, Tommaso Biancalani, Amelie Bigorgne, David Richmond

SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction

Dec 28, 2023
Zikang Yuan, Jie Deng, Ruiye Ming, Fengtian Lang, Xin Yang

Research on the Laws of Multimodal Perception and Cognition from a Cross-cultural Perspective -- Taking Overseas Chinese Gardens as an Example

Dec 29, 2023
Ran Chen, Xueqi Yao, Jing Zhao, Shuhan Xu, Sirui Zhang, Yijun Mao

Toward Spatial Temporal Consistency of Joint Visual Tactile Perception in VR Applications

Dec 29, 2023
Fuqiang Zhao, Kehan Zhang, Qian Liu, Zhuoyi Lyu

KOALA: Self-Attention Matters in Knowledge Distillation of Latent Diffusion Models for Memory-Efficient and Fast Image Synthesis

Dec 07, 2023
Youngwan Lee, Kwanyong Park, Yoorhim Cho, Yong-Ju Lee, Sung Ju Hwang

A Two-stream Hybrid CNN-Transformer Network for Skeleton-based Human Interaction Recognition

Dec 31, 2023
Ruoqi Yin, Jianqin Yin

Fast Diffusion-Based Counterfactuals for Shortcut Removal and Generation

Dec 21, 2023
Nina Weng, Paraskevas Pegios, Aasa Feragen, Eike Petersen, Siavash Bigdeli

Towards General Purpose Vision Foundation Models for Medical Image Analysis: An Experimental Study of DINOv2 on Radiology Benchmarks

Dec 04, 2023
Mohammed Baharoon, Waseem Qureshi, Jiahong Ouyang, Yanwu Xu, Kilian Pohl, Abdulrhman Aljouie, Wei Peng

SA$^2$VP: Spatially Aligned-and-Adapted Visual Prompt

Dec 16, 2023
Wenjie Pei, Tongqi Xia, Fanglin Chen, Jinsong Li, Jiandong Tian, Guangming Lu