"Image": models, code, and papers

Unlocking Pre-trained Image Backbones for Semantic Image Synthesis

Jan 08, 2024
Tariq Berrada, Jakob Verbeek, Camille Couprie, Karteek Alahari

Via arXiv

CLIP Can Understand Depth

Feb 05, 2024
Dunam Kim, Seokju Lee

Via arXiv

PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology

Feb 05, 2024
Yuxuan Sun, Hao Wu, Chenglu Zhu, Sunyi Zheng, Qizi Chen, Kai Zhang, Yunlong Zhang, Xiaoxiao Lan, Mengyue Zheng, Jingxiong Li, Xinheng Lyu, Tao Lin, Lin Yang

Via arXiv

A Review on Digital Pixel Sensors

Feb 07, 2024
Md Rahatul Islam Udoy, Shamiul Alam, Md Mazharul Islam, Akhilesh Jaiswal, Ahmedullah Aziz

Via arXiv

LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation

Feb 07, 2024
Jiaxiang Tang, Zhaoxi Chen, Xiaokang Chen, Tengfei Wang, Gang Zeng, Ziwei Liu

Via arXiv

FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models

Feb 07, 2024
Chuhao Liu, Ke Wang, Jieqi Shi, Zhijian Qiao, Shaojie Shen

Via arXiv

DMAT: A Dynamic Mask-Aware Transformer for Human De-occlusion

Feb 07, 2024
Guoqiang Liang, Jiahao Hu, Qingyue Wang, Shizhou Zhang

Via arXiv

EvoSeed: Unveiling the Threat on Deep Neural Networks with Real-World Illusions

Feb 07, 2024
Shashank Kotyan, PoYuan Mao, Danilo Vasconcellos Vargas

Via arXiv

Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models

Feb 07, 2024
Fangzhao Zhang, Mert Pilanci

Via arXiv

Generalizing GradCAM for Embedding Networks

Feb 05, 2024
Mudit Bachhawat

Via arXiv