Alert button

"Image": models, code, and papers
Alert button

PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology

Feb 05, 2024
Yuxuan Sun, Hao Wu, Chenglu Zhu, Sunyi Zheng, Qizi Chen, Kai Zhang, Yunlong Zhang, Xiaoxiao Lan, Mengyue Zheng, Jingxiong Li, Xinheng Lyu, Tao Lin, Lin Yang

Viaarxiv icon

Generalizing GradCAM for Embedding Networks

Feb 05, 2024
Mudit Bachhawat

Figure 1 for Generalizing GradCAM for Embedding Networks
Figure 2 for Generalizing GradCAM for Embedding Networks
Figure 3 for Generalizing GradCAM for Embedding Networks
Viaarxiv icon

Polyp-DAM: Polyp segmentation via depth anything model

Add code
Bookmark button
Alert button
Feb 03, 2024
Zhuoran Zheng, Chen Wu, Wei Wang, Yeying Jin, Xiuyi Jia

Viaarxiv icon

A Bandit Approach with Evolutionary Operators for Model Selection

Feb 07, 2024
Margaux Brégère, Julie Keisler

Viaarxiv icon

Universal Neural Functionals

Add code
Bookmark button
Alert button
Feb 07, 2024
Allan Zhou, Chelsea Finn, James Harrison

Viaarxiv icon

Code as Reward: Empowering Reinforcement Learning with VLMs

Feb 07, 2024
David Venuto, Sami Nur Islam, Martin Klissarov, Doina Precup, Sherry Yang, Ankit Anand

Viaarxiv icon

STAR: Shape-focused Texture Agnostic Representations for Improved Object Detection and 6D Pose Estimation

Add code
Bookmark button
Alert button
Feb 07, 2024
Peter Hönig, Stefan Thalhammer, Jean-Baptiste Weibel, Matthias Hirschmanner, Markus Vincze

Viaarxiv icon

Exploring Compressed Image Representation as a Perceptual Proxy: A Study

Jan 14, 2024
Chen-Hsiu Huang, Ja-Ling Wu

Viaarxiv icon

SAMF: Small-Area-Aware Multi-focus Image Fusion for Object Detection

Add code
Bookmark button
Alert button
Jan 16, 2024
Xilai Li, Xiaosong Li, Haishu Tan, Jinyang Li

Viaarxiv icon

More than the Sum of Its Parts: Ensembling Backbone Networks for Few-Shot Segmentation

Feb 09, 2024
Nico Catalano, Alessandro Maranelli, Agnese Chiatti, Matteo Matteucci

Viaarxiv icon