Alert button

"Image": models, code, and papers
Alert button

Touch-GS: Visual-Tactile Supervised 3D Gaussian Splatting

Mar 14, 2024
Aiden Swann, Matthew Strong, Won Kyung Do, Gadiel Sznaier Camps, Mac Schwager, Monroe Kennedy III

Viaarxiv icon

BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects

Mar 14, 2024
Tomas Hodan, Martin Sundermeyer, Yann Labbe, Van Nguyen Nguyen, Gu Wang, Eric Brachmann, Bertram Drost, Vincent Lepetit, Carsten Rother, Jiri Matas

Viaarxiv icon

Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts

Mar 14, 2024
Byeongjun Park, Hyojun Go, Jin-Young Kim, Sangmin Woo, Seokil Ham, Changick Kim

Viaarxiv icon

VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework

Mar 14, 2024
Chris Kelly, Luhui Hu, Bang Yang, Yu Tian, Deshun Yang, Cindy Yang, Zaoshan Huang, Zihao Li, Jiayin Hu, Yuexian Zou

Viaarxiv icon

DF4LCZ: A SAM-Empowered Data Fusion Framework for Scene-Level Local Climate Zone Classification

Mar 14, 2024
Qianqian Wu, Xianping Ma, Jialu Sui, Man-On Pun

Viaarxiv icon

The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?

Mar 14, 2024
Qinyu Zhao, Ming Xu, Kartik Gupta, Akshay Asthana, Liang Zheng, Stephen Gould

Viaarxiv icon

Customizing Segmentation Foundation Model via Prompt Learning for Instance Segmentation

Mar 14, 2024
Hyung-Il Kim, Kimin Yun, Jun-Seok Yun, Yuseok Bae

Viaarxiv icon

GLFNET: Global-Local (frequency) Filter Networks for efficient medical image segmentation

Mar 01, 2024
Athanasios Tragakis, Qianying Liu, Chaitanya Kaul, Swalpa Kumar Roy, Hang Dai, Fani Deligianni, Roderick Murray-Smith, Daniele Faccio

Figure 1 for GLFNET: Global-Local (frequency) Filter Networks for efficient medical image segmentation
Figure 2 for GLFNET: Global-Local (frequency) Filter Networks for efficient medical image segmentation
Figure 3 for GLFNET: Global-Local (frequency) Filter Networks for efficient medical image segmentation
Figure 4 for GLFNET: Global-Local (frequency) Filter Networks for efficient medical image segmentation
Viaarxiv icon

A Decade's Battle on Dataset Bias: Are We There Yet?

Mar 13, 2024
Zhuang Liu, Kaiming He

Viaarxiv icon

WDM: 3D Wavelet Diffusion Models for High-Resolution Medical Image Synthesis

Feb 29, 2024
Paul Friedrich, Julia Wolleb, Florentin Bieder, Alicia Durrer, Philippe C. Cattin

Viaarxiv icon