Alert button

"Image": models, code, and papers
Alert button

Swin-Tempo: Temporal-Aware Lung Nodule Detection in CT Scans as Video Sequences Using Swin Transformer-Enhanced UNet

Oct 05, 2023
Hossein Jafari, Karim Faez, Hamidreza Amindavar

Figure 1 for Swin-Tempo: Temporal-Aware Lung Nodule Detection in CT Scans as Video Sequences Using Swin Transformer-Enhanced UNet
Figure 2 for Swin-Tempo: Temporal-Aware Lung Nodule Detection in CT Scans as Video Sequences Using Swin Transformer-Enhanced UNet
Figure 3 for Swin-Tempo: Temporal-Aware Lung Nodule Detection in CT Scans as Video Sequences Using Swin Transformer-Enhanced UNet
Figure 4 for Swin-Tempo: Temporal-Aware Lung Nodule Detection in CT Scans as Video Sequences Using Swin Transformer-Enhanced UNet
Viaarxiv icon

Improving mitosis detection on histopathology images using large vision-language models

Oct 11, 2023
Ruiwen Ding, James Hall, Neil Tenenholtz, Kristen Severson

Viaarxiv icon

Don't Look into the Sun: Adversarial Solarization Attacks on Image Classifiers

Add code
Bookmark button
Alert button
Aug 24, 2023
Paul Gavrikov, Janis Keuper

Viaarxiv icon

AGG-Net: Attention Guided Gated-convolutional Network for Depth Image Completion

Add code
Bookmark button
Alert button
Sep 04, 2023
Dongyue Chen, Tingxuan Huang, Zhimin Song, Shizhuo Deng, Tong Jia

Figure 1 for AGG-Net: Attention Guided Gated-convolutional Network for Depth Image Completion
Figure 2 for AGG-Net: Attention Guided Gated-convolutional Network for Depth Image Completion
Figure 3 for AGG-Net: Attention Guided Gated-convolutional Network for Depth Image Completion
Figure 4 for AGG-Net: Attention Guided Gated-convolutional Network for Depth Image Completion
Viaarxiv icon

Concept Bottleneck with Visual Concept Filtering for Explainable Medical Image Classification

Aug 23, 2023
Injae Kim, Jongha Kim, Joonmyung Choi, Hyunwoo J. Kim

Viaarxiv icon

Learning Actions and Control of Focus of Attention with a Log-Polar-like Sensor

Sep 22, 2023
Robin Göransson, Volker Krueger

Figure 1 for Learning Actions and Control of Focus of Attention with a Log-Polar-like Sensor
Figure 2 for Learning Actions and Control of Focus of Attention with a Log-Polar-like Sensor
Figure 3 for Learning Actions and Control of Focus of Attention with a Log-Polar-like Sensor
Figure 4 for Learning Actions and Control of Focus of Attention with a Log-Polar-like Sensor
Viaarxiv icon

Tackling VQA with Pretrained Foundation Models without Further Training

Sep 27, 2023
Alvin De Jun Tan, Bingquan Shen

Figure 1 for Tackling VQA with Pretrained Foundation Models without Further Training
Figure 2 for Tackling VQA with Pretrained Foundation Models without Further Training
Figure 3 for Tackling VQA with Pretrained Foundation Models without Further Training
Figure 4 for Tackling VQA with Pretrained Foundation Models without Further Training
Viaarxiv icon

Making LLaMA SEE and Draw with SEED Tokenizer

Add code
Bookmark button
Alert button
Oct 02, 2023
Yuying Ge, Sijie Zhao, Ziyun Zeng, Yixiao Ge, Chen Li, Xintao Wang, Ying Shan

Viaarxiv icon

I2SRM: Intra- and Inter-Sample Relationship Modeling for Multimodal Information Extraction

Add code
Bookmark button
Alert button
Oct 10, 2023
Yusheng Huang, Zhouhan Lin

Figure 1 for I2SRM: Intra- and Inter-Sample Relationship Modeling for Multimodal Information Extraction
Figure 2 for I2SRM: Intra- and Inter-Sample Relationship Modeling for Multimodal Information Extraction
Figure 3 for I2SRM: Intra- and Inter-Sample Relationship Modeling for Multimodal Information Extraction
Figure 4 for I2SRM: Intra- and Inter-Sample Relationship Modeling for Multimodal Information Extraction
Viaarxiv icon

Topological RANSAC for instance verification and retrieval without fine-tuning

Oct 10, 2023
Guoyuan An, Juhyung Seon, Inkyu An, Yuchi Huo, Sung-Eui Yoon

Viaarxiv icon