Alert button

"Image": models, code, and papers
Alert button

Multimodal Neurons in Pretrained Text-Only Transformers

Add code
Bookmark button
Alert button
Aug 03, 2023
Sarah Schwettmann, Neil Chowdhury, Antonio Torralba

Viaarxiv icon

Scoring Cycling Environments Perceived Safety using Pairwise Image Comparisons

Add code
Bookmark button
Alert button
Jul 31, 2023
Miguel Costa, Manuel Marques, Felix Wilhelm Siebert, Carlos Lima Azevedo, Filipe Moura

Figure 1 for Scoring Cycling Environments Perceived Safety using Pairwise Image Comparisons
Figure 2 for Scoring Cycling Environments Perceived Safety using Pairwise Image Comparisons
Figure 3 for Scoring Cycling Environments Perceived Safety using Pairwise Image Comparisons
Figure 4 for Scoring Cycling Environments Perceived Safety using Pairwise Image Comparisons
Viaarxiv icon

MDViT: Multi-domain Vision Transformer for Small Medical Image Segmentation Datasets

Add code
Bookmark button
Alert button
Jul 05, 2023
Siyi Du, Nourhan Bayasi, Ghassan Harmarneh, Rafeef Garbi

Figure 1 for MDViT: Multi-domain Vision Transformer for Small Medical Image Segmentation Datasets
Figure 2 for MDViT: Multi-domain Vision Transformer for Small Medical Image Segmentation Datasets
Figure 3 for MDViT: Multi-domain Vision Transformer for Small Medical Image Segmentation Datasets
Figure 4 for MDViT: Multi-domain Vision Transformer for Small Medical Image Segmentation Datasets
Viaarxiv icon

Three Towers: Flexible Contrastive Learning with Pretrained Image Models

Add code
Bookmark button
Alert button
May 29, 2023
Jannik Kossen, Mark Collier, Basil Mustafa, Xiao Wang, Xiaohua Zhai, Lucas Beyer, Andreas Steiner, Jesse Berent, Rodolphe Jenatton, Efi Kokiopoulou

Figure 1 for Three Towers: Flexible Contrastive Learning with Pretrained Image Models
Figure 2 for Three Towers: Flexible Contrastive Learning with Pretrained Image Models
Figure 3 for Three Towers: Flexible Contrastive Learning with Pretrained Image Models
Figure 4 for Three Towers: Flexible Contrastive Learning with Pretrained Image Models
Viaarxiv icon

AudioVMAF: Audio Quality Prediction with VMAF

Aug 07, 2023
Arijit Biswas, Harald Mundt

Figure 1 for AudioVMAF: Audio Quality Prediction with VMAF
Figure 2 for AudioVMAF: Audio Quality Prediction with VMAF
Figure 3 for AudioVMAF: Audio Quality Prediction with VMAF
Viaarxiv icon

Interaction-aware Joint Attention Estimation Using People Attributes

Add code
Bookmark button
Alert button
Aug 10, 2023
Chihiro Nakatani, Hiroaki Kawashima, Norimichi Ukita

Figure 1 for Interaction-aware Joint Attention Estimation Using People Attributes
Figure 2 for Interaction-aware Joint Attention Estimation Using People Attributes
Figure 3 for Interaction-aware Joint Attention Estimation Using People Attributes
Figure 4 for Interaction-aware Joint Attention Estimation Using People Attributes
Viaarxiv icon

Transforming Breast Cancer Diagnosis: Towards Real-Time Ultrasound to Mammogram Conversion for Cost-Effective Diagnosis

Aug 10, 2023
Sahar Almahfouz Nasser, Ashutosh Sharma, Anmol Saraf, Amruta Mahendra Parulekar, Purvi Haria, Amit Sethi

Figure 1 for Transforming Breast Cancer Diagnosis: Towards Real-Time Ultrasound to Mammogram Conversion for Cost-Effective Diagnosis
Figure 2 for Transforming Breast Cancer Diagnosis: Towards Real-Time Ultrasound to Mammogram Conversion for Cost-Effective Diagnosis
Figure 3 for Transforming Breast Cancer Diagnosis: Towards Real-Time Ultrasound to Mammogram Conversion for Cost-Effective Diagnosis
Figure 4 for Transforming Breast Cancer Diagnosis: Towards Real-Time Ultrasound to Mammogram Conversion for Cost-Effective Diagnosis
Viaarxiv icon

TrOMR:Transformer-Based Polyphonic Optical Music Recognition

Add code
Bookmark button
Alert button
Aug 18, 2023
Yixuan Li, Huaping Liu, Qiang Jin, Miaomiao Cai, Peng Li

Viaarxiv icon

Nested Diffusion Processes for Anytime Image Generation

Add code
Bookmark button
Alert button
May 30, 2023
Noam Elata, Bahjat Kawar, Tomer Michaeli, Michael Elad

Figure 1 for Nested Diffusion Processes for Anytime Image Generation
Figure 2 for Nested Diffusion Processes for Anytime Image Generation
Figure 3 for Nested Diffusion Processes for Anytime Image Generation
Figure 4 for Nested Diffusion Processes for Anytime Image Generation
Viaarxiv icon

1M parameters are enough? A lightweight CNN-based model for medical image segmentation

Add code
Bookmark button
Alert button
Jun 28, 2023
Binh-Duong Dinh, Thanh-Thu Nguyen, Thi-Thao Tran, Van-Truong Pham

Figure 1 for 1M parameters are enough? A lightweight CNN-based model for medical image segmentation
Figure 2 for 1M parameters are enough? A lightweight CNN-based model for medical image segmentation
Figure 3 for 1M parameters are enough? A lightweight CNN-based model for medical image segmentation
Figure 4 for 1M parameters are enough? A lightweight CNN-based model for medical image segmentation
Viaarxiv icon