Alert button

"Image": models, code, and papers
Alert button

Multi-scale Diffusion Denoised Smoothing

Oct 26, 2023
Jongheon Jeong, Jinwoo Shin

Viaarxiv icon

DiffS2UT: A Semantic Preserving Diffusion Model for Textless Direct Speech-to-Speech Translation

Oct 26, 2023
Yongxin Zhu, Zhujin Gao, Xinyuan Zhou, Zhongyi Ye, Linli Xu

Viaarxiv icon

Sign Languague Recognition without frame-sequencing constraints: A proof of concept on the Argentinian Sign Language

Oct 26, 2023
Franco Ronchetti, Facundo Manuel Quiroga, César Estrebou, Laura Lanzarini, Alejandro Rosete

Viaarxiv icon

JM3D & JM3D-LLM: Elevating 3D Representation with Joint Multi-modal Cues

Oct 20, 2023
Jiayi Ji, Haowei Wang, Changli Wu, Yiwei Ma, Xiaoshuai Sun, Rongrong Ji

Viaarxiv icon

Reconstruction of Patient-Specific Confounders in AI-based Radiologic Image Interpretation using Generative Pretraining

Sep 29, 2023
Tianyu Han, Laura Žigutytė, Luisa Huck, Marc Huppertz, Robert Siepmann, Yossi Gandelsman, Christian Blüthgen, Firas Khader, Christiane Kuhl, Sven Nebelung, Jakob Kather, Daniel Truhn

Figure 1 for Reconstruction of Patient-Specific Confounders in AI-based Radiologic Image Interpretation using Generative Pretraining
Figure 2 for Reconstruction of Patient-Specific Confounders in AI-based Radiologic Image Interpretation using Generative Pretraining
Figure 3 for Reconstruction of Patient-Specific Confounders in AI-based Radiologic Image Interpretation using Generative Pretraining
Figure 4 for Reconstruction of Patient-Specific Confounders in AI-based Radiologic Image Interpretation using Generative Pretraining
Viaarxiv icon

Disentangled Information Bottleneck guided Privacy-Protective JSCC for Image Transmission

Sep 19, 2023
Lunan Sun, Yang Yang, Mingzhe Chen, Caili Guo

Figure 1 for Disentangled Information Bottleneck guided Privacy-Protective JSCC for Image Transmission
Figure 2 for Disentangled Information Bottleneck guided Privacy-Protective JSCC for Image Transmission
Figure 3 for Disentangled Information Bottleneck guided Privacy-Protective JSCC for Image Transmission
Figure 4 for Disentangled Information Bottleneck guided Privacy-Protective JSCC for Image Transmission
Viaarxiv icon

MAD: Modality Agnostic Distance Measure for Image Registration

Sep 06, 2023
Vasiliki Sideri-Lampretsa, Veronika A. Zimmer, Huaqi Qiu, Georgios Kaissis, Daniel Rueckert

Figure 1 for MAD: Modality Agnostic Distance Measure for Image Registration
Figure 2 for MAD: Modality Agnostic Distance Measure for Image Registration
Figure 3 for MAD: Modality Agnostic Distance Measure for Image Registration
Figure 4 for MAD: Modality Agnostic Distance Measure for Image Registration
Viaarxiv icon

Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

Oct 17, 2023
Jianwei Yang, Hao Zhang, Feng Li, Xueyan Zou, Chunyuan Li, Jianfeng Gao

Figure 1 for Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V
Figure 2 for Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V
Figure 3 for Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V
Figure 4 for Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V
Viaarxiv icon

UWB Based Static Gesture Classification

Oct 23, 2023
Abhishek Sebastian

Figure 1 for UWB Based Static Gesture Classification
Figure 2 for UWB Based Static Gesture Classification
Figure 3 for UWB Based Static Gesture Classification
Figure 4 for UWB Based Static Gesture Classification
Viaarxiv icon

Hyper-Skin: A Hyperspectral Dataset for Reconstructing Facial Skin-Spectra from RGB Images

Oct 27, 2023
Pai Chet Ng, Zhixiang Chi, Yannick Verdie, Juwei Lu, Konstantinos N. Plataniotis

Viaarxiv icon