"Image": models, code, and papers

Augmenting CLIP with Improved Visio-Linguistic Reasoning

Jul 18, 2023
Samyadeep Basu, Maziar Sanjabi, Daniela Massiceti, Shell Xu Hu, Soheil Feizi

Via arXiv

Language-based Action Concept Spaces Improve Video Self-Supervised Learning

Jul 20, 2023
Kanchana Ranasinghe, Michael Ryoo

Via arXiv

DualAttNet: Synergistic Fusion of Image-level and Fine-Grained Disease Attention for Multi-Label Lesion Detection in Chest X-rays

Jun 23, 2023
Qing Xu, Wenting Duan

Via arXiv

Adversarial Latent Autoencoder with Self-Attention for Structural Image Synthesis

Jul 19, 2023
Jiajie Fan, Laure Vuaille, Hao Wang, Thomas Bäck

Via arXiv

ViCo: Detail-Preserving Visual Condition for Personalized Text-to-Image Generation

Jun 01, 2023
Shaozhe Hao, Kai Han, Shihao Zhao, Kwan-Yee K. Wong

Via arXiv

SAS Video-QA: Self-Adaptive Sampling for Efficient Video Question-Answering

Aug 01, 2023
Wei Han, Hui Chen, Min-Yen Kan, Soujanya Poria

Via arXiv

FS-Depth: Focal-and-Scale Depth Estimation from a Single Image in Unseen Indoor Scene

Jul 27, 2023
Chengrui Wei, Meng Yang, Lei He, Nanning Zheng

Via arXiv

WaveDM: Wavelet-Based Diffusion Models for Image Restoration

May 23, 2023
Yi Huang, Jiancheng Huang, Jianzhuang Liu, Yu Dong, Jiaxi Lv, Shifeng Chen

Via arXiv

DeDrift: Robust Similarity Search under Content Drift

Aug 05, 2023
Dmitry Baranchuk, Matthijs Douze, Yash Upadhyay, I. Zeki Yalniz

Via arXiv

MLF-DET: Multi-Level Fusion for Cross-Modal 3D Object Detection

Jul 18, 2023
Zewei Lin, Yanqing Shen, Sanping Zhou, Shitao Chen, Nanning Zheng

Via arXiv