Alert button

"Image": models, code, and papers
Alert button

Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators

Mar 23, 2023
Levon Khachatryan, Andranik Movsisyan, Vahram Tadevosyan, Roberto Henschel, Zhangyang Wang, Shant Navasardyan, Humphrey Shi

Figure 1 for Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
Figure 2 for Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
Figure 3 for Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
Figure 4 for Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators
Viaarxiv icon

Vision Conformer: Incorporating Convolutions into Vision Transformer Layers

Apr 27, 2023
Brian Kenji Iwana, Akihiro Kusuda

Figure 1 for Vision Conformer: Incorporating Convolutions into Vision Transformer Layers
Figure 2 for Vision Conformer: Incorporating Convolutions into Vision Transformer Layers
Figure 3 for Vision Conformer: Incorporating Convolutions into Vision Transformer Layers
Figure 4 for Vision Conformer: Incorporating Convolutions into Vision Transformer Layers
Viaarxiv icon

Spatial and Modal Optimal Transport for Fast Cross-Modal MRI Reconstruction

May 04, 2023
Qi Wang, Zhijie Wen, Jun Shi, Qian Wang, Dinggang Shen, Shihui Ying

Figure 1 for Spatial and Modal Optimal Transport for Fast Cross-Modal MRI Reconstruction
Figure 2 for Spatial and Modal Optimal Transport for Fast Cross-Modal MRI Reconstruction
Figure 3 for Spatial and Modal Optimal Transport for Fast Cross-Modal MRI Reconstruction
Figure 4 for Spatial and Modal Optimal Transport for Fast Cross-Modal MRI Reconstruction
Viaarxiv icon

ReSup: Reliable Label Noise Suppression for Facial Expression Recognition

May 29, 2023
Xiang Zhang, Yan Lu, Huan Yan, Jingyang Huang, Yusheng Ji, Yu Gu

Figure 1 for ReSup: Reliable Label Noise Suppression for Facial Expression Recognition
Figure 2 for ReSup: Reliable Label Noise Suppression for Facial Expression Recognition
Figure 3 for ReSup: Reliable Label Noise Suppression for Facial Expression Recognition
Figure 4 for ReSup: Reliable Label Noise Suppression for Facial Expression Recognition
Viaarxiv icon

Towards Arbitrary Text-driven Image Manipulation via Space Alignment

Jan 25, 2023
Yunpeng Bai, Zihan Zhong, Chao Dong, Weichen Zhang, Guowei Xu, Chun Yuan

Figure 1 for Towards Arbitrary Text-driven Image Manipulation via Space Alignment
Figure 2 for Towards Arbitrary Text-driven Image Manipulation via Space Alignment
Figure 3 for Towards Arbitrary Text-driven Image Manipulation via Space Alignment
Figure 4 for Towards Arbitrary Text-driven Image Manipulation via Space Alignment
Viaarxiv icon

Evaluation of Confidence-based Ensembling in Deep Learning Image Classification

Mar 03, 2023
Rafael Rosales, Peter Popov, Michael Paulitsch

Figure 1 for Evaluation of Confidence-based Ensembling in Deep Learning Image Classification
Figure 2 for Evaluation of Confidence-based Ensembling in Deep Learning Image Classification
Figure 3 for Evaluation of Confidence-based Ensembling in Deep Learning Image Classification
Figure 4 for Evaluation of Confidence-based Ensembling in Deep Learning Image Classification
Viaarxiv icon

CIFAKE: Image Classification and Explainable Identification of AI-Generated Synthetic Images

Mar 24, 2023
Jordan J. Bird, Ahmad Lotfi

Figure 1 for CIFAKE: Image Classification and Explainable Identification of AI-Generated Synthetic Images
Figure 2 for CIFAKE: Image Classification and Explainable Identification of AI-Generated Synthetic Images
Figure 3 for CIFAKE: Image Classification and Explainable Identification of AI-Generated Synthetic Images
Figure 4 for CIFAKE: Image Classification and Explainable Identification of AI-Generated Synthetic Images
Viaarxiv icon

A request for clarity over the End of Sequence token in the Self-Critical Sequence Training

May 20, 2023
Jia Cheng Hu, Roberto Cavicchioli, Alessandro Capotondi

Figure 1 for A request for clarity over the End of Sequence token in the Self-Critical Sequence Training
Figure 2 for A request for clarity over the End of Sequence token in the Self-Critical Sequence Training
Figure 3 for A request for clarity over the End of Sequence token in the Self-Critical Sequence Training
Figure 4 for A request for clarity over the End of Sequence token in the Self-Critical Sequence Training
Viaarxiv icon

Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning

Feb 08, 2023
Mozhgan Pourkeshavarz, Shahabedin Nabavi, Mohsen Ebrahimi Moghaddam, Mehrnoush Shamsfard

Figure 1 for Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning
Figure 2 for Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning
Figure 3 for Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning
Figure 4 for Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning
Viaarxiv icon

Attributing Image Generative Models using Latent Fingerprints

Apr 17, 2023
Guangyu Nie, Changhoon Kim, Yezhou Yang, Yi Ren

Figure 1 for Attributing Image Generative Models using Latent Fingerprints
Figure 2 for Attributing Image Generative Models using Latent Fingerprints
Figure 3 for Attributing Image Generative Models using Latent Fingerprints
Figure 4 for Attributing Image Generative Models using Latent Fingerprints
Viaarxiv icon