Alert button

"Image": models, code, and papers
Alert button

Exploring Semantic Variations in GAN Latent Spaces via Matrix Factorization

May 23, 2023
Andrey Palaev, Rustam A. Lukmanov, Adil Khan

Figure 1 for Exploring Semantic Variations in GAN Latent Spaces via Matrix Factorization
Figure 2 for Exploring Semantic Variations in GAN Latent Spaces via Matrix Factorization
Figure 3 for Exploring Semantic Variations in GAN Latent Spaces via Matrix Factorization
Figure 4 for Exploring Semantic Variations in GAN Latent Spaces via Matrix Factorization
Viaarxiv icon

ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation

May 24, 2023
Dongxu Yue, Qin Guo, Munan Ning, Jiaxi Cui, Yuesheng Zhu, Li Yuan

Figure 1 for ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation
Figure 2 for ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation
Figure 3 for ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation
Figure 4 for ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation
Viaarxiv icon

DuDGAN: Improving Class-Conditional GANs via Dual-Diffusion

May 24, 2023
Taesun Yeom, Minhyeok Lee

Figure 1 for DuDGAN: Improving Class-Conditional GANs via Dual-Diffusion
Figure 2 for DuDGAN: Improving Class-Conditional GANs via Dual-Diffusion
Figure 3 for DuDGAN: Improving Class-Conditional GANs via Dual-Diffusion
Figure 4 for DuDGAN: Improving Class-Conditional GANs via Dual-Diffusion
Viaarxiv icon

Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models

Jun 08, 2023
Muhammad Maaz, Hanoona Rasheed, Salman Khan, Fahad Shahbaz Khan

Figure 1 for Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models
Figure 2 for Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models
Figure 3 for Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models
Figure 4 for Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models
Viaarxiv icon

Neighborhood Attention Makes the Encoder of ResUNet Stronger for Accurate Road Extraction

Jun 08, 2023
Ali Jamali, Swalpa Kumar Roy, Jonathan Li, Pedram Ghamisi

Figure 1 for Neighborhood Attention Makes the Encoder of ResUNet Stronger for Accurate Road Extraction
Figure 2 for Neighborhood Attention Makes the Encoder of ResUNet Stronger for Accurate Road Extraction
Figure 3 for Neighborhood Attention Makes the Encoder of ResUNet Stronger for Accurate Road Extraction
Figure 4 for Neighborhood Attention Makes the Encoder of ResUNet Stronger for Accurate Road Extraction
Viaarxiv icon

UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild

May 25, 2023
Can Qin, Shu Zhang, Ning Yu, Yihao Feng, Xinyi Yang, Yingbo Zhou, Huan Wang, Juan Carlos Niebles, Caiming Xiong, Silvio Savarese, Stefano Ermon, Yun Fu, Ran Xu

Figure 1 for UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
Figure 2 for UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
Figure 3 for UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
Figure 4 for UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild
Viaarxiv icon

LANISTR: Multimodal Learning from Structured and Unstructured Data

May 26, 2023
Sayna Ebrahimi, Sercan O. Arik, Yihe Dong, Tomas Pfister

Figure 1 for LANISTR: Multimodal Learning from Structured and Unstructured Data
Figure 2 for LANISTR: Multimodal Learning from Structured and Unstructured Data
Figure 3 for LANISTR: Multimodal Learning from Structured and Unstructured Data
Figure 4 for LANISTR: Multimodal Learning from Structured and Unstructured Data
Viaarxiv icon

Contrastive Attention Networks for Attribution of Early Modern Print

Jun 12, 2023
Nikolai Vogler, Kartik Goyal, Kishore PV Reddy, Elizaveta Pertseva, Samuel V. Lemley, Christopher N. Warren, Max G'Sell, Taylor Berg-Kirkpatrick

Figure 1 for Contrastive Attention Networks for Attribution of Early Modern Print
Figure 2 for Contrastive Attention Networks for Attribution of Early Modern Print
Figure 3 for Contrastive Attention Networks for Attribution of Early Modern Print
Figure 4 for Contrastive Attention Networks for Attribution of Early Modern Print
Viaarxiv icon

No Free Lunch: The Hazards of Over-Expressive Representations in Anomaly Detection

Jun 12, 2023
Tal Reiss, Niv Cohen, Yedid Hoshen

Figure 1 for No Free Lunch: The Hazards of Over-Expressive Representations in Anomaly Detection
Figure 2 for No Free Lunch: The Hazards of Over-Expressive Representations in Anomaly Detection
Figure 3 for No Free Lunch: The Hazards of Over-Expressive Representations in Anomaly Detection
Figure 4 for No Free Lunch: The Hazards of Over-Expressive Representations in Anomaly Detection
Viaarxiv icon

LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation

Mar 22, 2023
Koutilya Pnvr, Bharat Singh, Pallabi Ghosh, Behjat Siddiquie, David Jacobs

Figure 1 for LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
Figure 2 for LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
Figure 3 for LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
Figure 4 for LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation
Viaarxiv icon