
"Image": models, code, and papers

ELITE: Encoding Visual Concepts into Textual Embeddings for Customized Text-to-Image Generation

Feb 27, 2023
Yuxiang Wei, Yabo Zhang, Zhilong Ji, Jinfeng Bai, Lei Zhang, Wangmeng Zuo

Via arXiv

Design Booster: A Text-Guided Diffusion Model for Image Translation with Spatial Layout Preservation

Feb 05, 2023
Shiqi Sun, Shancheng Fang, Qian He, Wei Liu

Via arXiv

CPNet: Exploiting CLIP-based Attention Condenser and Probability Map Guidance for High-fidelity Talking Face Generation

May 23, 2023
Jingning Xu, Benlai Tang, Mingjie Wang, Minghao Li, Meirong Ma

Via arXiv

Enhancing Accuracy and Robustness through Adversarial Training in Class Incremental Continual Learning

May 23, 2023
Minchan Kwon, Kangil Kim

Via arXiv

Multi-BVOC Super-Resolution Exploiting Compounds Inter-Connection

May 23, 2023
Antonio Giganti, Sara Mandelli, Paolo Bestagini, Marco Marcon, Stefano Tubaro

Via arXiv

Learning Human-Human Interactions in Images from Weak Textual Supervision

Apr 27, 2023
Morris Alper, Hadar Averbuch-Elor

Via arXiv

Graph Neural Network for Accurate and Low-complexity SAR ATR

May 11, 2023
Bingyi Zhang, Sasindu Wijeratne, Rajgopal Kannan, Viktor Prasanna, Carl Busart

Via arXiv

What can generic neural networks learn from a child's visual experience?

May 24, 2023
A. Emin Orhan, Brenden M. Lake

Via arXiv

Album Storytelling with Iterative Story-aware Captioning and Large Language Models

May 24, 2023
Munan Ning, Yujia Xie, Dongdong Chen, Zeyin Song, Lu Yuan, Yonghong Tian, Qixiang Ye, Li Yuan

Via arXiv

When ChatGPT for Computer Vision Will Come? From 2D to 3D

May 10, 2023
Chenghao Li, Chaoning Zhang

Via arXiv