Alert button

"Image": models, code, and papers
Alert button

BEiT: BERT Pre-Training of Image Transformers

Add code
Bookmark button
Alert button
Jun 15, 2021
Hangbo Bao, Li Dong, Furu Wei

Figure 1 for BEiT: BERT Pre-Training of Image Transformers
Figure 2 for BEiT: BERT Pre-Training of Image Transformers
Figure 3 for BEiT: BERT Pre-Training of Image Transformers
Figure 4 for BEiT: BERT Pre-Training of Image Transformers
Viaarxiv icon

Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens

Add code
Bookmark button
Alert button
Jun 15, 2022
Elad Ben-Avraham, Roei Herzig, Karttikeya Mangalam, Amir Bar, Anna Rohrbach, Leonid Karlinsky, Trevor Darrell, Amir Globerson

Figure 1 for Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens
Figure 2 for Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens
Figure 3 for Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens
Figure 4 for Bringing Image Scene Structure to Video via Frame-Clip Consistency of Object Tokens
Viaarxiv icon

Towards Lightweight Super-Resolution with Dual Regression Learning

Add code
Bookmark button
Alert button
Jul 21, 2022
Yong Guo, Jingdong Wang, Qi Chen, Jiezhang Cao, Zeshuai Deng, Yanwu Xu, Jian Chen, Mingkui Tan

Figure 1 for Towards Lightweight Super-Resolution with Dual Regression Learning
Figure 2 for Towards Lightweight Super-Resolution with Dual Regression Learning
Figure 3 for Towards Lightweight Super-Resolution with Dual Regression Learning
Figure 4 for Towards Lightweight Super-Resolution with Dual Regression Learning
Viaarxiv icon

Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task

Add code
Bookmark button
Alert button
Aug 24, 2022
Stan Weixian Lei, Difei Gao, Jay Zhangjie Wu, Yuxuan Wang, Wei Liu, Mengmi Zhang, Mike Zheng Shou

Figure 1 for Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task
Figure 2 for Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task
Figure 3 for Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task
Figure 4 for Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task
Viaarxiv icon

PEDENet: Image Anomaly Localization via Patch Embedding and Density Estimation

Oct 29, 2021
Kaitai Zhang, Bin Wang, C. -C. Jay Kuo

Figure 1 for PEDENet: Image Anomaly Localization via Patch Embedding and Density Estimation
Figure 2 for PEDENet: Image Anomaly Localization via Patch Embedding and Density Estimation
Figure 3 for PEDENet: Image Anomaly Localization via Patch Embedding and Density Estimation
Figure 4 for PEDENet: Image Anomaly Localization via Patch Embedding and Density Estimation
Viaarxiv icon

AFTer-UNet: Axial Fusion Transformer UNet for Medical Image Segmentation

Oct 20, 2021
Xiangyi Yan, Hao Tang, Shanlin Sun, Haoyu Ma, Deying Kong, Xiaohui Xie

Figure 1 for AFTer-UNet: Axial Fusion Transformer UNet for Medical Image Segmentation
Figure 2 for AFTer-UNet: Axial Fusion Transformer UNet for Medical Image Segmentation
Figure 3 for AFTer-UNet: Axial Fusion Transformer UNet for Medical Image Segmentation
Figure 4 for AFTer-UNet: Axial Fusion Transformer UNet for Medical Image Segmentation
Viaarxiv icon

Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model

Add code
Bookmark button
Alert button
Nov 26, 2021
Zipeng Xu, Tianwei Lin, Hao Tang, Fu Li, Dongliang He, Nicu Sebe, Radu Timofte, Luc Van Gool, Errui Ding

Figure 1 for Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model
Figure 2 for Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model
Figure 3 for Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model
Figure 4 for Predict, Prevent, and Evaluate: Disentangled Text-Driven Image Manipulation Empowered by Pre-Trained Vision-Language Model
Viaarxiv icon

Where are my Neighbors? Exploiting Patches Relations in Self-Supervised Vision Transformer

Add code
Bookmark button
Alert button
Jun 01, 2022
Guglielmo Camporese, Elena Izzo, Lamberto Ballan

Figure 1 for Where are my Neighbors? Exploiting Patches Relations in Self-Supervised Vision Transformer
Figure 2 for Where are my Neighbors? Exploiting Patches Relations in Self-Supervised Vision Transformer
Figure 3 for Where are my Neighbors? Exploiting Patches Relations in Self-Supervised Vision Transformer
Figure 4 for Where are my Neighbors? Exploiting Patches Relations in Self-Supervised Vision Transformer
Viaarxiv icon

Robust and Decomposable Average Precision for Image Retrieval

Add code
Bookmark button
Alert button
Oct 01, 2021
Elias Ramzi, Nicolas Thome, Clément Rambour, Nicolas Audebert, Xavier Bitot

Figure 1 for Robust and Decomposable Average Precision for Image Retrieval
Figure 2 for Robust and Decomposable Average Precision for Image Retrieval
Figure 3 for Robust and Decomposable Average Precision for Image Retrieval
Figure 4 for Robust and Decomposable Average Precision for Image Retrieval
Viaarxiv icon

On the Study of Sample Complexity for Polynomial Neural Networks

Jul 18, 2022
Chao Pan, Chuanyi Zhang

Figure 1 for On the Study of Sample Complexity for Polynomial Neural Networks
Viaarxiv icon