"Image": models, code, and papers

An Efficient Ensemble Explainable AI (XAI) Approach for Morphed Face Detection

Apr 23, 2023
Rudresh Dwivedi, Ritesh Kumar, Deepak Chopra, Pranay Kothari, Manjot Singh

What does CLIP know about a red circle? Visual prompt engineering for VLMs

Apr 13, 2023
Aleksandar Shtedritski, Christian Rupprecht, Andrea Vedaldi

CECT: Controllable Ensemble CNN and Transformer for COVID-19 image classification by capturing both local and global image features

Feb 05, 2023
Zhaoshan Liu, Lei Shen

Accuracy Improvement of Object Detection in VVC Coded Video Using YOLO-v7 Features

Apr 03, 2023
Takahiro Shindo, Taiju Watanabe, Kein Yamada, Hiroshi Watanabe

Learning Monocular Depth in Dynamic Environment via Context-aware Temporal Attention

May 12, 2023
Zizhang Wu, Zhuozheng Li, Zhi-Gang Fan, Yunzhe Wu, Yuanzhu Gan, Jian Pu, Xianzhi Li

Ship-D: Ship Hull Dataset for Design Optimization using Machine Learning

May 16, 2023
Noah J. Bagazinski, Faez Ahmed

Image-and-Language Understanding from Pixels Only

Dec 15, 2022
Michael Tschannen, Basil Mustafa, Neil Houlsby

Are Multimodal Models Robust to Image and Text Perturbations?

Dec 15, 2022
Jielin Qiu, Yi Zhu, Xingjian Shi, Florian Wenzel, Zhiqiang Tang, Ding Zhao, Bo Li, Mu Li

Improving Diffusion Models for Scene Text Editing with Dual Encoders

Apr 12, 2023
Jiabao Ji, Guanhua Zhang, Zhaowen Wang, Bairu Hou, Zhifei Zhang, Brian Price, Shiyu Chang

ShadowFormer: Global Context Helps Image Shadow Removal

Feb 03, 2023
Lanqing Guo, Siyu Huang, Ding Liu, Hao Cheng, Bihan Wen
