"Image": models, code, and papers

"Let's not Quote out of Context": Unified Vision-Language Pretraining for Context Assisted Image Captioning

Jun 01, 2023
Abisek Rajakumar Kalarani, Pushpak Bhattacharyya, Niyati Chhaya, Sumit Shekhar

FS-Depth: Focal-and-Scale Depth Estimation from a Single Image in Unseen Indoor Scene

Jul 27, 2023
Chengrui Wei, Meng Yang, Lei He, Nanning Zheng

Symmetry Defense Against XGBoost Adversarial Perturbation Attacks

Aug 10, 2023
Blerta Lindqvist

Leverage Weakly Annotation to Pixel-wise Annotation via Zero-shot Segment Anything Model for Molecular-empowered Learning

Aug 10, 2023
Xueyuan Li, Ruining Deng, Yucheng Tang, Shunxing Bao, Haichun Yang, Yuankai Huo

COURIER: Contrastive User Intention Reconstruction for Large-Scale Pre-Train of Image Features

Jun 08, 2023
Jia-Qi Yang, Chenglei Dai, OU Dan, Ju Huang, De-Chuan Zhan, Qingwen Liu, Xiaoyi Zeng, Yang Yang

Three-dimensional echo-shifted EPI with simultaneous blip-up and blip-down acquisitions for correcting geometric distortion

Aug 12, 2023
Kaibao Sun, Zhifeng Chen, Guangyu Dan, Qingfei Luo, Lirong Yan, Feng Liu, Xiaohong Joe Zhou

DiffPose: SpatioTemporal Diffusion Model for Video-Based Human Pose Estimation

Jul 31, 2023
Runyang Feng, Yixing Gao, Tze Ho Elden Tse, Xueqing Ma, Hyung Jin Chang

Text-Only Image Captioning with Multi-Context Data Generation

May 29, 2023
Feipeng Ma, Yizhou Zhou, Fengyun Rao, Yueyi Zhang, Xiaoyan Sun

Table and Image Generation for Investigating Knowledge of Entities in Pre-trained Vision and Language Models

Jun 03, 2023
Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

Pay Attention to the Atlas: Atlas-Guided Test-Time Adaptation Method for Robust 3D Medical Image Segmentation

Jul 02, 2023
Jingjie Guo, Weitong Zhang, Matthew Sinclair, Daniel Rueckert, Chen Chen
