"Image": models, code, and papers

Bootstrapping Interactive Image-Text Alignment for Remote Sensing Image Captioning

Dec 02, 2023
Cong Yang, Zuchao Li, Lefei Zhang

Generalizable vision-language pre-training for annotation-free pathology localization

Jan 04, 2024
Hao Yang, Hong-Yu Zhou, Cheng Li, Weijian Huang, Jiarun Liu, Shanshan Wang

Improving the Stability of Diffusion Models for Content Consistent Super-Resolution

Dec 30, 2023
Lingchen Sun, Rongyuan Wu, Zhengqiang Zhang, Hongwei Yong, Lei Zhang

DiffusionEdge: Diffusion Probabilistic Model for Crisp Edge Detection

Jan 09, 2024
Yunfan Ye, Kai Xu, Yuhang Huang, Renjiao Yi, Zhiping Cai

MsDC-DEQ-Net: Deep Equilibrium Model (DEQ) with Multi-scale Dilated Convolution for Image Compressive Sensing (CS)

Jan 05, 2024
Youhao Yu, Richard M. Dansereau

SpeedUpNet: A Plug-and-Play Hyper-Network for Accelerating Text-to-Image Diffusion Models

Dec 20, 2023
Weilong Chai, DanDan Zheng, Jiajiong Cao, Zhiquan Chen, Changbao Wang, Chenguang Ma

VoxelNextFusion: A Simple, Unified and Effective Voxel Fusion Framework for Multi-Modal 3D Object Detection

Jan 05, 2024
Ziying Song, Guoxin Zhang, Jun Xie, Lin Liu, Caiyan Jia, Shaoqing Xu, Zhepeng Wang

Can We Generate Realistic Hands Only Using Convolution?

Jan 03, 2024
Mehran Hosseini, Peyman Hosseini

Patch-wise Graph Contrastive Learning for Image Translation

Dec 13, 2023
Chanyong Jung, Gihyun Kwon, Jong Chul Ye

SENet: Visual Detection of Online Social Engineering Attack Campaigns

Jan 10, 2024
Irfan Ozen, Karthika Subramani, Phani Vadrevu, Roberto Perdisci
