Alert button

"Image": models, code, and papers
Alert button

TempBEV: Improving Learned BEV Encoders with Combined Image and BEV Space Temporal Aggregation

Apr 17, 2024
Thomas Monninger, Vandana Dokkadi, Md Zafar Anwar, Steffen Staab

Viaarxiv icon

LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?

Apr 16, 2024
Yuchi Wang, Shuhuai Ren, Rundong Gao, Linli Yao, Qingyan Guo, Kaikai An, Jianhong Bai, Xu Sun

Viaarxiv icon

GraFIQs: Face Image Quality Assessment Using Gradient Magnitudes

Apr 18, 2024
Jan Niklas Kolf, Naser Damer, Fadi Boutros

Viaarxiv icon

PATE-TripleGAN: Privacy-Preserving Image Synthesis with Gaussian Differential Privacy

Apr 19, 2024
Zepeng Jiang, Weiwei Ni, Yifan Zhang

Viaarxiv icon

AdaIR: Exploiting Underlying Similarities of Image Restoration Tasks with Adapters

Apr 17, 2024
Hao-Wei Chen, Yu-Syuan Xu, Kelvin C. K. Chan, Hsien-Kai Kuo, Chun-Yi Lee, Ming-Hsuan Yang

Viaarxiv icon

OPTiML: Dense Semantic Invariance Using Optimal Transport for Self-Supervised Medical Image Representation

Apr 18, 2024
Azad Singh, Vandan Gorade, Deepak Mishra

Viaarxiv icon

Transformer based Pluralistic Image Completion with Reduced Information Loss

Add code
Bookmark button
Alert button
Apr 15, 2024
Qiankun Liu, Yuqi Jiang, Zhentao Tan, Dongdong Chen, Ying Fu, Qi Chu, Gang Hua, Nenghai Yu

Viaarxiv icon

HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing

Apr 15, 2024
Mude Hui, Siwei Yang, Bingchen Zhao, Yichun Shi, Heng Wang, Peng Wang, Yuyin Zhou, Cihang Xie

Viaarxiv icon

Compressible and Searchable: AI-native Multi-Modal Retrieval System with Learned Image Compression

Apr 16, 2024
Jixiang Luo

Viaarxiv icon

FreqMamba: Viewing Mamba from a Frequency Perspective for Image Deraining

Apr 15, 2024
Zou Zhen, Yu Hu, Zhao Feng

Viaarxiv icon