"Image": models, code, and papers

Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases

Dec 22, 2023
Zhangyang Qi, Ye Fang, Mengchen Zhang, Zeyi Sun, Tong Wu, Ziwei Liu, Dahua Lin, Jiaqi Wang, Hengshuang Zhao

Via arXiv

DUSt3R: Geometric 3D Vision Made Easy

Dec 21, 2023
Shuzhe Wang, Vincent Leroy, Yohann Cabon, Boris Chidlovskii, Jerome Revaud

Via arXiv

LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding

Dec 21, 2023
Senqiao Yang, Jiaming Liu, Ray Zhang, Mingjie Pan, Zoey Guo, Xiaoqi Li, Zehui Chen, Peng Gao, Yandong Guo, Shanghang Zhang

Via arXiv

Hundred-Kilobyte Lookup Tables for Efficient Single-Image Super-Resolution

Dec 11, 2023
Binxiao Huang, Jason Chun Lok Li, Jie Ran, Boyu Li, Jiajun Zhou, Dahai Yu, Ngai Wong

Via arXiv

Disentangling the Effects of Data Augmentation and Format Transform in Self-Supervised Learning of Image Representations

Dec 02, 2023
Neha Kalibhat, Warren Morningstar, Alex Bijamov, Luyang Liu, Karan Singhal, Philip Mansfield

Via arXiv

Score-based diffusion priors for multi-target detection

Dec 13, 2023
Alon Zabatani, Shay Kreymer, Tamir Bendory

Via arXiv

Implicit Affordance Acquisition via Causal Action-Effect Modeling in the Video Domain

Dec 18, 2023
Hsiu-Yu Yang, Carina Silberer

Via arXiv

Breaking Temporal Consistency: Generating Video Universal Adversarial Perturbations Using Image Models

Nov 17, 2023
Hee-Seon Kim, Minji Son, Minbeom Kim, Myung-Joon Kwon, Changick Kim

Via arXiv

ZeroRF: Fast Sparse View 360° Reconstruction with Zero Pretraining

Dec 14, 2023
Ruoxi Shi, Xinyue Wei, Cheng Wang, Hao Su

Via arXiv

Emu Edit: Precise Image Editing via Recognition and Generation Tasks

Nov 16, 2023
Shelly Sheynin, Adam Polyak, Uriel Singer, Yuval Kirstain, Amit Zohar, Oron Ashual, Devi Parikh, Yaniv Taigman

Via arXiv