"Image": models, code, and papers

Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models

Apr 11, 2024
Haotian Zhang, Haoxuan You, Philipp Dufter, Bowen Zhang, Chen Chen, Hong-You Chen, Tsu-Jui Fu, William Yang Wang, Shih-Fu Chang, Zhe Gan, Yinfei Yang

WaveMo: Learning Wavefront Modulations to See Through Scattering

Apr 11, 2024
Mingyang Xie, Haiyun Guo, Brandon Y. Feng, Lingbo Jin, Ashok Veeraraghavan, Christopher A. Metzler

Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs

Apr 11, 2024
Kanchana Ranasinghe, Satya Narayan Shukla, Omid Poursaeed, Michael S. Ryoo, Tsung-Yu Lin

Clean-image Backdoor Attacks

Mar 26, 2024
Dazhong Rong, Guoyao Yu, Shuheng Shen, Xinyi Fu, Peng Qian, Jianhai Chen, Qinming He, Xing Fu, Weiqiang Wang

Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration

Mar 30, 2024
Shihao Zhou, Jinshan Pan, Jinglei Shi, Duosheng Chen, Lishen Qu, Jufeng Yang

Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation

Mar 28, 2024
Zhongliang Zhou, Jielu Zhang, Zihan Guan, Mengxuan Hu, Ni Lao, Lan Mu, Sheng Li, Gengchen Mai

A Mutual Inclusion Mechanism for Precise Boundary Segmentation in Medical Images

Apr 12, 2024
Yizhi Pan, Junyi Xin, Tianhua Yang, Teeradaj Racharak, Le-Minh Nguyen, Guanqun Sun

DreamScape: 3D Scene Creation via Gaussian Splatting joint Correlation Modeling

Apr 14, 2024
Xuening Yuan, Hongyu Yang, Yueming Zhao, Di Huang

In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and Action Recognition

Apr 14, 2024
Wiktor Mucha, Martin Kampel

Two-Phase Multi-Dose-Level PET Image Reconstruction with Dose Level Awareness

Apr 10, 2024
Yuchen Fei, Yanmei Luo, Yan Wang, Jiaqi Cui, Yuanyuan Xu, Jiliu Zhou, Dinggang Shen
